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GROUPS OF CREMONA TRANSFORMATIONS IN SPACE OF 
PLANAR TYPE 


By Artuur B. CoBLe 


1. Introduction. We shall say that a group G of space Cremona transforma- 
tions is of planar type if it possesses the distinguishing characteristics of the 
entire group of Cremona transformations in the plane. The characteristics 
which we shall stress are the following: 

(a) G has an infinite continuous set of generators all of the same type. 

In the plane this set is the set of quadratic transformations with distinct F- 

points. 
(8) A particular element of G is defined by the choice of certain discontinuous 
parameters, positive or zero integers, which fix the type of the element (i.e., the 
nature of its F-system), and of certain continuous parameters which fix the 
position of its F-system. 

This requirement rules out the group of inversions in space which has only 
two types of elements, namely: the collineation, and the quadratic transformation 
with a simple F-point and with a conic as an F-curve of the first kind. 

(y) Associated with G there is a group g of linear transformations on an unre- 
stricted number of variables with integer coefficients. Each element of g defines 
a type of element in G. The product of two elements of G has a type defined 
by the product of the corresponding elements in g. 

(6) The linear group g of types in G has a linear and a quadratic invariant. 

The number of groups of the type indicated which have thus far been exhibited 
is quite limited. In each space S, (n = 2) there is the group of “regular Cremona 
transformations”,' which has interesting applications. These transformations 
have been called “‘punctual” by Miss Hudson.? 

In S; there is a group whose generators are the cubic transformations which 
have a degenerate sextic F-curve of the first kind made up of a space cubic 
curve, fixed for the entire group, and of three variable bisecants of this curve. 
Montesano’ has shown that in this group the types are isomorphic with the 
ternary types. 

Snyder‘ reports a somewhat more special type of cubic transformation whose 


Received December 20, 1935. 

1A. B. Coble, Point sets and allied Cremona groups II, Transactions of the American 
Mathematical Society, vol. 17 (1916), pp. 345-385. 

? Hilda P. Hudson, Cremona Transformations in Plane and Space, Cambridge University 
Press, 1927. 

3D. Montesano, Su alcuni tipi di corrispondenze cremoniane spaziali collegati alle cor- 
rispondenze birazionali piane di ordine n, Napoli Rendiconti, (3), vol. 27 (1921), pp. 164-175. 

4V. Snyder, Some recent contributions to algebraic geometry, Bulletin of the American 
Mathematical Society, vol. 40 (1934), pp. 673-687. 
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sextic F-curve breaks up into four skew lines and their two transversals. With 
the two transversals fixed (for convenience) this type serves as a generator of a 
group of ternary type. 

In this paper we develop a novel group G of space transformations whose 
elements have, in addition to isolated F-points, a fixed F-curve C of the first 
kind which is a generic space sextic of genus four. The elements are therefore 
definitely not products of cubic transformations. The generating type of ele- 
ment for this group is obtained in §2. It is a de Jonquiéres or monoidal involu- 
tion of order four with one isolated F-point. This generating involution is 
listed by Sharpe and Snyder [5§5, §10] with only the briefest mention. Since it 
is fundamental for the group G we develop it here somewhat more fully. 

The type of homaloidal web for the generic element of G is obtained in §3. 
This web depends upon a certain integer “characteristic”, and the linear group g 
of this characteristic with a quadratic and linear invariant is discussed in §4. 
In §§5, 6 the relation between corresponding elements of g and of G, and the 
effect of these elements upon the characteristic of a linear system, are discussed. 
In a later paper a number of other groups of this planar type will be given. The 
one discussed here is of exceptionally simple character. 


2. The Cremona involution J, of order four. Let C be a generic space 
sextic curve of genus 4 on a quadric Q. If then C is the complete intersection 
of Q with a cubic surface K, the system (*) of cubic surfaces on C has the form 


(1) ao K + (air, + at, + a3x3 + ats) Q = aK + rQ = 0. 


The generic pencil of cubic surfaces on C contains one member of the form 7Q. 
The base curve of the pencil is the common curve of some K and rQ. It is 
therefore the sextic curve (K, Q) and the plane cubic curve (K, 7), these two 
curves meeting in six points (K, Q, 7) which are on the conic (7, Q) on x. The 
generic net of cubic surfaces on C contains a pencil of the form 7Q and is obtained 
by adding to the above pencil another surface, 7’Q. Since Q meets the residual 
cubic (K, z) in the six points mentioned on C, and 7’ meets this cubic in three 
points on the line (x, 7’), we see that 
(2) Three generic cubic surfaces on the sextic curve C meet outside of C in three 
collinear points. 

The system (*), or web, of cubic surfaces on C and a point P not on Q is also 
of the form (1), aK + 7Q = 0. There follows at once that 
(3) The web of cubic surfaces on C and on a fixed point P not on Q is such that the 
net of surfaces of the web on a point x is on a second point x’, x and x’ being collinear 
with P. The points z, x’ are partners in a Cremona involution Ip, perspective with 
center at P. 

It is easy to set up equations for the involution and the analytic method has 
certain advantages over the synthetic method. Let P be the point 1, 0, 0, 0 and 


5 F. R. Sharpe and V. Snyder, Certain types of involutorial space transformations, Trans- 
actions of the American Mathematical Society, vol. 21 (1920), pp. 52-78. 
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let K be that cubic surface on C which has a node at P. Since P is not on Q, we 
can take z; = 0 to be the polar plane of P as to Q. Then K and Q have the 
form 


K = ko(xorsx4) 21 + ka(zersxs) = 0, 
Q = 2? oa lo(x223 24) = 0, 


where ke, k3, l. are ternary forms of orders 2, 3, 2 respectively in r2, 23, 2s. The 
web of cubic surfaces on P has the form 


(4) 


(5) ao K + (2x2 + asr3 + a424)Q = 0. 


The condition on the pair z, x’ is that they yield in (5) the same condition on 
G@, &2, a3, a. Thus 








in |K@) 2Q@) 2Q@) 2) || _, 
|K(e’)  22Q(z’) —-23Q(z’) 2, Q(e’) ; 
Since z, x’ are collinear with P, we may take 
x =%+A, rs = Ze, z3 = £3, ry = 2%. 


The conditions (6) are then satisfied if we determine \ so that K(x) Q(z’) = 
K(x’) Q(x). Replacing the values of x’ and using the forms (4), we find that 


A= (—kez? _ Qk3x1 a kele)/(ker1 + ks), 
TZ oe A = (—k3x ~ kele)/(kea1 -f ks) ° 


Hence the equations of Jp are 


(7) a, = —kstit+ kale, 2; = 2i(kea: + ks) (i = 2,3, 4). 
The locus of fixed points of Ip for which x, /ti= x; /2; is 
(8) F= kez? oa Qksx1 _ kgle = 0. 


This is a quartic surface with node at P and the polar cubic surface of P is K. 
Eliminating z; from K and F we get ko(k} + kil.) = 0. The factor 


(9) G=k; + kil = 0 


is the sextic cone from P to C as may be seen by eliminating x; from (4). The 
factor ke indicates the six lines 


(10) F3: ke = kz = 0, 


which are on P and bisecant to C. Hence 
(11) If F is the quartic surface, with the same node P and nodal tangents as K, 
which touches the cone G from P to C along C, then the quartic involution Ip is the 
locus of pairs x, x’ on a line d through P and harmonic to the two further points in 
which on the node P of F meets F again. 

It is clear from the equations (7) of Jp that K is the P-surface which cor- 
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responds to the isolated F-point P. It is also clear from (11) that C is an F- 
curve of the first kind whose P-surface is the sextic cone G. Hence 

(12) The homaloidal web (10) is the system (P®C)*. The point P is an isolated 
F-point whose P-surface is K = (P?C)*®. The F-curves of the second kind are the 
siz lines F, on P and bisecant to C. 

That we have enumerated all the isolated F-points and F-curves of the first 
kind is clear since the web (P*°C)* is transformed by Ip into (P¥C*)* = 
{(P?C)*}*. (P®C)s. (P°C°)!, i.e., into the web of planes. Also two members of the 
homaloidal system (P*C)* meet in a curve of order 16 with 9-fold point at P. 
From this, C of order 6 and F2 of order 6 and 6-fold at P separate, leaving a 
quartic curve with triple point at P, which is the proper transform by I> of a line. 
We have therefore all the F-curves of the second kind. 

A point on Q has codrdinates +/ —lb, x2, x3, 41. This yields in (7) the point 
x’ = —V —h, £2, 23,21. Hence Qis invariant under Jp. The definition (11) of 
Ip may therefore be replaced by the following which makes no use of F: 

(13) A line \ on P meets K and Q each in two pairs of distinct points. A pair 
x, x’ of Ip on d is a pair of the involution defined by the two pairs mentioned. 


3. The Cremona group G generated by involutions Jp. Let P;, P2, --- be 
any sequence of points in S; which are not on Q. Then the sequence of 
involutions, Ip,, Ip,, --- , has a definite product which is by definition a particu- 
lar element of a Cremona group G. We ordinarily assume in forming such a 
product that the point P; is in generic position with respect to the F-points of 
Ip, Ip, -- + Ip,_,, or else that it coincides with one of them. If P; were, for ex- 
ample, on a P-surface of the product just mentioned, we would have a case of 
coalescent ordinary singularities. 

An examination of a few of the simpler products indicates the following 
theorem: 

(1) The generic element of the group G has a homaloidal web of the form 


—1 ,,3 3 3 3z,—2 
(C*" pipe --- pet), 
where 2, --- , t% have zero or positive values and 
L=x2— ™%— %2— ooo — Ze =. 


It is to be observed that the isolated F-points pi, p2, --- are not the F-points 
P,, Ps, --- of the involutions which generate the above element. Thus under 
Ip, Ip, planes pass into members of a web with F-points at pe = P2 and p, = 
transform of P; by Ip,. 

We note that the theorem is true for zx = l and z; = 0 (i > 0). The web 
in this case is the web of planes and the element of G is the identical collineation 
since we wish to keep C fixed and C admits no other collineations. For xp = 2, 
a = 1,2; = 0 (¢ > 1), the web is that of Jp, in 2(12). To prove the theorem it 
is therefore necessary only to show that the web (1) goes into a similarly defined 
web under J», and under J»,,,.. Under J», the position of the F-points pe, --- , px 
will change, and under J»,,, the positions of p:, --- , px will change. If, how- 
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ever, we assume the multiplicity zi; = 0 at pis: for the given web, then, under 
either involution, we find that the change in the multiplicities z is given by the 
equations 


ro = 2% — 32;, 
(2) ij: z; = % — 22;, 
r, = 2; (l ¥ 0,1 ¥ 5), 


where j = 1 for Jp, andj = k + 1for Z»,,,. Since the linear transformation 7; 
leaves L unaltered, (1) is established. 

With the help of 7; we list those types of transformations in G which can be 
expressed as products of not more than five involutions Jp. The numbers given 
are the non-zero values of 


Xo; Viy Ta, + °° 


1;0,0,--- (5.1.1) 14; 652 
(1) 2;1 (5.1.2) 20; 10, 522 
(2) 4;21 (5.2.1.1) 17; 844 
(3.1) 5; 22 (5.2.1.2) 20; 10, 441 
(3.2) 8; 421 (5.2.2.1) 14; 652 
(3) (4.1) 10; 522 (5.2.2.2) 20; 964 
(4.2.1) 10; 441 (5.2.2.3) 26; 13, 642 
(4.2.2) 13; 642 (5.2.3.1) 20; 8821 
(4.2.3) 16; 8421 (5.2.3.2) 26; 12, 841 


(5.2.3.3) 29; 14, 842 
(5.2.3.4) 32; 16, 8421. 


The numbers given in parentheses represent the genesis of the type opposite 
from earlier types. 


4. The linear group g of types in the Cremona group G. We have seen 
in 3 that the characteristic x, 2, --- of the homaloidal web 3(1) of the generic 
element of G may be obtained from the characteristic 1, 0, 0, --- of the web of 
planes by a sequence with repetitions of the involutions 4, #2, - - - defined in 3(2). 
We consider now the linear group g generated by these involutions ¢;. 

A particular generator 7; of g has the period 2 and the determinant —1. So far 
as Xo, 2, alone are concerned, 7; has the invariant linear forms 2, — 2; = % — 2 
and x, — 32, = —(ao — 321). By combining the squares of these to eliminate 
2ox, we find the invariant quadratic form 23 — 327. Hence 
(1) The linear group g generated by involutions i;, which permutes the characteristics 
x of the Cremona elements in G, has the invariant linear and quadratic forms 


Len—m—-m—--, 
Q = z§ — 32? — 3c) — ... 


For characteristics x of Cremona elements, L = Q = 1. 
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By solving the equations L = Q = 1 for given zo we can again find the webs 
given in the table of 3(3) in the following order: 


1;00.-.- 8;421. 
2;10.--. 10;5 22... 

(2) 4:31... 441... 
5;22--- 11;611110.-.. 


” 


Ti cvpnecn  “¢00000nadas ; 


However not all of the positive integer solutions x of the equations L = Q = 1 
yield geometrically existent Cremona webs. The last case in (2) is an example. 
Let 21, X2, 23, --- be so arranged that 1, >z2 223 >---. Then 3x22 = 323, 
3x3 => 3x3,---. Hence the difference 32, (L — 1) — (Q — 1) = O yields the 
inequality 32:(z. — 1) 223 —1. Thus, if x > 1,32, 22 +1 and 32x, > 2%. 
Now 7; applied to the characteristic x yields x, = 2x9 — 32; < 2%. Thus the 
value 2» of the characteristic x can always be reduced and eventually reduced to 
the zo = 1 of the web of planes unless in the process negative values of some of the 
21, t2,--- appear. We have already imposed the restriction that the values of 
x; be positive or zero, i.e., that z; 20 (¢ = 1,2, --- ). These inequalities under 
the involutions 7; become successively x» — 2x; = 0, 2x9 — 32; — 2x; 2 0, ete., 
the number of inequalities becoming infinite if as many as three variables x, £2, 
z3 appear. Hence 
(3) Every solution in positive integers of the equations L = Q = 1 represents a 
geometrically existent Cremona web unless the solution fails to satisfy some one 
inequality in the set which is conjugate to the initial inequalities x; = 0 under the 
group g. The group g on Xo, 2X1, X2, X3 alone is infinite and the set of conjugate 
inequalities is likewise infinite. 

Let g, be the subgroup of g which is generated by 1, t2, --- , 7%, alone. To 
complete the proof of (3) we have yet to show that g; is infinite and also that 
under g; the number of conjugate inequalities mentioned is also infinite. We 
observe that the involution 7; has a single invariant linear space, tr» — 32, with 
a multiplier —1. Thus 7, is a harmonic perspectivity determined by the point 
1, 1, 0, 0 --- and its polar space as toQ. It is thus convenient to represent 7, by 
this polar space and we prove that 
(4) In the gs generated by 11, t2, 13 these three involutions are in an infinite conjugate 
set of generating involutions represented by the linear spaces 
A; (k +) = (8k + 1)xo — 3(k + 1)x, — 3kxr, — 3kz,(l, m, n = 1, 2,3; k 2 0), 
and the group g, (p = 3) is infinite. 

For, 2» — 32; is transformed by i2 into 2x) — 32, — 322 and this by 7% into 
Xo — 322, whence 7; and 72 are conjugate even in gz. These representative linear 
spaces are all of the generic form given in (4). We find that 7 interchanges 
A, (k+), and also interchanges A2(k+) with A;([k + 1)). Beginning then 
with any form of the set such as A;(k+) we apply i; to get Ai(k—). Then 72 
applied to Ai(k+) yields A3([k — 1] +) and A,([k + 1] —), and 7; applied to these 
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yields A;((k — 1]—) and A;([k + 1]+). Also 7, applied to A;({k — 1)+) yields 
A.(k—), and 72 applied to this yields A2(k+). Thus from Ai(k+) we can get 
Ai(k+), Ai(fk — 1] +), and Ai([k + 1]4). This proves the theorem (4). 

We examine finally the left member of the inequalities in (3). The form 2, 
is unaltered by 72 and 73, but 2 carries it into x, + Ai(0 +). In fact we can 
write i; as 2) = % + Ai(0 +), 2; = 2, + Ai(0 +). It is clear then from (4) 
that all the conjugates of x; can be expressed as x, plus a sum of forms A), the 
number of forms A; being the number of times 7; was used in forming the con- 
jugate. We consider the element e¢ = iset;i3. Since it; is the transform of 
iz by v1, it, along with is, is a member of the conjugate set of generating involu- 
tions. The product e is cyclic and of infinite period. In fact e transforms 2; 
into x, + A,(1—) + A2(1—). Moreover e transforms A;(k—) + A2(k—) 

G5 
into A;({k + 1]—) + A2({k + 1]—). Hence e’ transforms z; into 1 + >> 
i=1 
{Ai(i—) + A2(i—)}. These, for all values of j, are distinct, whence e has an 
infinite period. Hence 
(5) Under gs each of x1, x2, 23 gives rise to an infinite set of conjugates. 

A question which frequently yields interesting results is that of the symmetric 
types in G—the types for which all the positive x; have the same value. These 
are given by the equations 2») — pz, = 1,25 — 3px] = 1. For p = 0 we have the 
case 2 = 1, the identical collineation. Otherwise x,(3 — p) = 2 and x = 1, 
or x; = 2. Thus we find only the two cases: 2; 1 and 5; 2 2 given in the tables 
above. 


5. Relation between the generic elements of g and of G. The generic 


element of g generated by 7, 72, --- , 7, has the form 

i, = Gato — Ait — +--+ — apt, (i =0,1,---,p), 
(1) fi: , ° 

zj = zj Gj > p) ’ 


where the a’s are positive integers or zero. Due to the invariance of Q and L 
the coefficients a satisfy the relations: 
i=p 


DD an = an — 1 (k = 0,1,---, 9); 


i=1 


i=p 
(2) 3 > aio = ah, —1; 3 >) at, = ah. +3 (k =1,---,~); 


i=1 t=1 


i=p 


3 Do aix in = exon cron (k,l = 0,1,---,p;k#)). 


t=1 


These conditions are sufficient to verify that the inverse of g; is 


Zo = ante — Santi — --- ~Saeh, 
(3) gi 2, = (a/3) x — axa — --- — i TL, (i =1,---,9), 
r,=2; (j >»), 
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since gg,‘ is the identity by virtue of the relations (2). This gj’ also leaves Q, 
L unaltered and thus we obtain a further set, complementary to (2), of relations 
on the coefficients a: 


i=p i=» 
Dd a; = 300 —1, D a; = 3 a0 —1 (i= 1,---,p); 
j=l i=1 
i=p i=p 
(4) , as = 3 (aj, —1), ai; = 3ai, -—1 (i = 1, » Pp); 


nk 
Dd. aij oj = 3 aio aro (i,k =1,---,pjt#k). 
j7=1 
We now prove the theorem: 
(5) To the generic element g, of g there corresponds a Cremona transformation G, 
in G with C as an F-curve of the first kind for both the direct and inverse trans- 
formations, with pi, --- , p, as isolated F-points, and with q, --- ,q, as isolated 
F-points of the inverse. This element G, transforms the web of planes into the 
homaloidal web (C*°~' g}*" ..- q3%™)**~~-*. The P-surface of C is 


(C?20-F 9h 0 — go %m)*(eu-1) , 
and the P-surface of p; is (C%i/*q{1i - - - g%#i)%i, 


The theorem being true for J;, --- , J,, we have only to show that it remains 
true for products. Let J, be the generating involution with F-point at gq; which 
sends qo, --- ,q, into qs, --- i . For convenience let q, = q,;. Then the 
product GJ; transforms the web of planes into the transform of the homaloidal 
web described in (5) by J;. This transform has the order 4(3a — 2) — 
6(a0 — 1) — Yaw = 3(2a0 — 3aw) — 2 = 3a,) — 2. It has, for multiplicity 
on C, the value 3a0 — 2 — (aw — 1) — 3an = (2a — 3aw) — 1 = ayy — 1. 
It has, for multiplicity at q{, the value 3(3aw— 2) — 6(a0— 1) — 6ay = 
3(a0 — 2ew) = 3aj 9. It has, for multiplicity at q3, the value 3a = 3a». 
Thus the web G,J,; has the form given in (5) for the values a’. But the values 
a’ defined above are precisely the a@’’s of the product gi: [ef. (1) and 3(2)]. To 
complete the proof a similar check must be made for the P-surface of C by trans- 
forming the surface given in (5) by J,; for the P-surface of pi; and for the P-sur- 
face of pe. It turns out as above that these transforms can be read off as in (5) 
from the coefficients a’ of the product g:i:. It is necessary also to carry out the 
same argument for the product GiJ,4:, Z,4: being the generating involution 
with isolated F-point at 9g,4:. We obtain the same check with the product 
gitp41 and theorem (5) is established. 


6. The transformation of particular surfaces or of linear systems of surfaces 
by elements of G. We define the characteristic of a surface or linear system, 
S, with respect to the element G; of G described in (5) to be a set of numbers 


Ys Yo) Yip Y2, + ** Ue 
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of which y is the order of S, yo the multiplicity of S on C, and y, yo, --- the 
multiplicity of S at pi, po, ---. Then S is transformed by G, into a system S’ 
with characteristic y’, ys, ¥1, Y2, -+- With respect to the F-system C, qi, 42, --- 
of G;'. From the properties of G; given in 5(5) we find at once that 


y’ = (Bam — 2)y — Claw — 1)yo — ani — -+- — any, 
Yo = (ao — 1)y — (2a — 3)yo — (an/3)yi — --- — (a0,/3)yp , 
y; = 3any — 6awyo — aay — --- — aigy, (i= 1,---,p), 
yi =U (>). 
We observe first that y’ — 3y, = y — 3yo and secondly that 
(y’ — 2yo) = aly — 2yo) — am(yr/3) — --- — aop(y,/3) , 
y;/3 = awly — 2yo) — aa(y/3) — --- —au(y,/3) (= 1,---,9), 
yi/3 = y:/3 (l> p). 


From this there follows that 
(1) If in the linear transformation 5(1) we make the change of variable x = 
y — 2yo, 32; = yi (@ = 1, 2,--- ), then the new transformation, coupled with 
y’ — 3y, = y — 3yo, yields the linear transformation on the characteristic y, yo, yi 
which is effected by the Cremona transformation G, of 5(5). 

It is clear that the infinite Cremona group G here defined has an arithmetic 
theory which is precisely parallel to that of the infinite ternary Cremona group. 


UNIVERSITY OF ILLINOIS. 














COLLECTIONS FILLING A PLANE 
By J. H. RoBerts 


Introduction. In 1928 the author showed! that there exists an upper semi- 
continuous collection G filling a plane S such that every element of G is a bounded 
continuum not separating S. Later he stated* that the elements of G could all 
be taken to be bounded continuous curves. If M is a bounded continuous curve 
lying in a plane S and not separating S, either M is an are or M contains a triod.* 
But any collection of mutually exclusive triods lying in a plane* is necessarily 
countable. Consequently, all of the elements of the collection G of continuous 
curves filling S, except possibly a countable number, are ares. In view of this 
result it seemed likely that there existed an upper semi-continuous collection G 
filling S such that every element of G was an arc. In fact, the author has since 
stated‘ erroneously that such is the case. The principal object of the present 
paper is to prove that there does not exist an upper semi-continuous collection G 
of arcs filling a plane S. In view of this result, the fact that there is a collection 
G, every element of which is a bounded continuous curve not separating S, be- 
comes of more interest, and accordingly an example of such a collection G is 
given. 

Derinition. A collection G of closed point sets lying in a metric space is 
said to be upper semi-continuous’ if for each element g of G and each positive e 
there exists a positive d such that if z is an element of G and I(x, g) < d, then 
u(z,g) < e. 

DeriniTion. The element g of G is a limit element of a subcollection K 
of G if for every positive e there is an element x of K distinct from g such that 
u(t, g) <e. 


Received November 15, 1935. 

1 Fundamenta Mathematicae, vol. 14 (1929), pp. 96-102. 

2 This result was presented to the North Carolina Academy of Sciences, May, 1934, but 
no published statement of it has appeared. 

3 A triod is the sum of three arcs AP;, AP; and AP;, each pair having only A in common. 
Cf. R. L. Moore, Foundations of Point Set Theory, Theorem 71, p. 250, and Theorem 75, 
p. 254. Theorem 75 is stated for a closed and compact set, but the present result ob- 
viously follows, since the plane is the sum of a countable number of such sets. 

* See abstract #196, Bull. Amer. Math. Soc., vol. 41 (1935), p. 330. 

5 R. L. Moore, Concerning upper semi-continuous collections of continua, Trans. Amer. 
Math. Soc., vol. 27 (1925), pp. 416-428. If Misa point set and P is a point, then by l(P, M) 
is meant the lower bound of the distances from P to all the different points of M. If Mand 
N are point sets, then by 1(M, N) is meant the lower bound of the values l(P, N) for all points 
P of M, while by u(M, N) is meant the upper bound of these values for all points P of M. 
It is to be observed that u(M, N) may be different from u(N, M), while 1(M, N) = I(N, M). 
The quantities 1(M, N) and u(M, N) are called the lower, respectively upper, distances of M 
from N. 
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DerrniTion. A collection G of point sets is said to fill a space S if every 
element of G is a subset of S and every point of S belongs to some element of G. 


1. Let G denote an upper semi-continuous collection filling some metric 
space S. Let n be a positive integer and suppose d, is a domain in S having 
the following properties: (1) d, = R, + R: + --- + Ry, where R; is a domain 
and® 6(R;) < 1/n, (2) d, covers some element of G, but (3) Ri + --- + Ria t+ 
Riss + --- + Ri (& = 1, 2, --- , k) does not cover any element of G. Let e, 
be the set of elements of G covered by d,. Then e, is? a domainin G. Let E, 
be the sum of all such domains e,. Clearly EZ, D E,s;. Let G, be the inner 
limiting set (= G,; set) common to Ej, Es, E3, --- . 

THEOREM 1. [If the elements of G are closed and compact, then G, is a con- 
tinuous® collection. Furthermore, if S is complete, then G, is maximal with respect 
to the property of being a continuous subcollection of G, and is dense in G. 

Suppose G; is not a continuous collection. There exist elements g, 91, g2, gs, - - - 
such that lim l(g,, g) = 0 but u(g, g,) does not approach zeroasn— ©. There 


exists a positive e and a sequence g,, gs, 93, --- such that u(g, g,) > e and 
9. = gmfor some m. For each n there is a point P,, of g such that U(P,, g,) > e. 
There exists a limit ne P of Pi + P,+ P3;+ ---, and there is an infinite 
subsequence hy, he, hs, --- of 91,92) 9a, --- Such that UP, h,) > e/2 for every n. 

Now choose a sable integer m enh that 1/m < e/2. Since g is 
in G,, there is some domain d,, covering g and having the properties 
(1) and (3) as well. For some j the element h; is covered by d,. Set 
d, = Ri + RR. + --- + Ry, these being domains of diameter < 1/m < e/2. 
For some integer r (r S k) the domain R, contains P. Then R, contains no 
point of h;; i.e., hj is covered by Ri + --- + Roi + Rai + --- + Ry. This 
contradicts property (3) of d,. Hence it has been proved that G, is a con- 
tinuous collection. 

Suppose now that S is complete. It is to be shown that G, is dense in G. 
Let D, be any domain of elements of G, and let g; be a particular element in D,. 
One can take a finite set of domains in S whose sum covers g; and throw out 
domains of this set until what is left covers some element of G but no more 
domains can be omitted and that property retained. Thus there exists a domain 
d, having properties (1), (2) and (3) such that® d; C DT and u(d;, g:) < 1. Let 
Dz be the set of elements covered by d, and let gz be a particular such element. 


6 If X and Y are points of a metric space, then 6(X, Y) will denote their distance apart. 
More generally, if R is a point set, then 5(R) will denote the diameter of R. 

7 Moore, loc. cit., Theorem 1. 

8 The collection K is said to be continuous provided that for each element g and sequence 
91, 92, 9s, -** Of elements of K such that l(g,, g) ~Oasn— , it follows that both u(g,, g) 
and u(g, gn) ~Oasn— ©. Thus in addition to being upper semi-continuous, a convergent 
sequence of elements of K has a whole element of K as its liiniting set. 

* If Dis a set of elements of G, then D* will denote the point set in S obtained by adding 
together all elements of D. 
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As above, there exists a domain dz having properties (1), (2) and (3), such that 
d, C D3 and u(ds, g2) < 1/2. This process can be continued indefinitely. Thus 
there exist elements gi, g2, gs, --- of G and domains d), de, d3, -- - in S such that 
(a) the domain d, has properties (1), (2) and (3), and in addition covers g,, 
(b) dayi C dp, and (c) u(gnsi, gn) < 2/2". It remains to show that there is an 
element g of G which is a subset of every d,. For this it will be sufficient to 
show that there is some element g which is either g, for infinitely many values 
of n or is a limit element of the set of elements g: + gz + gs + ---. 

Clearly there exists an infinite sequence of domains in S, Fi, E2, E3, --- such 
that (a) E, C d,, (b) 6(Z,) < 2/2", (c) EZ, contains a point P, of g, and (d) E, 
and £4; have a point in common. The sequence P;, P2, Ps, --- satisfies the 
Cauchy convergence condition, and since S is complete, there is a point P 
which is the sequential limit point of this sequence. Let g be the element of G 
which contains P. Then since g,,(m = n) is contained in the domain d,, it 
follows that g is in d,, for every n. Hence g is covered by dh, de, ds, --- and is, 
therefore, an element of G, lying in D,. 

It remains to show that G,; is maximal with respect to the property of being a 
continuous collection. Let g be an element of G and suppose G,; + g is a con- 
tinuous collection. Let n be given. There exists a domain c, covering g such 
that (a) c, = K, + Ky + --- + Ky, where K; is a domain in S of diameter 
less than 1/n and (b) for each 7 there is a point P; in g and in K; such that P; 
is neither in nor on the boundary of K; if 7 + 7. There exists a positive ¢ such 
that every distance U(P;, Ki + --- + Kin + Kis + --- + Ky) is greater 
than «. Since G; + g is continuous, there exists a 6, such that if A is in G, and 
l(h, g) < 6,., then u(g, h) < «. Then h has a point in K; for everyi (i < k). 
Let R; be the subset containing every point of K; at a lower distance less than 
5. from g, and set d, = R, + R. + --- + R,. Then d, covers g, and every R; 
is of diameter less than 1/n. Furthermore, if some R; is omitted, the remaining 
domain covers no element of Gi; and since G; is dense in G, it can cover no 
element of G. Thus d, has properties (1), (2) and (3). Therefore g is in G. 
This completes the proof of Theorem 1. 


2. DEFINITIONS. Suppose 9, g2, gs, --- is a sequence of arcs converging to an 
arc g. Let AB be any subare of g. Suppose that for every positive ¢« there 
exists an m such than if n > m, then g, contains two mutually exclusive subarcs, 
each containing points at a distance less than ¢ from A and points at a distance 
less than ¢ from B. Then the subare AB of g will be said to be approached 
doubly by the sequence gi, g2, 93, --- . If no-subare of g is approached doubly by 
the sequence 91, g2, gz, -- - Which converges to g, then g is said to be approached 
equi-continuously” by the sequence gi, ge, gs, --- 

TuHeoreM 2. If S is complete and G is an upper semi-continuous collection of 


10 R. L. Moore, Concerning certain equicontinuous systems of curves, Trans. Amer. Math. 
Soc., vol. 22 (1921), definition, p. 42. 
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arcs filling S, there is a subcollection Gz of G, (G, as defined in §1) such that (1) 
G2 is dense in G, (and therefore dense in G) and (2) every element g of Gz is ap- 
proached equi-continuously by every sequence of elements of G, converging to g. 

The author’s detailed proof of Theorem 2 is long, and in some respects similar 
to the proof of Theorem 1. Therefore only an outline of the proof will be given. 

The following lemma is first established. 

Lemma 2.1. Let € be a positive number and let D be adomaininG. Let g be an 
arc of G, in D which contains n, but not n + 1, disjoint arcs each of diameter 
greater than «¢; and suppose that g contains a subarc of diameter greater than « 
which is approached doubly by some sequence of elements of G;. Then there is an 
element g' of G, in D which contains n + 1 disjoint subarcs each of diameter greater 
than e. 

With the help of the above lemma and processes described in the proof of 
Theorem 1, we next arrive at 

Lemma 2.2. If € isa positive number and D is a domain in G, there is a domain 
E in G such that E C D, and if g is any element of G, in E, then no subare of g of 
diameter greater than ¢ is approached doubly by any sequence gi, go, gs, --- of ele- 
ments of G, converging to g. 

Theorem 2 is then proved by showing the existence of a sequence of domains 
E,, E>, Es, --- lying in an arbitrary domain D of G such that (a) B,4: C Ey, 
(b) there is an element of G; common to K;, Es, E3, --- , and (c) the domain 
E,, has the property of the domain E of Lemma 2.2 with ¢ equal to 1/n. 


3. The next three sections will be devoted to the proof of the following 

THEOREM 3. There does not exist an upper semi-continuous collection of arcs 
filling a plane. 

The proof is indirect. A contradiction is reached at the end of §5. 

Suppose, then, that G is an upper semi-continuous collection of ares filling a 
cartesian plane S. Let a and b denote mutually exclusive closed subsets of S. 
Let n and k be positive integers. Suppose D is a simple chain of domains 
such that (1) every domain of the chain D is of diameter < 1/k, (2) the sum 
of the domains of the chain D covers some element of G, and (3) there do not 
exist n subchains of D having at most end domains in common and each con- 
taining both a point of a and a point of b. Let H,,x (a, b) be the set of all ele- 
ments of G, each of which is covered by some such chain D. Let K,x(a, b) be 
G — H,,.(a,b). Let H,(a, b) and K,, (a, b) be defined as follows: 


H,(a, b) = Hm-Hn2-Hns- --- , 
K,,(a, b) = Kui + Kae + Kas + cee, 


Then, as H,,; is a domain, it follows that K,, is closed. Thus H, is a G,; set and 
K,, is an F,. It might be noticed that H, contains just those elements of G 
which do not contain as many as n mutually exclusive segments each with end 
points on a and b, respectively. 
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4. We now specialize the sets aand b. Let p(x, y) denote a distance function 
defined over G with respect to which G is a cartesian plane." Let g be an ele- 
ment of G.. Let J be a simple closed curve enclosing g. Let a, a, and b denote 
mutually exclusive ares crossing g, having end points on J but otherwise lying 
within J, each having only one point on g and such that a separates a; and 6 in 
J plus its interior. There exists an ¢« such that (1) if A is in G and p(h, g) S «, 
then h lies within J and cuts both a; and b, but (2) if h is in G,; and p(h, g) S «, 
then h does not cut a, or b, on both sides of g, and does not contain two mutually 
exclusive segments each having end points on a; and b, respectively. Let 
N denote the subcollection of G consisting of all elements A such that 
p(h,g) S «. The following can now be established. 

Lemma 4.1. The point sel® (N - K.(a, b)|* contains a domain. 

Of the two domains into which a; divides the interior of J, let E denote the 
one such that FE > a + b. Let M be the collection of all maximal connected 
subsets of h-E for all elements h of N. The collection M is upper semi-continu- 
ous and each of its elements is an are or a single point. Each element of M con- 
tains a point of a;. Obviously M* contains a connected domain D which contains 
no point of g but does have on its boundary points of each of the segments AC and 
CB, where ACB is the are a, C being ong. Then of the elements of M having 
points in D, some cut the segment AC, some BC, and all cut AC or BC. Since 
M is upper semi-continuous .(and D connected), there is some element /; 
of M with a point in D such that A; cuts each of the segments AC and CB. Let 
gi be that element of M which is a subset of g and cuts the are b. There is an 
element k of M which is a subset of an element of G, and which cuts’? AC in a 
point 7’, and is such that k is not separated from g, in E by hy. Then there is an 
element h: of M which cuts both AC and CB, and does separate k from gi in E. 
Let E, be the set of all points P of E such that P can be joined in E to hi by an 
are not intersecting h2 and P can be joined to he in E by an arc not intersecting 
h;. Then E, is connected, has boundary points on AC and on CB. Hence 
some element h; of M having points in E, cuts both of the segments AC and CB. 
Let M; be the set of all elements of M which cut both of the segments AC and 
CB and contain points of £;. The elements of M, are obviously in a linear 
order, as one of every pair separates the other in E from g:. We may say hi 
precedes all other elements of M, and hz follows all other elements of M,. But 
M, is a closed collection. Furthermore, between any two elements of M; (as, 
for example, between h; and he) there is a third element of M;. From all this it 
follows that M, is an arc of elements from h; to he. There is a first element l 
of this are which cuts the segment TC. Then | also cuts AT (and also, of 
course,CB). The arc 1 contains a subare pqrst, where p, r, and ¢ are on a, and q 
and s are in E on the non-a side of the are b. There is a domain L with the 
following properties: (1) L lies in E between a, and a, (2) L is bounded by a 
simple closed curve which contains as a subset a subarc of | containing r in its 
interior, and (3) the diameter of L is so small that if P is a point of L and g, 


1 R. L. Moore, footnote 5. 
12 There is such a k cutting AC or CB, and to be explicit, we may suppose it is AC. 
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is the element of G containing P, then g, is in N, and therefore cuts a;. Then 
L is the domain desired, being in [N - K2(a, b)]*. For starting on the are g, from 
the point P of L and going in either direction, one intersects the arcs a, b and a 
again before it is possible to intersect a,;. Thus g, is in K,(a, b). This com- 
pletes the proof of 4.1. 


5. Since K}(a, b) (a and b as defined in the previous section) contains the 
domain L, it follows" that for some i; the closed set K?;, (a, b) contains a bounded 
domain Dz, which is a subset of L. It may be that no matter how small the 
positive number e¢; is, there exist arcs a; and b; similar to a and b (as regards 
cutting g and J) with u(as, a) < €;and u(bs, b) < €3such that K3 (as, bs) contains 
a domain which is a subset of De. In this case choose ¢; to be 1/8 of l(a, b) and 
select an integer i; such that K3,,(as, bs) contains a domain D; such that D; C Ds. 

Suppose this process can be continued and that (1) for every n the set K*,. 
(a,, b,) contains a bounded domain D,, (2) Das: C D,, (3) u(an, a) and u(b,, b) 
are less than 1/4 of l(a, b), and (4) the ares a, and b, cross g, and have end points 
on J. Then there is a point P common to D,, De, D3, --- . Let gp be the are 
of G which contains P, and let a’ and b’ be ares crossing g (similar to a and b 
and a, and 6,) such that (1) l(a’, b’) > 1/8 of l(a, b), and (2) a’, as well as b’, 
separates every a, from every b, within J. Then every subare of gp which cuts 
a, and b, cuts a’ and b’. Now g, belongs to K,(a,, b,), and hence to K,(a’, b’) 
for every n. But then g, contains infinitely many arcs spanning from a’ to 
b’, mutually exclusive closed sets. This is impossible, and we have arrived at 
the following: 

Lemma 5.1. There exist arcs a and b, an element g of G2, a positive ¢ and integers 
n and k, and a simple closed curve J such that (1) J encloses g, (2) the arcs a and b 
are mutually exclusive, have their end points on J, otherwise lie within J, and cross 
g in points E and F, respectively, these being the only points of g on a + b, (3) every 
element of G which lies within J cuts both a and b, but no element h of G, which lies 
within J cuts either a or b on both sides of g, nor does h contain two mutually ex- 
clusive segments each having its end points respectively on a and b, (4) K*,(a, b) con- 
tains a domain D and every element of G with a point in D is within J, and (5) if a’ 
and b’ are arcs such that u(a’, a) < «and u(b’, b) < «, then K* ,,(a’, b’) does not con- 
tain a domain which has a point in common with D. 

Select definite arcs a’ and b’ distinct from a and b with properties similar to 
those obtaining for a and b and such that (1) in the interior of J the order is 
aa'b’b, and (2) u(a’, a) < «and u(b’,b) < «. Let M be the collection of ares of 
K,,x.(a, b) which contain pointsin D. Let L be the set of all elements of M which 
have exactly n distinct arcs spanning" a’ to b’, i.e., 


L = M-IK,(a’, b’) — Kys:(a’, b’)). 


13 The theorem is as follows: If for every n the closed plane set M, contains no domain, 
then M, + M, + M; + --- contains no domain. 

4 A set Z of arcs will be called a set of distinct arcs spanning a’ to b' if no two ares of Z 
have more than one end point in common, and each arc of Z has its end points on a’ and b’, 
respectively. 
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It is merely a matter of notation to assume that D is the maximal domain which 
is a subset of M*. The following assertion follows obviously from (5) of Lemma 
5.1: 

Lemma 5.2. The point set L* is dense in D. 

Now let g: denote any element of L. Then g; contains at least n arcs spanning 
a to b, since g: isin K,(a, b). But since every arc from a to b contains a subare 
from a’ to b’, it follows that, since g, is in L, it contains not more than n distinct 
ares spanning a to b. Thus if A and B are the end points of gi, there exist on 
gm. points A, Qi, Ri, Q2, Re, --- ,Q,, R,», B in the order written such that (1) the 
segment Q;R; contains no point of a or of b, (2) one of the points Q; and R; is 
on a, the other on b, and (3) any subarc of g,; which contains both a point of a 
and a point of 6 contains the entire arc Q;R;, for some 7. 

There exists a simple chain N of connected domains irreducibly™ covering g: 
such that (1) every domain of N is of diameter less than 1/k, (2) for each 7 (¢ = 
1, 2, --- ,m) N contains a subchain C; which irreducibly covers the are Q;Rj, 
(3) only one domain of the chain C; cuts a and only one cuts b, (4) no domain 
of N contains points of both a and a’, or of both b and b’, and (5) neither of the 
two subchains of N consisting of all domains of N which contain a point of AQ, 
or R,B, respectively, cuts both a’ and b’. 

Lemma 5.3. Every element of K,x% which is covered by N must contain a point 
in every region of C; for everyt (it = 1,2, --- ,n). 

The proof of Lemma 5.3 is immediate. For if h is an element of G which is 
covered by N but does not intersect every domain of C; for some i, then h is 
covered by a subchain of N which does not contain as many as n subchains 
each spanning from a to b. Then h is in H,,, and hence not in K,x. (See §3 
for definitions.) 

There are now two cases to be disposed of. 

Case 1. Suppose the point A (an end point of g;) is in D. 

We can let g; play the rdle of the arc g of §4 and apply the argument which 
proved Lemma 4.1, with scarcely a change, except in notation. The following 
lemma results: 

Lemma 5.4. There exists an arc grs which is a subarc of an element h of M 
such that 

(1) q and s are on* a, and r is on b, 

(2) if J denotes the simple closed curve qrs plus the subare qs of a, then the in- 
terior of J and the segment qs of a are subsets of D, 

(3) the interior of J contains points of a’ and of b’, but no point of a or of b, and 

(4) J plus its interior is covered by the chain C, (the subchain of N covering the 
arc QR). 

By Lemma 5.2 some element V of L has a point Z within J between b’ and b. 
If a point P moves along V, starting at Z, then before it can cut the arc b it 


16 A set N of domains will be said to cover irreducibly a point set g, if N covers gi, but no 
domain of the set N can be omitted and that property retained. 

16 The pairs (a, a’) and (b, b’) are essentially interchangeable, and so to avoid ambiguity 
it can be supposed that the notation has been properly assigned. 
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must cut b’, a’, a, a’, and 6’. But this is impossible for an are V of L, which 
contains precisely n arcs spanning a to b and precisely n arcs spanning a’ to b’. 

Case 2. Neither end point of g,isin D. The subares AQ,R; and Q,R,B of 
are in S — D, since every element of G (sufficiently close to g:) cuts both a and b. 
It follows that 7 can be so chosen (¢ < n) that Q:R; contains no point of D, but 
RQissRis: does contain a point of D. Due to symmetry with respect to a 
and b we can suppose that Q; and Rj,; are on a, while R; and Q;,: are on b. 

We again use an argument very similar to that given in §4, and prove that 
there exists an element h of G which is covered by N and contains a subarc 
pgrst with the following properties: (1) some point P of the segment pg is in D, 
(2) p, r, and ¢ are on a and are the only points of a on pgrst, (3) q and s are on b, 
and (4) p and r are in domains of the chain N containing R;,, and Q;, respec- 
tively. Now an element V of L can be chosen with 1(V, P) so small that V has 
points in every domain of C; and C;,;. Such an are V cannot have points on 
both sides of pgrst within C; or Ci4: (since Visin L). Think of a point Z moving 
along V into the chain C; (see Lemma 5.3). If V is taken close enough to A, 
and on the proper side, then Z will cut the ares b, b’, a’, a’, b’, b, a’ before it can 
cut the are a, since it cannot cross pgrst. But then V has more distinct arcs 
spanning a’ to b’ than it has spanning a to b. This is impossible and the proof 
of Theorem 3 is complete. 


6. In this section the following theorem is established. 

THEOREM 4. There exists an upper semi-continuous collection F filling a plane 
S such that every element of F is a bounded continuous curve not separating S. 

There will first be described a collection G of arcs such that G is an open 
curve and® G* is a continuum M. The set M will be the common part of an 
infinite sequence of sets M,, M2, M3, ---. Let M;, be the set of all points with 
coérdinates (z, y) such that 0 S y S 1. 

By a V, in the following description, will be meant an are which is the sum 
of two intervals, with end points respectively on the lines y = 0 and y = 1, the 
intervals being subsets of lines whose slopes are in absolute value > 1. It will 
be said that the V opens upward, or downward, depending upon whether the 
end points of the V are on the line y = 1, or on the line y = 0. There exist two 
sets of V’s, Gu and Giz, such that 

(1) the elements of Gy + Gi: are mutually exclusive, 

(2) the V’s of Gu open upward, and those of Gi: open downward, 

(3) the number of V’s in Gy + Giz within any circle is finite, and 

(4) as a point P moves along the line y = 0 it intersects a V of Gu between 
every two V’s of Gy, and vice versa. 

Let Mz be M, minus all points of M, which lie within some V (i.e., points 
which lie in the pair of smaller angles which the two lines whose sum contains 
the V make). Next, there exist sets of V’s, Gx and Gz such that if A and k are 
consecutive” V’s of the set Gu + Giz (whence one is in Gy—call it h—and the 


7 The linear order is obvious. 
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other in Giz), then between h and k there is precisely one element g; of Gn and 
one element g2 of G2, the order being h, go, gi, k. Let M; be M2 minus all 
points of M. within some V of Ge; + Ge. 

Clearly, then, there exists an infinite sequence of pairs of sets of V’s Gu, Gi2; Ga, 


G22; Gu, G32; -- - such that for every n the V’s of the set K, [K, = >> (Ga + Ga)] 
i=l 


are mutually exclusive, are alternately (as one intersects them along the z-axis) 
open upward and downward, and such that every interval of the line y = k 
(0 = k S 1) neither intersecting nor lying within any V of the set K,, is of 
length less than 1/n. 

Let M,4;: denote M, minus all points of M, which lie within any V of 
Gir + Gaze. Let M be the common part of M,, M2, M3, ---. Let G be the col- 


lection > (Gi + Giz) plus all maximal connected subsets of M — > (Ga + Giz). 


i=1 ¢=3 

Each of these maximal connected subsets is, clearly, an interval with end points 
on the lines y = 0 and y = 1, respectively. It is also seen that G is an open 
curve of arcs. 

Now in my paper Concerning atriodic continua I showed that if M is a con- 
tinuum in a plane S which, for every positive number e, can be covered by a 
simple chain of connected domains all of diameter < e, there exists in S an un- 
countable set K of mutually exclusive continua each homeomorphic with M. 
This result can be extended to the case where M can be covered, for every posi- 
tive e, by an unbounded chain of connected domains (i.e., a domain D; for every 
integer 7 such that (1) D; and Dj, have a point in common but (2) D; and D;,; 
have no point in common if j > 1), all of diameter < e. Now clearly the con- 
tinuum M defined above can be covered, for every positive e, by an unbounded 
chain of connected domains of diameter < e. 

The following lemma will be stated without further proof.” 

Lemma 6.1. There exists a set T of topological transformations of the plane S 
into itself such that: 

1. If A denotes the set of all triadic decimals k (0 S k S 1) each of whose digits 
is 0 or 2, then for every k in A there is a transformation T; of the set T, and every 
transformation of the set T is some T',.. 

2. If we denote T;.(M) by M, and T,(G) by H;, (i.e., Hi ts the upper semi-continu- 
ous collection of arcs of which M;, is the sum), then every M, separates the plane into 
two domains, one containing every M), such that h < k, the other every M,, such 
that h > k. 

3. Every element of H;, is of diameter greater than one. 

4. If k; and kz denote respectively the fractions (triadic) nynz --- n,0222 --- 


18 Monats. fiir Math. und Phys., vol. 37 (1930), pp. 223-230. The argument given proves 
more than is stated in the theorem. 

19 Part of the argument omitted is an easy generalization of that in my paper, ibid. 
The remainder is fairly obvious, graphically, but a detailed logical proof is tedious in the 
extreme. 
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and nynz --- ni2000 --- , and D; the domain of S complementary to My, + Mi, 
which contains no M,, then there exists a collection K; of arc segments such that 

(a) K* fills Dj, 

(b) there is a one-to-one correspondence between the segments of K; and pairs of 
corresponding elements of H;, and H,, (i.e., elements which are images under Tx, 
and T,., respectively, of the same element of G), 

(c) of r is a segment of K; and g; and g2 are the corresponding elements of Hx, 
and H;,,, then the end points of r are end points of g, and ge, respectively, unless 
gi: and gs are V’s (see definition of M), in which case the end points of r are the 
vertices of the V’s, and the diameter of g: + g2 + 1 does not exceed the diameter of 
gi (t = 1, 2) by more than 1/1, and finally, 

(d) the collection L; of continuous curves g: + g2 + r is upper semi-continuous 
and is an open curve of elements. 

Let k; be .000 --- and ke be .222---. Let D denote the complementary 
domain of M;, + M;,, in S which is bounded by M;, + My. Let W be the 
collection consisting of (1) all elements of L; (¢ = 1, 2, 3,---), and (2) all 
elements of H; for every k of the set A which is either not periodic at all, or 
has a period greater than 1 (i.e., in k the digits 0 and 2 each occur infinitely 
many times). Now W fills D. Furthermore W is upper semi-continuous. 
For let h; and he be distinct elements of W. There is a positive integer i such 
that l(hi, he) > 2/i. Suppose the elements gi, go, gs, --- , all distinct from h, 
contain points P;, Ps, Ps, --- converging to a point P of h;. Let g, be an ele- 
ment of some H, which is a subset of g,. Let Q be the set of all elements g, 
such that d(g,) — d(g,) > 1/i, and let R be the set of all other elements gp. 
Clearly hz contains no limit point of the sum of the elements of R. On the other 
hand, there exists an m such that every element of Q belongs to L; + Lz + --- 
+ Ln». Since L; is upper semi-continuous for every j, the element A, contains 
no limit point of the sum of the elements of Q. 

Now suppose Z is a topological transformation of the domain D into the 
plane S which does not decrease the distance between any pair of points of D. 
Let F denote the collection of images under Z of the elements of W. Then F 
is an upper semi-continuous collection filling the plane S such that (1) every 
element of S is either an arc, or the sum of three ares a, b, and c, which make an 
H (i.e., a and b are mutually exclusive, and c has just its end points on a + 6, 
these being interior points of a and b, respectively), and (2) every element of F 
is of diameter greater than 1. 


Duke UNIVERSITY. 














ON CERTAIN ANALYTIC CONTINUATIONS AND ANALYTIC 
HOMEOMORPHISMS 


By Artuur B. Brown 


1. Introduction. We generalize to the case of n complex variables and one 
real variable a theorem of Severi' regarding analytic continuation, over a limited 
domain? in the (2n + 1)-space of the variables, of a function given analytic near 
the boundary B. The theorem states that if B is connected the continuation 
is possible. Severi proves the theorem only for the case that n = 1 and the 
domain is of simple type. We remove all restrictions as to simplicity of the 
domain and its boundary. 

The similar theorem for a region in the 2n-space of n > 1 complex variables is 
Osgood’s* extension of a theorem of Hartogs.‘ Because of certain geometric 
difficulties which seem not to be fully met in Osgood’s proof, we give a detailed 
proof of this theorem. The proof applies without essential modification to the 
case of meromorphic continuation.® 

As an application, we prove in the case of n complex variables that if the con- 
nected boundary of a limited domain in the space undergoes an analytic homeo- 
morphism with non-vanishing jacobian, the transformation can be continued 
analytically over the domain to yield an analytic homeomorphism of the domain 
and its boundary (Theorem 4.II)._ A somewhat similar result is obtained for the 
case of one real and n complex variables (Theorem 4.ITI). 


2. Functions of n complex variables. The following is the Osgood form 
of the theorem of Hartogs. 
TueoreM 2.1. Let & be a limited domain with connected boundary B in the 2n- 


Received October 16, 1935; presented to the American Mathematical Society, February 
23, 1935. 

1 F. Severi, Una proprieta fondamentale dei campi di olomorfismo di una variabile reale e 
di una variabile complessa, Atti della Reale Accademia Nazionale dei Lincei, Rome, Rendi- 
conti, (6), vol. 15 (1932), pp. 487-490. Our theorem is numbered 3.II. 

2 By a domain we mean an open set. A region is a connected open set. A limited point 
set is one of finite diameter. 

3 W. F. Osgood, Lehrbuch der Funktionentheorie, vol. 2, part 1, Chapter 3, §11. We refer 
to the book as Osgood II. 

‘F. Hartogs, Einige Folgerungen aus der Cauchyschen Integralformel bei Funktionen 
mehrerer Verdnderlichen, Sitzungsberichte der mathematisch-physikalischen Klasse der K. 
B. Akademie der Wissenschaften, Miinchen, vol. 36 (1906), pp. 223-241. Hartogs proves 
only that if a function is given defined over the entire region and boundary, analytic at the 
boundary and without removable singularities in the region, it is analytic in the region. 

’ Theorem2.1I. See Osgood II, Chapter 3, §13, and E. E. Levi, Studii sui punti singolari 
essenziali delle funzioni analitiche di due o pit variabili complesse, Annali di Matematica, 
(3), vol. 17 (1910), pp. 61-87. 
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space of the n complex variables x, --- , Xn, n > 1, and f(a, ---,2%,) = f(x) a 
function single-valued and analytic in a domain 5 containing B. Then f admits a 
single-valued analytic continuation throughout R + 5.8 

First we divide 2n-space into 2n-cubes of diameter less than the minimum 
distance from B to the boundary of 5, determined by (2n — 1)-planes parallel to 
the coérdinate planes. Let A be the set of all points on closed cubes which meet 
R = R4+B. Since B is connected, R is connected; hence A is connected. If 
the points of A are removed from 2n-space, there remains a set of one or more 
regions one and only one of which, say &, is not limited. The boundary of the 
latter is denoted by C, which must then be the locus of a (2n — 1)-cycle both orien- 
table and (mod 2). The cells are faces, and edges of lower dimensions, of some 
of the cubes. We note that C is part or all of the boundary of A. Then C 
bounds a limited domain ‘S = 2n-space minus (£ + C), so that To) = J+ C 
contains A, and S contains R. Also, C is connected; otherwise it would be easy 
to show that A is not connected. Let 6 = Ty) +5. We shall prove that f can 
be continued analytically over all of &. 

Let Pbea point notin&. A (2n — 1)-sphere = with center at P will be said to 
be reachable if there exists a function ¢(r), defined, single-valued and analytic 
over the part of & outside 2, with ¢(z) = f(z) in the part of 5 outside = and 
not in 7. Now if it were impossible to continue over all of &, not all spheres 
with center at P would be reachable, and we could let 2» be the (2n — 1)-sphere 
with center at P whose radius was the greatest lower bound of the radii of all 
the reachable spheres. By considering analytic continuations radially towards 
P, one proves easily that  p is itself reachable. Evidently no sphere smaller 
than 2» can be reachable. Let us now suppose Xp» to exist, and show that a 
contradiction must arise. 

Case Il. If Qis any point of Xo-C (intersection of Xo and C), then f(x) = o(x) 
at the nearby points outside of Xo. It then follows that f(x) near Q gives a proper 
analytic continuation ¢(x) of ¢(x) throughout a small sphere 6 in 5 with center 
at Q. Now suppose Q is on Yo-‘S. Then ¢(z) is single-valued and analytic in 
the part outside Y» of a neighborhood of Q. According to a theorem’ resulting 
from the work of F. Hartogs and E. E. Levi,’ ¢(z) is analytically continuable, 
say by ¢(z) defined throughout a small spherical neighborhood 6 of Qin ‘f. We 
deduce easily that a sphere smaller than Y» will be reachable, a contradiction. 
Hence if there is any such sphere Yo, Case I cannot hold. 

Case Il. Not Case 1. There is at least one point Qo on C such that f(x) and 
¢(x) are unequal at some points near Qp outside of Xo. Then none of the part of 
C near Q)can be outside of Xo, for if it were, f(x) would equal ¢(7) where both are 
defined near Qo, as follows from the uniqueness of analytic continuation. Since 


6 R+ Sis the set of points each of which is in at least one of the sets R and S. Nota- 
tions of topology will be as in 8. Lefschetz, Topology, Amer. Math. Soc. Colloquium Pub- 
lications, vol. 12, New York, 1930, (Lefschetz 1). 

7 Osgood II, Chapter 3, §10, Zusatz. 

® Loc. cit. 
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the cells of C as a complex are planar, it follows that none of them of positive 
dimension can meet Yo near Qo, so that Qo is an isolated point of X»-C. The 
nearby points outside of or on Y» must then belong to ‘J, as otherwise we could 
not have Case II for Qo. 

In the following we use the property that certain closed loci are complexes in 
the sense of analysis situs, after a proper subdivision of the entire configuration 
into cells.® 

Let Fo denote the set obtained from 2»-C by removing each of the points at 
which the distance from P on C has a maximum. Consider the class (S) of 
(2n — 1)-spheres with center at P such that S is in the class, if the maximal 
connected set containing Qo which is a subset of the part of C not interior to S also 
contains points of Fo. Let So be the (2n — 1)-sphere with center at P whose 
radius is the least upper bound of the radii of the spheres of this class. Since C 
is closed, So is itself in the class, and it is the largest sphere of the class. 

Let D, be the closure of the maximal connected part of C — C-S» which con- 
tains Q. Since C is a (2n — 1)-cycle (mod 2), the chain boundary £, of D, is 
on So, and hence is a (2n — 2)-cycle of So. Then D, plus each of the two 
(2n — 1)-chains of So bounded by £; is a (2n — 1)-cycle, and hence bounds a 
limited chain in 2n-space. Let H, denote the closed locus of that one of the 
two 2n-chains which contains no cells interior to So. It is not hard to show 
that no point of Ho is outside Xp. 

Since the distance from P on C has a maximum at Qo, the part of C near Q) is 
the part near Q» of the boundary of only one of the 2n-cells used in determining 
A, and that one, say K, is in H,, since no point of H; is outside X». Also the 
part not on K of a neighborhood of Q) must be in ‘J, since the part outside 
X» of a neighborhood of Q is in ‘f. Consequently, if we let 7; be the part not 
interior to So of the set obtained by adding to 7) those points of H, not already 
in To, Qo is an interior point of T;. Let C; denote the closure of the part out- 
side So of the boundary of 7}. 

We now consider the auxiliary problem of the analytic continuation over 
T; + 5 of the values of f(x) as given near C;. 

Let us again consider spheres = with center at P and radii larger than that of 
So. Proceeding as in the earlier part of our proof, suppose first that for the 
auxiliary problem Case II does not arise for any of these spheres. The analytic 
continuation can be carried as far as So, and whether or not Case II arises at So, 
we can have a single-valued analytic function y(x) defined over a domain con- 
taining T;. Since the part of C not inside Sp is a complex, and Sp is in class (S), 
we can obtain a curve on C joining Q to a point Z of Fy and not passing into 
the interior of Sp. Since there are points near Z and outside 2» on C, hence on 
C,, Z ison C;. Therefore ¥(z) = f(z) near Z. By the uniqueness of analytic 


§ B. O. Koopman and A. B. Brown, On the covering of analytic loci by complezes, Trans. 
Amer. Math. Soc., vol. 34 (1932), pp. 231-251 (Theorem 6.I and Lemma 3.1). For another 
treatment, see 8. Lefschetz and J. H. C. Whitehead, Analytical complezes, ibid., vol. 35 
(1933), pp. 510-517, and Lefschetz I. 
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continuation along the curve back to Qo, ¥(z) must equal f(z) near Q. But 
¥(z) is analytic near Qo since Qo is an interior point of 7, and it equals ¢(z), 
where the latter is defined near Qo. Consequently f(x) and (7) must be equal 
where both are defined near Qo, contrary to the hypothesis that Case II arises 
at Qo. 

It is thus seen that Case II must arise for the auxiliary problem for 7), say at 
a point Q:, with Q; outside of X». Let =, be the (2n — 1)-sphere with center at 
P and passing through Q,, and ¢;(z) the single-valued analytic function defined 
over the part of T; + 5 outside of 3;, with ¢:(z) = f(x) in the part of 5 outside 
>, and not in 7;. Let F; be the set obtained from 2,-C; by removing each of 
the points at which the distance from P on C; has a maximum. We now dis- 
tinguish between two further subcases. 

Case Ila. Point Q: cannot be connected to any point of F, by a curve on C, not 
passing interior to So. We denote by Dz the closure of the maximal connected 
part of C, — C,-So which contains Q;. Next we determine a set H, bounded 
partly by De, just as H; was determined above, and add to T, those points of H»2 
not already in 7, calling the resulting set T2. This will make Q, an interior 
point of 7:, and we then proceed as before in a new auxiliary problem, with 
the process of continuing analytically as far as So if possible, thus obtaining a 
contradiction at Qo. 

Case IIb. Point Q:, while in Case II, is not in Case IIa. We let S, be the 
(2n — 1)-sphere with center at P of maximum radius such that Q, can be joined 
to a point of F; by a curve on C;, not passing into the interior of S;. We proceed 
in this case as we did above in the first consideration of Case II, with Qi, S:, F:, 
C, taking the réles of Qo, So, Fo and C, respectively. At no later step of this 
treatment of Case II will it be necessary to consider any sphere smaller than 8, 
which of course is at least as large as So. 

The procedure is now clear. After a finite number of steps we must obtain 
a contradiction, because each Q; is a 0-cell of the original C, and there is only a 
finite number of the latter. It follows that Case II cannot arise. Hence Theo- 
rem 2.1 is true. 

THEOREM 2.11. Theorem 2.1 remains true if the word “analytic” is replaced by 
“‘meromorphic”’ .° 

Since the proof is exactly similar to that of Theorem 2.1, we omit it. 


3. Functions of one real and n complex variables. We shall prove a 
theorem similar to Theorem 2.1. As a preliminary step we now extend that 
theorem to the case in which parameters are involved. 

Tueorem 3.1. Let be a limited domain, with connected boundary B, in the 
2n-space of the n complex variables 2, +++, 2n,n > 1, and f(xy, +++ 5 2ny Vy --* 5 
Yr) = f(x, y) a function single-valued and analytic in the cylindrical region for 
which (x) is in an open set 5 containing B and (y) in a region © in complex 
(y)-space. Then the analytic continuation of f over ® for each point (y) deter- 


1° See footnote in introduction referring to this theorem. 
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mines a function analytic in 21, +--+ , Yp in the cylindrical region for which (x) is 
in R + S and (y) is in V. 

We begin as at the beginning of the proof of Theorem 2.I, and let C and J be 
defined as in that proof. In the proof of Theorem 2.I we showed that for any 
fixed (y) in 0, f has a unique analytic continuation ¢(z, y) over all of 6 = J+ §. 
Thus ¢(z, y) = f(z, y) in 8. 

Let P with coérdinates (x) be in J and (y) be in ©. We must prove that 
¢(z, y) is analytic at (2°, y°). Let p be the 2-plane in (x)-space parallel to the 
2-dimensional z;-plane and passing through P. The set p-J contains a finite 
number of regions on p, one of which, say “U, contains P. Let ‘W be the region 
obtained by adding to ‘U all the isolated points of the boundary of Uon p. If L 
is the point set boundary of W on p, then W + L is easily shown to be a complex, 
and L is thus the locus of a l-cycle. Of course L need not be connected. 

Let J(L) denote the cylindrical point set consisting of the points of (x)-space 
with x; = any value on L, and 22, --- , z, near (z},---,2°). Since L is on C, 
¢(z, y) is analytic in a domain with (x) near J(L) and (y) near (y°). Let LZ; 
denote the projection of L on the z:-plane. Now consider the function 


1 S(t, La, +++ 5 Lny Yry +++ y Yo) 
_ Dri [. t— m * 





the integral being taken in the positive sense over L; as the boundary of the 
projection W, of W. Here x2, --- , tn, yi, --- » Yp are regarded as parameters, 
and the values taken are those of f with (z) near J(L). Evidently (3.1) defines 
an analytic function of (x, --- , ny Yr, --* » Yp) for (x, y) near (2°, y°). For 
fixed (y), @ is known to be analytic in (x) over &. Therefore for fixed 
(ze, ttt yTny Yr -° » Yr) near (x2, eis »%, yi, cee »Y>) ’ ¢ is analytic in 7 for v1 
over W, + Ly, since the locus in question will be in 6. Therefore the values 
determined by (3.1) are those of ¢(z, y). Consequently ¢ is analytic at 
(x®, y). The theorem follows immediately. 

TueoreM 3.11. Let & be a limited domain, with connected boundary B, in the 
(2n + 1)-space of the real variable y and the complex variables x, --- , Zn, n > 0, 
and f(x, --+ , tn, y) a function single-valued and analytic in an open set 5 of the 
(2n + 2)-space of the complex variables x, ---, Zn, y, containing B. Then f 
admits a single-valued analytic continuation throughout R + D, where D is the 
part of 5 for which y is real. 

This means, of course, that f has an analytic continuation over a domain in 
(2n + 2)-space containing R + D. We may for the most part restrict ourselves 
to real values of y. The proof follows. 

Let (a) denote a set of axes in the (2n + 1)-space, none of which is perpen- 
dicular to the y-axis. We divide (2n + 1)-space into (2n + 1)-cubes with edges 
parallel to the axes of the (a) system. As in the proof of Theorem 2.1, we let A 
be the locus of all of the closed cubes having points on R = R + B. Under 
definitions like those of that earlier case, C, part of the boundary of A, is the 


ee 
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connected locus of a 2n-cycle, with each point of C accessible from infinity by 
a curve not meeting C, and ‘Sis the limited domain bounded by C, with R a 
subset of S. We let Ty) denote J+ C,and& = 7%+D= 549. 

Remark. If a point Q of intersection of a plane =: y = real constant with C 
is not an isolated point of intersection, then C contains points near Q on both sides 
of =. This is an easy consequence of the fact that the y-axis is not perpendicular 
to any axis of the (a) system. 

If continuation over & were impossible, there would exist a smallest value, e, 
of y, such that there would be a single-valued analytic function ¢(z, y) defined 
over the part of & for which y > e, and with ¢(z, y) = f(z, y) in the part of D 
not in T) for which y > e. Let 2» be the plane y = e. 

Let Q be any point of X»-J. Plane Xp» intersects ‘f in a finite number of 
regions on Xo, and we let ‘U designate that one of those regions which contains Q. 
Then ‘U determines a 2n-chain which is bounded (mod 2) by a (2n — 1)-cycle 
whose locus H is not necessarily connected. - Let F denote the part of H consist- 
ing of all points of H accessible from infinity by curves on 2» not meeting H. 
Then, as in similar situations arising above, F is also the locus of a (2n — 1)- 
cycle, bounding a limited domain ‘f on 2 containing “U; and F is connected. 
We infer from the remark above that near any point Z on F there are points of 
C above 2» (where y > e). Therefore ¢(z, y) = f(z, y) above Zo near F. 

For a moment we consider separately the cases n > 1 and n = 1. 

If n > 1, we consider the analytic continuations of f(z, y) over parts of the 
planes y = e + 7 in (2, y)-space, with » any complex number near zero. For 
each » we continue f(z, y) over the part of the plane which projects onto F and 
F, using Theorem 2.1. Then the continued function y(z, y) is defined in the 
(2n + 2)-space in a neighborhood of Q among other points. Since it is easily 
seen that the hypotheses of Theorem 3.I are satisfied, where y is the parameter 
and B and & of Theorem 3.I are taken as F and ‘, respectively, of the present 
proof, it follows that ¥(z, y) is analytic near Q. Let us join Q by a curve l 
on U to a point Zon F. Then f, ¢ and y are defined and all equal at and near 
the points of C near Z where y > e. The analytic continuation of y along and 
near | from Z to Q must equal ¢ in the part above 2p of a region containing l. 
Therefore ¢ is analytically continuable throughout a neighborhood of Q, by 
means of the function y(z, y). 

If n = 1, let U be the set obtained by adding to ‘U all its isolated boundary 
points Y, so that H is the boundary of ©. Let Hi and ?, be the projections 





of H and 7° respectively, on the z;-plane, and ¥(z, y) = zs ftv) dt, an 
2mt Juz, t— 2% 


analytic function near Q. But for real y > e and y — e small, f(t, y) = o(t, y), 
and ¢ is analytic for x; on H,; or 0. Hence for x; in C; and y > e with y — e 
small, y¥ = ¢. Thus in this case also we obtain a proper analytic continuation of 
¢ in a neighborhood of Q, by means of the function" y(z, y). 

CaseI. Whenever a point Q is on Yo-C and there are points of T, above Xo near 


11 This paragraph uses in part the method of Severi, loc. cit. 
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Q, f(z, y) = o(z, y) above X» nearQ. From the proof above it then follows that 
in Case I we could continue analytically throughout a neighborhood in (2n + 2)- 
space of each point Q of Yo- To, always getting proper values near C. It then 
follows easily that the number e used in the definition of 2» could be replaced by 
a smaller number if Case I held, a contradiction. 

Case II. Not CaseI. There is at least one point Q) on C such that f(z, y) 
neighboring Qo does not equal ¢(z, y), where the latter is defined in that neighbor- 
hood. None of the part of C near Q is above the plane y = e, for if this were the 
case, the values of f(z, y) near Qo would have to equal those of (2, y) at and 
near the nearby points of C above Yo, and by the uniqueness of analytic continua- 
tion f(z, y) would provide a continuation of ¢(z, y) at Qo which would satisfy the 
conditions of Case I, contrary to hypothesis. Hence Q is isolated as an inter- 
section of X» and C, and y has a maximum at Q) on C, since if this were not the 
case it would follow from the remark above that there would be nearby points 
on C above the plane ). 

From this point on the proof runs exactly like that of Case II in the proof 
of Theorem 2.1, with the planes y = constant taking the place of the spheres 
with center at P in that proof, and with the remark above used occasionally. 
We need not repeat the details. It follows that Theorem 3.II is true. 


4. Analytic homeomorphisms. We begin with a topological property which 
is used in the final theorems. 

TueoreM 4.I. Let a self-compact point set A in a Hausdorff space be mapped in 
single-valued and continuous fashion on a set B in a Hausdorff space. If (i) the 
map is a homeomorphism for a neighborhood 2 on A of each point P of A and the 
image of 2; (ii) this image always forms a neighborhood on B of the image of P; and 
(iti) each point of B is either the image of only one point of A or can be joined by a 
curve * on B to such a point; then the map sets up a homeomorphism between A and B. 

The proof is similar to that in the more familiar case where B is assumed to 
be simply-connected. 


Derinition. A homeomorphism set up between a locus A in (&, --- , &m)- 
space and a locus A’ in (£;, --- , &,)-space will be called analytic if defined by 
relations £; = f (£1, --- , m) (j = 1, --- , m) where the functions are analytic in a 


domain containing A and the jacobian does not vanish on A. This definition 
is used both when the ¢’s are real and when they are allowed to be complex. 

Tueorem 4.11. Let R be a limited domain in the 2n-space of the complex 
variables 21, --- , Zn, 2 > 1, with connected boundary C. Let the equations w, = 
Silar, +++, tn) = Se(x) (k = 1,--- , n) set up an analytic homeomorphism between 
C and a locus C’ in (w)-space. The analytic continuations over R of the functions 
filz) determine an analytic homeomorphism between R + C and the image in 
(w)-space. 

Let R = R+C. Weshall use Theorem 4.1, with B = R’, image of R = A. 

According to Theorem 2.1, the functions f, do admit single-valued analytic 


12 A continuous image of a line segment is meant. 
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continuations over R. Since the jacobian J is not zero near C, its reciprocal is 
analytic there, and hence can be continued analytically over R. Therefore J ¥ 0 
in & and it follows that (7) of Theorem 4.1 is satisfied. Next we consider (iii). 

Let Q be any point of R’. Draw a half-line (ray) through Q in any direction, 
and let Q:, possibly Q itself, be the nearest point to Q on the half-line which is 
not an inner point on the whole line, say L, of the intersection set of L with R’. 
Now no point of ® can map on Q,, since the fact that J # 0 would imply that R’ 
contained a neighborhood of Qi, from which a contradiction to the choice of Qi 
would follow. Hence Q; must be the image of a point of C, hence itself on C’. 
Since C and C’ are in one-to-one correspondence it follows that Q, is the image 
of only one point of R. Hence (77) of Theorem 4.1 is satisfied. 

As for (7), this follows from the fact that J ¥ 0, if Pisa point of &. It will 
also follow when P is a point of C provided that we show that points outside a 
neighborhood 2 of P are imaged on points outside some neighborhood of its 
image,say P’. Butif this were not the case, we could find a limit point Q on R — 
NM having P’ as image, contrary to the fact already proved that a point on C’ is 
the image of only one point of R. Thus all the hypotheses of Theorem 4.I are 
satisfied, and it is seen that Theorem 4.II is true. 

A closed connected non-vacuous locus S (a continuum) in the space of the real 
variables £1, --- , &m will be called a regular analytic (m — 1)-spread if neighboring 
each point of S it coincides with the locus of an equation of the form $(é, --- , 
£m) = 0, where ¢ is real and analytic near P and $7, + --- + $;,, ¥ 0 there. 

TueoreM 4.III. Let & be a limited domain in the (2n + 1)-space of the real 
variable y and the complex variables 21, --+ , Zn, n > 0, with connected boundary C. 
Let 5 be a domain in complex (x, y)-space containing C, and D be the part of 5 for 
which y is real. Suppose the equations w; = f(a, --- , tn, y) (9 = 1, 2,---, 
n + 1) set up an analytic homeomorphism between C and a set C’, and transform D 
into a set on a regular analytic (2n + 1)-spread M. Let R' denote the transform of 
R = R + C under the analytic continuations over R of the functions f (x, y). 

Then R’ lies on M. Further, if R’ does not cover all of M, R’ is homeomorphic 
with R. 

For example, M might be a sphere. If M is a plane, then the final hypothesis 
is necessarily satisfied and hence need not be imposed. 

We first prove that, under the analytic continuations over % of the f’s, which 
by Theorem 3.II exist, the image of R is entirely on M. By hypothesis this is 
true for points near C. Now suppose there were a point Q; on R whose image is 
not on M. We join Q; to a point Q2 of C by a line on R. Let Qs be the point 
on that line farthest along from Q2 towards Q,; such that each point of the part of 
the line on the side of Q; towards Q, has a neighborhood on R all of whose points 
are mapped onto points of M by the transformation. Since M is closed, the 
image Q; of Q; ison M. Since M is a regular (2n + 1)-spread, neighboring Q; 
it is the locus of an equation $(t, v1, --+ , Unsi, Unga) = 0, with ¢ real and ana- 
lytic, where w; = u; + wv; (j = 1,---,n+ 1). Now the equations of the 
transformation give the u’s and the v’s as real analytic functions of rn, th, --- , Ta, 
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tny Tat) tenga, Where 2; = 7; + tt; (7 = 1,---,n) andy = rags + tags, SO that o 
equals a function analytic in 7, --- , tr41 for (ri, --- , tng) near Q3. Then ¢ = 0 
at each point of an open set of a neighborhood of the projection of Q; on the 
space of 7, ti, +++, Tny tny Tn41 and hence is zero in the entire neighborhood. 
Therefore Q; cannot be the point farthest along the line from Qe satisfying the 
condition stated. We conclude that the image of R is on M. 

The rest of the proof is similar to that of Theorem 4.II, aside from some con- 
siderations which we now mention. 

It is easily shown that any connected part of M is connected by curves, by using 
the fact that any point of M has a neighborhood on M which is a (2n + 1)-cell. 
The “half-line through Q” of the proof of Theorem 4.II is replaced here by any 
curve through Q on M joining Q to a point of M which is not in R’. In proving 
that hypotheses (777) and (iz) of Theorem 4.I are satisfied, we use the Brouwer 
theorem of invariance of regionality," for the dimension 2n + 1. 


CotumBIA UNIVERSITY. 


13 See Lefschetz I, page 100. Use of the theorem of invariance of regionality could be 
avoided by an argument involving jacobians. 











TRANSFORMATIONS OF MULTIPLE SEQUENCES 
By Hue J. Hamitton 
§1. Introduction and definition of notation 


1.1. Notation. In order to treat n-tuple sequences with any degree of 
facility, it is necessary to introduce an abbreviated notation. The present paper 
uses one defined as follows. 

The single letter m will denote an ordered set of n positive, integral variables, 
and k another such, homologous to m. A fixed value-set for m will be denoted by 
r, and the 7-th of an infinite sequence of such sets by m;. The symbols p and k,; 
are to be interpreted in an analogous sense with respect to k. 

Generic representation for conjugate, proper, ordered subsets of any of these 
sets is to be obtained by affixing the superscripts 1 and 2, respectively, to the 
symbol denoting the set, and further subsets of like character with respect to 
either of these are to be represented by adjoining to the present superscript 
further numbers 1 and 2, respectively, etc. ‘Two sets whose symbolic representa- 
tions involve the same superscript are to be considered homologous. When the 
implication of this homology is not intended, the numbers 1 or 2 are replaced by 
3 or 4, respectively, in one of the symbols. Thus k*, k* are conjugate, but homol- 
ogously independent of m', m’. 

A single element of k will be denoted generically by « (or A), and a fixed value of 
it by x. The corresponding element of m will be represented by yu. 

All other letters are to be interpreted in the customary sense. 

By relations like k"® = p” or m; > m;_, are to be understood all sets of relations 
of the same form between corresponding elements of the two sets. In particular, 
the equation k = p(m) is equivalent to the set of n equations x = r(m). The 
notations k* = 1, 2, --- or m' = M imply the corresponding range of variation 
for each separate element of the set. However, inequalities like k <¢ M mean 
simply that not every element in the set is less than or equal to M. 

Except when, by the nature of the situation, such would obviously be absurd, 
all relations involving subsets of k or of m are to be understood as implying the 
set of such relations for all possible choices of such subsets (with respect to posi- 
tion and, except when the subset consists only of « or wu, with respect to dimen- 
sion). 

Received July 8, 1935. Presented to the American Mathematical Society, September 13, 
1935. I am indebted to Professor C. R. Adams for suggesting the problems considered in 
this paper. A problem bearing a close analogy to the one considered here is treated in an 
article entitled On transformations of double series, which is expected to appear in the 


February, 1936 number of the Bulletin of the American Mathematical Society.. Reference 
to this article may prove helpful to one who wishes to read carefully the present paper. 
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1.2. Principal types of sequences considered. The sequence {s;} is said 
to be ultimately bounded' (abbreviated ub) if there exists a number Q such that 
8, is bounded for all k >Q; bounded (b) if in the preceding case Q can be taken to 
be zero; convergent (c) if lim s,; = 8s (read “principal limit’’) exists finite; 

kw 


bounded convergent (be) if both b and c; ultimately regularly convergent (urc) if c, 


and if there exists a number Q such that lim s,; = 8,2 (read “row limit”) exists 
kl 


finite for all k? > Q; regularly convergent? (rc) if in the preceding case Q can be 
taken to be zero; bounded ultimately regularly convergent (burc) if both b 
and ure. 


1.3. Nature of the transformation. A matrix || a»; || (m, k = 1, 2, --- ) 
is to be considered. By means of it the sequence {s,} is transformed into the 


Ko} 
sequence {om}, where om = >, Gme8z. Such a transformation is said to be 
k=1 


of infinite reference, and the matrix square. If an, = 0 for all « > x(m) 
p(m) 

(m = 1,2,---), then om = >> @mx8x, and the transformation is said to be of 
k=1 


finite reference, and the matrix row finite. If, in particular, r(m) = u, then 


om = >, Amex, and the matrix is said to be triangular. 
k=1 


1.4. Problems suggested. (i) If {sx} is of a specified one of the types de- 
fined in §1.2, under what conditions on |! a,,x || will {¢,} be of a specified one of 
these types? (ii) If {s,} is of a specified one of the last five types (involving 
convergence), under what conditions will {¢,,} be of a specified one of these 
types, with o = s? (The corresponding transformations and their matrices 
will be called regular.) (iii) If {s,} is of a specified one of the last three types 
(involving regular convergence, either complete or deferred), under what con- 
ditions will {¢,,} be of a specified one of these types, with o,: = s,. for all k? 
sufficiently large? (The corresponding transformations and their matrices will 
be called ultimately row regular.) (iv) If {s,} is re, under what conditions will 
fom} be re with o, = 8, for all k?? (The corresponding transformation and its 
matrix will be called row regular.) 


1.5. Auxiliary types of sequences. In order to attack these problems it is 
convenient to define under the classes of sequences listed in §1.3 several special 
types. Thus under the last five classes are introduced the corresponding null 
sequences (s = 0), abbreviated cn, ben, uren, ren, buren, respectively; under 
the last three classes, the corresponding ultimately row null sequences (s,. = 0 
for all k* greater than some number Q), abbreviated urern, reurn, burern, 


!R. P. Agnew, American Journal of Mathematics, vol. 54 (1932), p. 648. 
2G. H. Hardy, Proceedings of the Cambridge Philosophical Society, vol. 19 (1916-1919), 
p. 88. 
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respectively; under the re type, row null sequences (s,. = 0 for all k*), abbre- 
viated rern. 


1.6. Notation for the transformation. The process of transforming will be 
indicated by an arrow. “Necessary” will be abbreviated by N., “sufficient” 
by S., and “regularly’’, as applied to transformations (see 1.4 above), by reg. 
Thus 8.c — re reg reads ‘“‘a set of conditions sufficient that every convergent 
sequence be transformed into a regularly convergent sequence with preserva- 
tion of the principal limit’. 


1.7. Existence of the transform. Since under the infinite reference trans- 
formation ¢,, may not even exist for certain values of m, it becomes necessary to 
speak of the class of sequences, all of whose elements exist. Such sequences are 
called existent (abbreviated e). In the present paper all proofs of the necessity 

of conditions are based on the assumption that {¢,,} is e, though in the sufficiency 
' proofs this requirement is not made (unless implied by the nature of the trans- 
formation). 


1.8. Earlier literature. It appears that the questions proposed in (iii) and 
(iv) of §1.4 have thus far escaped attention, and that only a few of those in 
(i) and (ii) have been treated. Furthermore, of the four papers about to be 
cited, only one considers sequences of dimensionality greater than 2, and three 
contain slight errors in several of their conclusions. The nature of the results 
of these papers (in so far as they bear upon the problems suggested in §1.4) are 
indicated below. Unless otherwise stated, conditions will be lettered as in §3 
of this paper. 

Hallenbach? establishes conditions N. and 8. for the transformations ec — (e 
and ¢c), b — b, be — be (square matrix, n = 2). The conditions are equivalent 
in each case to the corresponding conditions in the present paper.‘ 

Kojima‘ finds conditions which he asserts to be N. and S. for en — en, cn > ¢, 
c—c,ce—c reg, be > ¢,§ re > c, re > re (triangular matrix, n = 2). In all but 
the last set of conditions, (¢,) should be replaced by (b:), Kojima’s proof of his 
inequality (6) being incorrect.’ Also, the word “converges” in condition 3° of 
his last set’ should be replaced by “converges regularly”’. 


3 Hallenbach, Zur Theorie der Limitierungsverfahren von Doppelfolgen, Dissertation, 
Bonn, 1933. 

* Hallenbach, loc. cit. His conditions (B), (C) (first part) and (D) on p. 12 are equiva- 
lent to conditions (b;) and (d,) of this paper. 

5 Kojima, On the theory of double sequences, Tdhoku Mathematical Journal, vol. 21 (1922), 
pp. 3-14. 

* Kojima, loc. cit., p. 12, Theorem V. Obvious typographical errors in the list of con- 
ditions can be remedied by studying the context. 

7 Kojima, loc. cit., p.5. The choice indicated in lines 6 and 7 is not necessarily possible. 

§ Kojima, loc. cit., p. 14. 
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Leja® asserts conditions N. and 8S. for be — ¢ reg, be > c, ec > ¢ reg, c > 
(triangular matrix, n = 2). He then states conditions N. and 8. for the same 
transformations (square matrix, n = 2), under the assumption that || an; || 
satisfies (a;) in all cases, and (ae) in the last two. (As shown below, these condi- 
tions are actually N. that {c,,} bee.) Finally, he states with partial proof con- 
ditions N. and S. for the same transformations (triangular matrix, n = n), under 
the assumption in the last two cases that the matrix satisfies an added condition 
equivalent to (bz), which is proved to be N. in §5 below. Im all cases, (c;) should 
be replaced by (b:), since Leja’s proof of the necessity of his condition 3° is in 
error.” 

Robison" asserts conditions N. and S. for be — ¢ reg (triangular matrix, n = 
2); be — (e and c reg) (square matrix, n = 2). In both cases (c;) should be re- 
placed by (b;), the proof of Robison’s inequalities (4) being at fault." He cor- 
rectly states conditions N. and 8. for be — be (triangular matrix, n = 2); be > 
(e and be) (square matrix, n = 2). Conditions are also stated to be N. and S. 
for be — c, with o a function of s only (triangular matrix, n = 2); and for be > 
(e and c), with o a function of s only (square matrix, n = 2): in the first set (c) 
should be replaced by (b;) and in the other (b,) should be added. Finally, con- 
ditions are stated to be N. and S. for b — be (triangular matrix, n = 2); b — (e 
and be) (square matrix, n = 2), both sets being incorrect: the first can be rectified 
by replacing Robison’s second condition by (c:), and the second by replacing his 
first condition by (c:) and omitting his last. 


1.9. Scope of the present paper. In this paper are found conditions N. 
that a sequence of any specified one of the types listed in §§1.2 and 1.5 have an 
existent transform which is of a specified one of these types, and conditions S. 
that the transform of such a sequence be of a specified one of these types, with- 
out being necessarily defined for all m. The hypothesis of existence of the 
transform is of course implied when the latter is bounded, and the manner of 
obtaining additional conditions to insure that the transform exist in any case will 
appear. 

In the sense of the preceding paragraph, then, the question (i) of §1.4 is com- 
pletely answered. The answer to (ii) is given in §7. For the special cases of 
ultimately row null and row null sequences, questions (iii) and (iv) are answered, 
though the complete solutions seem at present to offer difficulties. 

§2 below contains certain preliminary lemmas and some remarks on the proofs 
to follow. In §3 are listed the conditions to be used, various implications of 
which are indicated in §4. N. proofs are given in §5, and S. proofs in §6. A 


* Leja, Sur les transformations linéaires des suites doubles et multiples, Bulletin Inter- 
national de |’ Académie Polonaise, Classe des Sciences, A, (1930), pp. 1-10. 

1° Leja, loc. cit., p.4. The statement in lines 6 and 7 is untrue. 

1 Robison, Divergent double sequences and series, Transactions of the American Mathe- 
matical Society, vol. 28 (1926), pp. 50-73. 

#2 Robison, loc. cit., p. 55, line 13. Cf. footnote 7. 
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few remarks on interpretation of results and a list of conditions for regularity in 
§7 conclude the paper. 


§2. Preliminary observations 


Under the hypothesis that re sequences are e, the several types of sequences to 
be considered are related as shown in the diagram, the arrow indicating implica- 
tion of the quality at its head by that at its tail. 


" 
eS 
2 


BURCN URCRN 
iit. 
He rn 
— 
RCRN 


These relations are sufficiently clear, save perhaps that regular convergence 
plus existence implies boundedness, but this is an immediate corollary of the 
theorem below on uniformity of row convergence. 

Tacit use is made throughout of the fact that, if uC U and v C V, where 
u, U, v, V represent classes of sequences, then N.u — V is at once N.(u, U) > 
(v, V) (four cases), and 8.U — v is at once 8.(u, U) — (v, V). 

In accordance with the remark in the last paragraph of §1.1, it is to be noted 
that most of the conditions in §3 are equivalent to sets of several conditions. 
Thus (f,) implies n(2" — 2) separate conditions depending on the various pos- 
sible positions of x, the dimensionalities of m? (1 to n — 1), and the various ways 
in which m? can be chosen positionally from m for fixed dimensionality. How- 
ever, in necessity proofs, where denial of such a condition implies denial for a 
particular positionality of x and a particular positionality and dimensionality of 
m?, these qualities are necessarily assumed fixed throughout any given theorem. 
If certain variables are given particular values in the course of a proof, this fact 
will be indicated in parentheses (or brackets), and subsequently parentheses (or 
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brackets) will enclose the generic symbol involved. Thus: “With (m! = r') and 
(k® = p*), lim > Oma) = Lay’. 


mio k4 


It is often convenient to decompose the operator >> as follows (R being an 
k=1 
arbitrary positive integer) : 
oo R oe 
(.001) x= oF. SF ety Fe . 
kim] k*=R+1 k=1 k=R+1 


Here the free }> indicates summation over all manners of choice of k' from k, both 
dimensionally and positionally. Another decomposition is the following: 


(0) PETE Reco Es Fe, 


where the free >> sums over all manners of positional choice of the summation 
index immediately following, for dimensionality defined by the summation index 
first preceding. Similar decompositions of }> occur, ete. 
ktm 
A theorem on uniformity of row convergence of re sequences is now given. 
(.003) THrorem. [f 8, is re, then, given any « > 0, there exists R = R(e) such 
that 


(.0001) |s — S| < 2efork®?>R (ki! = 1,2,---) 
From the convergence of s; follows 
(.0002) |s. — 8| < efork > R. 


Since s; is rc, there exist R; < Rz < --- < R, such that 
|& —&:| <efork®? > Rin 


(.0003) 
(Ki=1,2,---, Ri t=1,2,---,n—1). 


Let p' be arbitrary. Since the dimensionality of p' is not greater than n — 1, 
at least one interval (R; + 0, Rin) (¢ = 0,1,---, m — 1; Ro = 0) contains 
no element of it. If p' S R;, from (.0003) it follows, with (k' = p'), that 
| 8q, — 8» | < ¢€ for > Ris; if p' > Riss, from (.0002) it follows that 
|8q) — 8, | < 2efork®? > R,; if p" S Ry and p® > Rj, from (.0003) it follows 
that | s,, — 8,| S | sq) — 8! + |18, — 8] < 2efork® > Rix. Hence 
R = R, satisfies (.0001). 

In the course of the proofs in §5 it becomes convenient to use the oscillation at 
infinity of a function f(m), defined thus: ose f(m) = lim | f(m) — f(r)|. At 


once it follows that osc f(m) — ose ¢(m) Son tf(m) + “o(m)} s ose f(m) + 
ose ¢(m); ose f(m) S 2 lim | f(m) 7 ose C-f(m) = C. ena fen) end 0. and 8. 


lim J(m) exist finite is: ose ye 


mn 
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Use is made in §6 of the following set of sufficient conditions that lim 
S(m) = ¢, namely: f(m) = ¢(m, R) + ¥(m, R), where R is an arbitrary positive 
integer; lim ¥(m, R) = 0; lim ¢(m, R) = $(R) for each R; lim ¢(R) = ¢. 


m,R-2 moo R-?-@ 


§3. Conditions on the matrix 
Existence conditions. 


(ay) D lan| < (m = 1,2, ---). 


(a2) Let «x, \ be any two single elements of k, and k? those remaining. Then 
Qnk = 0 for \ > C,(m) (ke = 1,2,--- ;m=21,2,--- se = 1,2,---). 
UB conditions. 


(b,) > |ane| < A form> B. 
k=1 
(bz) Let x, \ be any two single elements of k, and k* those remaining. Then 
Qnk = Ofor m,r > C, (4 2 1,2,---3;e421,2,---). 


B conditions. 
(e1) > lan| <A (m = 1,2, ---). 
k=1 


(ce) Let x, \ be any two single elements of k, and k* those remaining. Then 
Qnk = 0 for X > c (k? = 1, 2, --- ; m = 1, 2, eee ;K = 3 cece ). 


C conditions. 


(di) lim Qnk = Ay (k = 1, 2, eee >: 
(ds) lim > ay = Ly (FP = 1,2 «--). 


mo ki] 


(ds) lim >> an = L. 

mo kel 
(d4) There exist numbers a, such that, if x is any single element of k, and k 
those remaining, then 


lim >> lan, — ax| = 0 (x = 1,2,---). 
m0 kien] 
(ds) There exist numbers a, such that 


lim >> lane — ax| = 0. 


mo k=l 
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CN conditions. 
(d,) (dy), with a, = 0 Ge ih.:.% 
(dz) (dz), with Ly. = 0 (@ = 1,3,---). 
(ds) (ds), with L = 0. 
(d.) (d,), with a, = 0 (k = 1,2,---). 
(ds) (ds), with a, = 0 eS on 
URC conditions. 
(e) lim a,,. = @,,,, for m' > D (k = 1,2,---). 


2 
m2 


(e}) Let « be any single element of k*, and k*? (which may in this instance be 
null) those remaining. Then 


lim >> a,, = Lys for m' > E, (kK = 1,2,---;« =1,2,---). 


m*—o kt=] 


(ez) lim >> a, = Lyx: for m' > E (k* = 1,2,---). 


m2 kt=] 
(es) lim > ank = Lm for m' > F. 

m2 k=1 
(ei) There exist numbers a... such that, if « is any single element of k, and k* 
those remaining, then 


a 


lim > | aa — Gnu | =Oform>G. («=1,2,---). 


min kt=1 


(e,) There exist numbers a,x such that, if x is any single element of k, and k* 
those remaining, then 


lim >> |anz — Qnx| =Oform'>G («x =1,2,---). 
mina kt=1 


(es) There exist numbers a,., such that 


lim > | Qmk — mx | = 0 form >H. 
mia k=l 


URCRN conditions. 
(&) (e:), with anuz = 0 for m' > D (k = 1,2,---). 
(@3) (e}), with Las = 0 for m' > E, 


(kK? = 1, 2, --- k= 1,2,---). 
(&2) (e2), with Law = O form > E (ke? = 1,2,---). 
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(&s) (e3), with Lu = O for m' > PF. 
(3) (e3), with amx = 0 for m' > G, 

(4 = 1,2,--- 3« 
(&,) (e4), with a,., = 0 for m' > G (k 
(és) (es), With aux = 0 for m' > A (k 

RC conditions. 
(fi) lim @n4 = Gmx for all m! (k 
mv 

(fs) lim > Ami = Lous for all m' (k* 


mio ki=1 


(fs) lim >> aux = Lm: for all m'. 


mo k=1 
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™ 1, 2, . ) 
ons 1, 2, ) 
= 1, 2, . ) 
= 1, 2, . ) 
= 1, 2, ' ) 


(f4) There exist numbers a,., such that, if « is any single element of k, and k* 


those remaining, then 


lim > | anc — Gmx| =O for all m' = (x 
mo kt=1 

(f;) There exist numbers a,,., such that 
lim >> | @ni — Gnu | = 0 for all m'. 


mo k=1 


RCRN conditions. 
(f1) (f,), with a,:% = 0 for all m' (k 
(fs) (f2), with Luss = O for all m' (K* 
(fs) (f3), with Lu = 0 for all m'. 
(F,) (f4), with anux = 0 for all m! (k = 
(fs) (fs), with amu. = 0 for all m' (k 


§4. Implications of the conditions 


For later use it is convenient to list the following relations, 
easily established. The symbol — indicates implication. 


(01) (by) + (a) + 5 var? @ 


(02) (by) + (ds) > = In| SA. 


= 1,2,---). 
= 1,2, - ) 
= 1,2,- ) 
= 1, 2,- ) 
= |, 2, - ) 


which may be 








(.03) 
(.04) 
(.05) 
(.06) 


(.07) 
(.08) 
(.09) 
(.10) 
(.11) 
(.12) 
(.13) 
(.14) 
(.15) 
(.16) 


(.17) 
(.18) 
(.19) 
(.20) 
(.21) 
(.22) 
(.23) 
(.24) 
(.25) 
(.26) 


EE 
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(bi) + (e:) @ SS | ane | S A for m' > B,D. 
k=1 


~ 


1 


(b:) + (e3) > SS | Laws | S A for m' > B, E, 


(by) + (e2) ~ D> | Laws | S A for m' > B,E. 


ki=1 


(ex) + (f:) ~ D5 | aux | < A for all m'. 
k=1 


(ex) + (f2) >» 35 | Lmas | S A for all m'. 
k=1 


(c:) > (a). 

(e:) — (b:), with B = 0. 

(cz) — (ae), with C,(m) = C, for all m 
(ce) — (be). 

(be) + (di) — (da). 

(ds) — (d,) . 

(bi) + (ds) — (de), with Lu = bs ay, 


(ds) — (d,). 


(bi) + (ds) — (ds), with L = po ak. 


Each CN condition implies the corresponding C condition. 


(di) + (ds) — (ds) . 

(di) + (ds) — (ds). 

(ds) — (di) . 

(d,) — (d:) . 

(ds) — (ds) . 

(ds) — (dy) . 

(bs) + (e:) — (e%), with G, = max (C., D) 
(c2) + (e1) — (es), with G = D. 


(es) — (e}), with E, = E 

















(.27) 


(.28) 
(.29) 


(.30) 
(.31) 


(.32) 


(.33) 


(.34) 
(.35) 
(.36) 
(.37) 
(.38) 
(.39) 
(.40) 
(.41) 
(.42) 
(.43) 
(.44) 
(.45) 


(.46) 
(.47) 
(.48) 


(.49) 


TRANSFORMATIONS OF MULTIPLE SEQUENCES 


(bi) + (e3) — (e3), with E, = max (B,G) s(x = 
Lists = Lams (kK? = 
(e,) — (e:), with D = G. 

(bi) + (es) — (e2), with E = max (B, G), and 

Les = = eit (k3 = 
(ex) — (e3), with G, = G (x 


(es) — (es), with G = H. 


(bi) + (es) — (es), with F = max (B, H), and L,. = 


1, 2, --- ), and 
= 1,2,- ) 
= 1,2,- ) 
= 1,%--.) 


wo 
} Amik - 
k=1 


Each URCRN condition implies the corresponding URC condition, 


with D, E, etc., replaced by D, E, etc., respectively. 
(&:) + (e3) > (@}), with G, = max (D, G,) (x 
(&) + (es) — (@), with G = max (D, G). 
(&) + (es) — (@s), with A = max (D, H). 
(@:) — (@)), with E, = E (x 
(83) — (@)), with BE, = G, (x 
(&,) — (&), with D = G. 
(&) — (@), with E = G. 
(&,) — (83), with G, = G (x 
(@s;) — (@s), with F = 7. 
(&s) — (&,), with G = A. 
(c2) + (f:) — (f,) . 
(f.) — (fi) . 


(cx) + (td) > (fs), with Lan = > un (k* = 


kt=1 


(fs) — (f4) . 
(cx) + (fe) — (fe), with La = 3 aus. 


Each RC condition implies the corresponding URC 
all m'. 


= 1, 2, : ) 
= 1,2,---) 
= 1, 2, = ) 
= 1, 2, . ) 
= 1, 2, . ) 


condition for 








40 HUGH J. HAMILTON 


(.50) Each RCRN condition implies the corresponding RC condition. 
(.51) (fi) + (fs) — (fa). 

(.52) (f1) + (fs) — (fs) . 

(.53) (fs) — (fi). 

(.54) (fs) — (f2) . 

(.55) (fs) — (fs) . 

(.56) (fs) — (fy). 


(.57) Each RCRN condition implies the corresponding URCRN condi- 
tion for all m’. 





(Brackets in the following conditions indicate application of the corresponding 
parenthesized condition to the matrix || an, |’, defined for all m, but only for 
k > Q, where Q is an arbitrary positive integer.) 


(.58) With the exception of those in the right-hand members below, each con- 
dition implies its bracketed counterpart. 


(.59) (di) + (de) — [d,]. 

(.60) (di) + (dz) + (ds) — [ds]. 

(.61) (di) + (de) — [d,). 

(.62) (di) + (dz) + (ds) — [dj]. 

(.63) (e:) + (e3) — [e?] for m' > max (D, E,) (c = 1,2, ---). 
(.64) (e:) + (e2) — [es] for m' > max (D, BE). 

(.65) (e:) + (e2) + (es) — [es] for m' > max (D, E, F). 


(.66) (@:) + (@) — [#3] for m! > max (D, E,) (x = 1,2,---). 


(.67) (&:) + (&) — [@2] for m' > max (D, EB). 

(68) — (&) + (&) + (@) — [és] for m' > max (D, E, FP). 
(.69) (f:) + (f2) — Ife). 

(.70) (f:) + (fe) + (fs) — [fsl. 

(.71) (f1) + (f2) — [fe]. 

(.72) (1) + (f2) + (fs) — Ifa}. 

(.73) (di) + (&:) — (di). 

(.74) (dz) + (2) — (ds). 





tin 








TRANSFORMATIONS OF MULTIPLE SEQUENCES 41 


(.75) (ds) + (&s) — (ds). 
(.76) (ds) 9 (b:) : 
(.77) (a1) + (ds) + (fs) > (e). 


The methods of proof of these relations will be sufficiently clear in view of the 
following typical examples. 
Proof of (.12). From (bs) it follows that a, = 0 for A > C, (kK? = 1, 2, --- ). 
2 Cc 


Hence, for m > C,, in the notation of (dy), >> | ame — ax! = D> lane — ax. 


k%=1 kt=1 
Proof of (.59). By (.002), with the dimensionality of k* represented by ¢, it 
follows that 


« cs t-—1 Q x A 
DY ow = Dam— DS (-Y)Y YD YL LY an — (-1) DY an. 
k%=Q+1 kt=1 r=1 kil] ke] ki=1 


Proof of (.73). For fixed k, the sequence in m, {amx}, is ure, with all row- 
limits ultimately zero. Hence the principal limit is zero. 


§5. Necessity proofs (See §1.7) 


1. N.RCRN — e is (a). By denial of (a:), there exists an r such that }> | a,x | 
k=1 


Mi 
©. Let M; > M,-. be such that P p> lax| 2%. Now s = (—1)'sgna,/i 
=1,$Mi- 
(k = M;, = M;-4) is rern, while a, does not exist. 

2. N.URCRN — ¢ is (az). By denial of (a2), there exist x, r, and sequences 
k? and A; (> A,-1), such that, with «; = x (i = 1, 2,--- ), an, #0. Nows, = 
{(—1)'/a,x (k = k,); 0, otherwise} is urern, while o, = —1+4+1—14---. 

3. N.RCRN — UB is (b:). Since the existence of the transform is assumed, 
(a:) can be assumed, by 1. Now | a,x.| < Ax for m > B, (k = 1,2,---). 
For, with arbitrary p, s, = {1 (k = p); 0, otherwise} is rern, and o,, = @,,, must 
be ub. 

By denial of (b;), there exist sequences M ;, m;, satisfying M; > Mj; 


(3.1) m: > mia, By (k Ss My), 
Mi Mia x 
such that >> | an | = i (2 > Ar + i), where, by (am), >> |anu| S 1. 
k=1 k=1 k=1, $M 
The sequence s; = sgn Qmx/t (k S Mi, £ Mi.) is rern, whereas 
Mi Mia a0 
|om | = Do | ame |/i— 2 DY Ar — ) lane | 2i—1, 
k=1 k=l k=l, my 


so that, by (3.1), o,, is not ub. 
4. N.URCRN — UB is (be). (a2) can be assumed, by 2. By denial of (be), 
there exist r and sequences k? and \;, m,, satisfying A; > Aya, Ce(m) Gi < 9, 


(4.1) my > mei, 





— 
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such that, with «x; = 7 (i = 1,2,--- ), dmx #0. Now s = {1 (k = ky); 


gi 


‘-1 
(= | Qik; Sk; | + i) [as (k = k; fori > 1); 0, otherwise is urern, while 


| om; | = 








i-1 
2B Amgk; Sk; + Ame; 8; | 2%. Hence, by (4.1), om is not ub. 
7=1 

5. N.RCRN — Bis (e:). Proof similar to that of 3. 

6. N.URCRN — B is (ez). Proof similar to that of 4. 

7. N.RCRN —C is (di). Let p be arbitrary. Now s; = {1 (k = p); 0, 
otherwise} is rern, and om = Amp. 

8. N.RCURN — C is (dz). Let p' be arbitrary. The sequence s; = 
{1 (k! = p'; k® = 1, 2,--- ); 0, otherwise} is reurn, and, with (k' = p’), 
Cm = > Om(k)+ 


k=1 
9. N.RC —C is (ds). The sequence s, = 1 (k = 1, 2,---) is re, while 
Cm = > Amk- 
k=1 


10. N. BURCRN — C is (dy). (bi) is assumed, by 3, and (d;), by 7. By 
denial of (d,), then, there exist 6 > 0, 7, and sequences M,( > M,-,) and m; 
satisfying 


(10.1) mM; > M1, B ’ 


Mi 
such that, with (x = =), > | Qm(k) — Qe) | 2 66, where, by (10.1), (b:), and 
ki=1 


o Mi-1 
(01), >> | amnicey — acm | S 4, and, by (di), D> | ania) — Qu | S 4. 
eT, $M; a1 


The sequence s, = {sgn(@mj;x — ax) (x = 7; k? S M,, € My, for 7 odd); 0, 
otherwise} is burern. Now 


on, = : i Ack) 8k) (A) 
ki=1 
(10.2) a 
+ pas (Amir) — Gay) 8) « (B) 
k= 
Mi Mii bed 
(A) exists, by (.01). However, for ¢ odd, | (B) | 2 { ~-832,- } 
kiel ki=l kt= 1, $M 
| Gmgtk) — Gu) | 2 34, and for 7 even, 
Mia oa 
(Bis 4 + > } 1 aaa) ~ au | S28 
kia} ktm l, My 


Hence, by (10.2) and (10.1), ¢,, is not c. 
11. N.B—C is (ds). Proof similar to that of 10. 
12. N.RCRN — CN is (d,). See proof of 7. 
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13. N.RCURN — CN is (dz). See proof of 8. 

14. N.RC — CN is (d3). See proof of 9. 

15. N. BURCRN — CN is (dq). See 10, 12, and (.18). 

16. N.B — CN is (ds). See 11, 12, and (.19). 

17. N.RCRN — URC is (e:).  (b:) is assumed, by 3. Now lim a,,. exists 


for m' > D, (k = 1, 2,--- ). For, with arbitrary p, s, = {1 (k = p); 0, other- 
wise} is rern, and gm = Amp. 

By denial of (e:), there exist sequences 5; > 0, k;, and m} satisfying 
(17.1) m; > mi-15 B, Di; (j <1) , 
such that, with (m! = m!), 


(17.2) OSC Aim; = 5, 


where, by (17.1), 
(17.3) OSC Aim; = O (j <7), 


and, by (17.1) and (b,), 
(17.4) | Qimx;| < A for m? > B (j = 1,2, ---). 


Define d; = 1; d; = (min d,é,)/A-2*' (¢ > 1). Thus lim d; = 0, whence 
v<i i—2 


8. = {d; (k = k,); 0, otherwise} is rern. But 


‘-1 
o(m) = , Qimyk 4; (A) 
7=1 
+ Amy x5 di (B) 
+ > Ami; y (C) 
j=i+l 


and ose (A) = 0, by (17.3); ose (B) = d;é;, by (17.2); ose (C) S 2A > d; s 


m?=00 mt=oo j=i+l 


d;6;/2', by (17.4). Thus ose o(m) 2 d;5;/2, so that by (17.1) o,, is not ure. 


mia 


18. N.RCURN — URC is (e}).  (b:) is assumed, by 3. Now lim } a: 
mt—-c hte] 
exists for m' > Ey (k® = 1, 2,---). For, with arbitrary p*, s, = {1 (4° = p'; 
k* = 1, 2, --- ); 0, otherwise} is reurn, and, with (K° = p*), o, = a Oink) - 
k= 


By denial of (e}), there exist + and sequences 5; > 0, k’*, and m! satisfying, 
with «; = r (¢ = 1,2,--- ), 


(18.1) m, > mj, B, Ex (j <7), 


such that, with (m' = m}), and [k* = k?], ose S> acm) = 5, where, by (18.1), 
kil 


mio 





— 
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with {k* = k}}, ose DY) @im)ix) = 0 for j < i, and, by (18.1) and (bi), | 20 @gnyix) 
m'=2 ki=1 kt=1 
< A for m? > B (j = 1, 2, --- ). Define {d;} as in the proof of 17. Then 








s, = {d; (® = k?; k* = 1, 2, --- ); 0, otherwise} is reurn. But, for m? > B, 
i-1 oo ) C-) C) 

o(m) = > p> Qim) (x) €j) + z Gem) ey Ui + 2D 2 Ame d;, and the conclusion 
j= f= t= j=i+ t= 


follows as in the proof of 17. 

19. N.RCN — URC is (ez). (bi) is assumed, by 3; and (18.1), by 18. The 
proof resembles that of 18. 

20. N.RC — URC is (e3). See proof of 9. 

21. N.BURCRN — URC is (e3). (bi) and (e;) are assumed, by 3 and 17, 
respectively. By denial of (e{), there exist + and sequences m} satisfying 


(21.1) m': > mis, B, D, 


such that, with (m!' = m!}) and (« = =), lim > | acmay — Amiga | > 0. 
kt=1 


mow 


Hence, there exist sequences 6; > 0, and ms 9 M;;, satisfying 


(21.2) B<m)<m).< mi <mi<mii<mii<- 5 

(21.3) My < My < My < My < My < My <---, 

such that, with [m’ = m}; m? = mi | Qjmj(k) — Omi) | 2 66;, where, by 
(21.1), (21.2), (bi), and (.03), ee | Atm) — nia) | S 4, and, by (e:), 
> | Gtm)e) — Omi) | S 6;, Ni; being the first number preceding M,; in (21.3). 


Now the sequence s, = {sgn (@mjz — Gm'x) (kx = m;k* S My, € Ny, for 7 
odd); 0, otherwise} is burern. But 


O[m) = > Amik) 8(&) (A) 
ss p> (Gjmj(k) — Amie) 8x) « (B) 


(A) exists, by (21.1) and (.03). Yet for j odd, | (B) | = { 43 « 


kK k= 


i by Bm}(k) — Omice) | 2 3 5s, while for j even, 
1, Mi; 


|(B)| s { + >> + ayaa — Ani) | S 2 4, 


kt=1, $Mij 


ki= 


whence, by (21.2), lim o,, fails to exist, so that, by (21.1), o» is not urc. 


mi-w 





CC — 
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22. N.BCN — URC is (es). (bi) is assumed, by 3; (e:), by 17; and (e3), by 
21. By denial of (e4), there exist sequences x; and m} satisfying 
(22.1) m; > m;-1, B, D, G., (v <i), 
such that, with (m! a m'), and (x = Ki), lim > | cm) ce) - Omicx) | ~ 0, 
mo kt=]1 
where, by (22.1), with [x = x,], 


(22.2) lim Do | dma — Gnitny | = 0 (v <7). 


mio kt=1 


Hence, there exist sequences 6; > 0, and ms M ;; satisfying the inequalities 
(21.2) and (21.3), such that, with [m' = m}; m? = my Js 


Mij 
p> | Qtmycry — Amcx | 265, 
t=] 


where, by (22.1), (21.2), (bi), (e1), and (.03), ‘. Zz = Q[m)}(k) — Om) (k) | < bi, 


‘= 1, Mi; 
Nij 
and, by (e:), > | Alm) (k) — Am} ck) | s 6, and, by (22.1), (b;), and (.03), 


k=1 


(22.3) p> | Gmy tk) — Anitxy | < 2A 


for m? > B, N,; being defined as in the proof of 21. 
Define {d,} as in the proof of 17. Then s, = {dj sgn (@jmjx — Gmiz) (Kk = Ki} 
k* = My, = Ni, for j odd); 0, otherwise} is ben. Now 


F[m) = Do amit) 91%] (A) 
vy=1 kt=1 
t-1 C) 
+ > p> (Q jmp tk) — Omiey) 812) (B) 
+ p> (jmjcx) — Om) cx) 8x) (C) 
+ ay p> (Qjmjtk) — Omi{a)) Spay - (D) 
(A) exists, by (22.1), (21.2), and (.03); osc (B) = 0, by (22.2) and (21.2); ose 
(D)<4A D> d, S dj,/2™, by (22.3) and (21.2). But for j odd, | (C)| = 
veitl 
Mis Nij C 
(> —2 > = > ) | Qtmy(x) — Amica | ds 2 3d,6;, and for 7 even, 
k= wet bem 1 EM 
\(c)|s (> + . yy ) | Qtmycay — Om) (k) |d; S 2d6;, so that ose (C) = 
=1 t= 1, Mi; j= 
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d,é;, whence, by (21.2), OSC O(m) 2 d;6;/2fori > 1. Thus, by (22.1), o» is 


not ure. 

23. N.B — URC is (es). Proof similar to that of 21. 

24. N.RCRN — URCRN is (@). (bi) and (e:) are assumed, by 3 and 17, 
respectively. Now dmx = 0 for m! > D, (k = 1,2,---). (See the proof of 
17.) By denial of (&), there exist sequences 5; > 0, k;, and m} satisfying 


(24.1) mi > m}_, , B, D, Di; (j <2), 
such that 

(24.2) | mix; | = 5, 

where, by (24.1), 

(24.3) Amr; = 0 (j <1), 
and, by (24.1) and (b;), with (m! = m}), 

(24.4) | Gim)x; | < A for m? > B (j =1,2,---). 


Define {d;} as in the proof of 17. Then = {d;sgn an'x; (k = k,); 0, other- 
wise} isrern. But ¢,, is not urcrn, as can be shown by an argument analogous 
to that used in the conclusion to the proof of 17. 

25. N.RCURN — URCRN is (@:). Proof analogous to the proof of 18 in the 
same way that the proof of 24 is analogous to that of 17. 

26. N.RCN — URCRN is (&). Proof similar to that of 19. (See remark 
under 25.) 

27. N.RC — URCRN is (@;). See proof of 9. 

28. N.BURCRN — URCRN is (83). See 21, 24, and (.34). 

29. N.BCN — URCRN is (&). See 22, 24, and (.35). 

30. N.B — URCRN is (8s). See 23, 24, and (.36). 

31. N.RCRN — RC is (f,). See proof of 7. 

32. N.RCURN — RC is (fz). See proof of 8. 

33. N.RC — RC is (f3). See proof of 9. 

34. N.BURCRN — RC is (f4).__ (e1) and (f;) are assumed, by 5 and 31, respec- 
tively. By denial of (f,), there exist 6 > 0, r', x, and sequences m; (> m‘_,), 

M. 


Mi(> M;_,), such that, with (m! = r'; 2 m?) and (x = 7), > | Q¢my(ky — G,(k) | 
k= 


= 65, where, by (c1), (f:), and (.06), >> | acmay — @ncy | < 4, and, by (f,), 


kt=1,$M 


M;_ 


» ® | @¢m)(k) — @,.e)| <6. The remainder of the proof is similar to that of 10. 
k=l 


35. N.B — RC is (fs). The proof is similar to that of 11. 
36. N.RCRN — RCRN is (f;). See proof of 7. 

37. N.RCRN — RCRN is (f.). See proof of 8. 

38. N.RC — RCRN is (f3). See proof of 9. 
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39. N.BURCRN — RCRN is (f,). See 34, 36, and (.51). 
40. N.B — RCRN is (fs). See 28, 35, and (.52). 


§6. Sufficiency proofs 
Proofs or indications of proofs are given only when this seems necessary. 
1. S.B — e is (a). 
2. S.UB — e are (a) and (a2). Suppose s,; bounded for k > Q. Let r be 
arbitrary, and choose R > max {Q, C,(r)} («x = 1, 2,---,Q). Then |o,| = 
R oo 


1d - > + > bons s{¥4+ ¥ } lens. 


k=1 k=Q+1 k=Q+1 k=1 k=Q+1 


3. S.B — UB is (b;). 
4. S.UB — UB are (b;) and (bz). Suppose s, bounded fork > Q. Let R > 
max {Q,C,} (x = 1,2,---,Q). Form>B,R, 


jonl = |{ 3 - > + & bans s(d+ ¥ } lana. 


k=Q+1 k=Q+1 k=Q+1 











5. S.B — B is (e:). 

6. S.UB — Bare (e:) and (cz). Proof similar to that of 4. 

7. S.RCRN — C are (b:) and (d;). Let R be arbitrary. By (.001), for 
m> B, 


R 
m= {E> eS  % amas + Orn. 
kim] k%=R+1 k=R+1 k=1 


As m, R — , the first expression — 0, by (bi) and (.003); as m— o, the second 


R 
tends to >> ax s:, and as R — © this expression converges, by (.01). Hence 
k=1 


(7.1) c= Fan. 


k=1 
8. S.RCN — C are (b;), (di), and (d:). Let R be arbitrary. For m > B, 
R oc R ) 
ma{Oe SF + E+E fone 
kim] k*%=R+1 k=1 k=R+1 


= >> > > nx (8, — 8a) + sft Ank Sx (A) 


+OD oe Y oat Yous. (B) 


kil 


t-—1 R oo kK R 
- > (-)"> > D> -—(-1" SS on + >> a,,8,. Thus,asm— , 
r= kim} 1 Pos) k=1 


k te 


(B) tends to 5 >> ‘ {le i p> (pS by Lyya — (—1) = a} + 


As m, R — «, (A) — 0, by (b:). Now by (.002), (B) = >> > Sy {E 
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R 

> a,s,, and as R — this expression converges, by (.01) and (.02). Hence 
k=1 

ed ¢—1 c) 
“se ont fw {ly ~Feoed FE Law 
ki=1 r=1 ki=1 
cmb da feas. 
k=1 


kt=1 
9. S.RC — C are (bi), (di), (dz), and (ds). The sequence t; = (8, — 8) is 
ren. Hence, by (8.1) and (ds), 


(9.1) o= 58- L oo > > (Se _ 8) {La _ x (. 1)" Zz ‘4 Liaien 


ki=1 


om (.. 1)¢+1 ym as} ob > ay (s _- 8) . 


kt=1 


10. S.BCN — C are (b:) and (ds). For m > Bon = p> ax%+ >, 
k=1 
(Gmz — ax) 8. These two sums exist, by (.13), (bi), and (01). ‘Let R be arbi- 
R 
trary. Now the second sum w{Z (— 1)" >> >. 4 + (— 1)*41 x} 
v=1 


k=1 
J 


(Amz — Mx) && + > (Amz — a) 8. As m— > &, the first part of this last ex- 


k=R+1 
pression — 0, by (d,) and (.13). As m, R — «, the second part — 0, by (b:) 
and (.01). Hence 


(10.1) c= z. ay Si. 
k=1 


11. S.BC — C are (bi), (ds), and (ds). The sequence t, = (s, — 8) is ben. 
Hence, by (10.1), 


(11.1) ee e > wie ~ a. 


12. S.B — C are (bi) and (ds). For m > B, on = Do ase + D> 
k=1 k=1 
(dmk — Gx) 8%, the expression existing, by (b;:), (.15), (.13), and (.01). Hence 


(12.1) ¢= ye ay Se. 
k=1 


13. S.CN — C are (bi), (be), and (di). Suppose s; ben fork > Q. Let R > 
R R 
max {Q, C.} (x = 1,2,---,Q). Form > B, R, en={5 


k 


> ames. Asm— ®, the first expression on the wee tends to x - 
k=Q+1 = 


R oo 
, ba s,. By (.12), (.58), and (10.1), the second tends to >> ax sx. 
=Q+1 k=Qt+l 















Hence, by (be) and the nature of R, 





(13.1) 


k 


14. S.C — C are (b,), (be), (di), and (ds). 
Hence, by (13.1), 













(14.1) 


o 








15. S.UB — C are (b;), (b2), and (ds). 


the second, yield 





c=), as. 
k=1 
S.RCRN — BC are (¢:) and (d)). 
S.RCN — BC are (e1), (di), and (de). 
18. S.RC — BC are (e:), (di), (de), and (ds). 
19. S.BCN — BC are (¢e:) and (dx). 
. S.BC — BC are (¢:), (ds), and (dj). 
21. S.B — BC are (e:) and (ds). 
. S.CN — BC are (c;), (¢2), and (d,). 
. S.C — BC are (ce), (ce), (di), and (ds). 
. S.UB — BC are (e1), (€2), and (ds). 
. S.RCRN — CN are (b;) and (d,). 
. S.RCN > CN are (by), (di), and (de). 
. S.RC — CN are (bi), (di), (ds), and (ds). 
. S.BCN = CN are (b;) and (d,). 
. S.BC — CN are (b:), (ds), and (dy). 
. S.B— CN is (ds). 
. S.CN > CN are (b;), (b2), and (d;). See 
_ S.C CN are (b), (bs), (di), and (ds). 
. S.UUB— CN are (be), and (ds). 
. S.RCRN — BCN are (c;) and (d,). 
. S.RCN — BCN are (e1), (di), and (2). 
. S.RC — BCN are (e1), (di), (de), and (ds). 
. S.BCN — BCN are (e;) and (d,). 
. S.BC — BCN are (ec), (ds), and (d,). 
. S.B — BCN are (e;) and (ds). See 5 and 
. S.CN — BCN are (e,), (2), and (d)). 
. S.C > BCN are (e,), (e2), (di), and (ds). 
. S.UB — BCN are (e,), (c2), and (ds). 


(15.1) 






16. 
17. 
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oo 
c= > a . 


The sequence t, = (s, — 8) is en. 


ay (s — 8). 


Suppose s, bounded fork > Q. Let 
R be chosen as in the proof of 13, and let o,, be decomposed in the same manner. 
Application of (.15) and (.13) to the first component, and of (.58) and (12.1) to 


See 6, (.11), and 33. 





See 5, (.09), and 7. 
See 5, (.09), and 8. 


See 5, (.09), and 9. 


See 5, (.09), and 10. 
See 5, (.09), and 11. 
See 5, (.09), and 12. 
See 6, (.09), (.11), and 13. 
See 6, (.09), (.11), and 14. 
See 6, (.09), (.11), and 15. 
See (.17) and (7.1). 
See (.17) and (8.1). 


See (.17) and (9.1). 


See (.17), (.20), and (10.1). 
See (.17), (.20), and (11.1). 


(.17) and (13.1). 


See (.17) and (14.1). 

See (.17), (.76), (.23), (.20), and (15.1). 
See 5, (.09), and 25. 

See 5, (.09), and 26. 


See 5, (.09), and 27. 


See 5, (.09), and 28. 
See 5, (.09), and 29. 


30. 


See 6, (.09), (.11), and 31. 


See 6, (.09), (.11), and 32. 
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43. S.RCRN — URC are (b), (di), and (e:). The conditions are S.rern — ec, 


K oO 
by 7. Let r' > B,D. For m?> B, with (m' = r’), om) = {E f+ 


ki=1 kt=R+1 
«o 


R 
p + aan 8k + >> acme si. As m?, R— , the first part > 0, by (b:); as 
k=1 


k=R+1 


R 
m? — x, the second tends to }> anx sx, and as R — », this expression con- 
k=1 


verges, by (.03). Hence 
(43.1) on = Dy One &. 
k=1 


44. S.RCURN — URC are (b:), (di), (de), (e1), and (e3). The conditions are 
S.rcurn—c, by 8. Suppose s rernfork > Q. Let r'> B, D, E. (x = 1, 2,---,Q). 


For m? > B, with (m' = r'), om) = 2, Q(m)k 8k + + (- aa > > ¥ ae 
=Q+1 


kim] ktm 
@ 
+ (-1)"" } amese. By (.58), 43, and (e:), as m? > ©, the first and third 
k=1 
parts converge. With (k? = p* < Q), the summand next to the last in the second 


part becomes, for arbitrary R, 


C) R 
2 Bema 80 = {x rE Zz + ; + ©, } ease 


k=] k@eR+1 kt=1 


R 
= } » bo Gm (x) (Sey — Syren) + oe Aim) () (Sx) — Spa) (A) 


kim] k@mR+1 


R J R 
+ DD Syne Do emery + De Geme Say +8 DY may. (B) 
kilo] kV=R+1 kt=1 kt=R+1 


As m?, R— «, (A) 0, by (bi). Now (B) = >> bs 80 4 _ - (—1)"™! 
LD Dd -(-y" z Qim) (ky + Bim) (ky Sky Sp z- > (-—1)"" 


kien] KAM] r=l1 
R oo 


R 
2 — (-1)" p> Gime)» AS m?—» ©, (B) tends to >> > Syn 
~— 


kin] = 


{L rips — > (—1)"" Do - Lyrpxaen — (—1)4! x arp} + pa G(x) + 
sy {L ry : (—1)! > x Lagan _ (—1) > sah. As R- o, this 


expression converges, by (.03) and (.04). Hence, with obvious reductions, 


n= p 0,4, 8% + i (- 1)"* } + {x . tous | Lowe _ zr (— i 


r=1 
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(44.1) } a > Lyspayen _ (—1)*#1 > a. | + Ss EE _ > (— 1)"+! 


kee 1 r=1 


} x Lay — (— | ad b= a..|} ° 


45. S.RCN — URC are (b;), (di), (de), (e1), and (es). The conditions are 
S.ren — cc, by 8. Let r' > B, D, E. For m*? > B, and arbitrary R, with 


R aa cd R 
(m' =Pr'),om =D DV De Amal — 9) + Le Gms + DD se 
ki] kt=R+1 k=R+1 kim 
c R 
>» Qim)k + > @,»)x8;- Ina manner similar to that used in the proof of 8, 
kt=R+1 k=1 


it can be shown that 
on  e 2 su Low Fs. p> (- ayer > >» Lyn 
- (—1)'+# > anu} + } Gay, 8, « 
k=1 k=1 


46. S.RC — URC are (b:), (di), (de), (ds), (e1), (e2), and (e3). The conditions 
are S.rc — c, by 9. The sequence t, = (s, — s) is ren. Hence, by (45.1), for 
> B,D,E,F, 


c ¢—1 x 
o, = s- La + z. p> (Sys —_™ 8) {lw — p> (51) ; x >» Lyn 
_ 9 £ anh 4 E aala a. 
ki=1 k=1 


47. S.BURCRN — URC are (b;), (da), (e1), and (e$). The conditions are 
S.burern — c, by 10. Suppose s, rern for k > Q. Let r' > B, D, G. 


(x = 1,2,---,Q). Form? > B, with (m' =r"), om =~» > @omese + ¥ 


k=Q+1 v=l 


(45.1) 


(46.1) 


(— 1)" ys x > A(m)k 8k a (— 1)*+! ) » Aimy k 8k - By (.58), (.13), and 43, as 


t=] kim] 


m?— «© ‘the first and third parts converge. With (k* = p* S Q), the summand 


next to the last in the second part becomes, for arbitrary R, a Q(m) (x) Sk) = 
kt 
a we 
D au 8a) + DS (Quy) — @r@)8a), the expression existing, by (b,) and (.03). 
k= kt 


2 n—l 
As m? > », the last sum — 0, by (e}). Hence o, = , ™ 0,4. 8, + : (—1)"** 
vy=l 


k=Q+1 
io) 


g C) 
; > } G1, 8% + (-1)"" a a, 8, 5 Or 


kite] kim] 


(47.1) n= ) Gay, 8, « 
k=1 
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48. S.BURCN — URC are (b,), (dy), (e1), (e2), and (e3). The conditions are 
S.buren — c, by 10. Suppose s, ren for k > Q. Let r' > B, D, E, G, 
(x = 1, 2, --- , Q). For m? > B, with (m' = r'), om = » Acm)k Sk + 


k=Q+1 


n—1l Q oo Q 
2. (—1)4 > yo » Aim)k 8k + (—1)**! > Acm)k Sk - By (.58), (.13), (.14), 
| B11 k= 
(.64), and 45, as m?— ~, the first and third parts converge. With (k* = p* S Q), 


the summand next to the last in the second part becomes: ; Q(m)(k) Sk) = 
kt=1 
wo oO 
D ap 8ay + Do (Gmc) — 2@) Sa, the expression existing, by (b;) and (.03). 
kt=1 kt=1 


As m? — «, the last sum — 0, by (ef). Hence on= >> > 3 om 


sme+t mio kt=Q+1 


Qim)k — > (- aap > > lim = tome — (— ili” = ona} + 3 2 Gusts 


k9=Q+1 mt ktm 
which os to 


On = > oe Sys a > > (-—1) > ba Linnea — (—1)! 


ke=Q+1 
g s—1 x t-1 Q 
48.1) Sa,-S (vd ¥ E& ~F yer FH Less 
=I t=1 kilegQt pms | aim 
y 20 cy 
- (— 1" > ans -(—~1" > ans} + Dans. 
k=1 


k@=1 k4=Q+1 


49. S.BURC — URC are (by), (ds), (da), (e:), (e2), (es), and (e3). The con- 
ditions are S.bure — c, by 11. The sequence t, = (s; — 8s) is buren. Hence, 
by (48.1), for r' > B, D, E, F, G, (« = 1, 2, --- , Q), Q being defined as in the 
proof of 48, 


on = 8-Ly + > > (9 {1 rks — >>) (— 1)"*1 > p> Lrkaen 


k=Q+1 r=1 
« (. fore z an — y (— 1) pO - 
(49.1) : vitae : 
[1 rikake. — = (— 1)"*! > ae Lrakeken -_ (— yt a 


-(-1)" > ans} + > an (% — 8). 
kt=Q+l k=1 
50. S.BCN — URC are (b;), (d4), and (e4). The conditions are S.ben — ec, 
by 10. Forr' > B, H, it can be proved in a manner similar to that used in the 
proof of 10, that 


(50.1) on = y Griz Sk. 
k=1 
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51. S.BC — URC are (bi), (ds), (ds), (es), and (e4). The conditions are 
S.be — c, by 11. The sequence t, = (s, — s) is ben. Hence, by (50.1), for 
r> B,F,H, 


(51.1) on=s-L+ > Grx (8% — 8). 
k=1 
52. S.B — URC are (b;), (ds), and (es). The conditions are 8.b — c, by 12. 


For r' > B, J, it can be proved in a manner similar to that used in the proof of 12 
that 


(52.1) on = > On &. 
k=1 
53. S.URCRN — URC are (b:), (bz), (di), and (e:). The conditions are 
S.urern — c, by 13. Suppose s; rern fork > Q. Let R > max {Q, C,} 


(x = 1,2,--- ,Q). Forr' > B, D, R, it can be proved in a manner similar 
to that used in the proof of 13, that 


(53.1) on = > Arie Sx. 
k=1 


54. S.URCN — URC are (b:), (be), (di), (1) and (e2). The conditions are 
S.uren — c, by 13. Suppose s ren for k > Q. Let R > max {Q, C,} 


(x = 1,2,---,Q). Let r' > B, D, E, R. For m* > B, R, with (m' = Pr’), 
R R oo 

o(m) = ; - os \ Qim)k Sk + } i Qimk Se. As m? — o, the first 
k=1 k=Q+1 k=Q+1 


part tends to {zs _ > ana &. By (.58), (.12), (.14), (.64), and (45.1), the 


k=1 k=Q+1 
CJ 


second tends to )) > & {tim > Qin = > (-pe yd > 


ki=Q+l miro kt=Q+1 kM=Q+l 
oO a 


lim > am—(—)I) YO anr>+ YO an ss. Hence, as in the 


mio k@=Q+l ké=Qt+1 k=Q+1 
proof of 48, 
« t-1 ¥v 
on = ss > Ska {in _ , (— 1) > p | DEveren 
kt=Q+1 r=1 kel 
v t-1 I 
— (~ 1)" yi on } > (— 1) pe bs 
k= 1 r=1 kO=Q+1 
(54.1) ee : 
[Ene = > (-)p*> Pe Lexan — (— 1) p> ans 


k= Q+1 


— (— 1) > avs} 2 Griz Se « 
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55. S.URC = URC are (b;), (be), (di), (ds), (e1), (€2), and (e3). The condi- 
tions are S.ure — c, by 14. The sequence t; = (s, — 8) is uren. Hence, by 
(54.1), for r' > B, D, E, R, 


oo ¢—} Q 
C1 = s-La + : me ; (s,s _- 8) {lw _ p> (— 1)"*1 ) >» Leys 


Q+ 


co) 


(55.1) (9 Yau — D-DD ie - > (-pe 


kMl=g+ 


; . Ly apspangen - (— 1)*! 7 ane | _ (— 1)*+! > avn} 


iti ké=Q+1 
3) 
+ ts a, (8, — 8). 
k=1 


56. S.CN — URC are (b:), (bz), (di), and (e,). The conditions are S.cn — ¢, 
by 13. Suppose s, ben fork > Q. Let R > max {Q, C,} (« = 1, 2,--- , Q). 
For r' > B, H, R, it can be proved in a manner similar to that used in the proof 
of 13 that 


(56.1) n= pk G4, 8; - 
k=1 


57. S.C — URC are (b:), (bz), (di), (ds), (es), and (e,). The conditions are 
S.c — c, by 14. The sequence t, = (s, — 8) is en. Hence, by (56.1), for 
r' > B, H, R, where R is defined as in the proof of 56, 


(57.1) n= s-L +- Z. G1, (8, —_ 8). 
k=1 


58. S.UB — URC are (b;), (b2), (ds), and (es). The conditions are S.ub — ¢, 
by 15. Suppose s, bounded fork >Q. Let R > max {Q, C,} (« = 1,2, ---,Q). 
For r' > B, I, R, it can be proved in a manner similar to that used in the proof 
of 15, that 


(58.1) 1 = 2 Oy. 8 . 
k=1 


59. S.RCRN — BURC are (1), (di), and (e:). See 5, (.09), and 43. 

60. S.RCURN — BURC are (c,), (di), (dz), (e:), and (e3). See 5, (.09), and 
44. 

61. S.RCN — BURC are (e:), (di), (de), (e1), and (e2). See 5, (.09), and 45. 

62. S.RC — BURC are (e:), (di), (de), (ds), (e1), (e2), and (e3). See 5, (.09), 
and 46. 

63. S.BURCRN — BURC are (c,), (ds), (e:), and (e3). See 5, (.09), and 47. 

64. S.BURCN — BURC are (c:), (da), (e:), (€2), and (e%). See 5, (.09), and 
48. 

65. S.BURC — BURC are (cx), (ds), (da), (e1), (e2), (es), and (e7). See 5, 
(.09), and 49. 
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66. S.BCN — BURC are (e:), (da), and (e4). See 5, (.09), and 50. 

67. S.BC — BURC are (ce), (ds), (da), (es), and (e4). See 5, (.09), and 51. 

68. S.B — BURC are (c;), (ds), and (es). See 5, (.09), and 52. 

69. S.CN — BURC are (e:), (€2), (di), and (e:). See 6, (.09), (.11), (.25), 
and 56. 

70. S.C — BURC are (c:), (e2), (di), (ds), (ex), and (es). See 6, (.09), (.11), 
(.25), and 57. 

71. S.UB— BURC are (c1), (cs), (ds), and (es). See 6, (.09), (.11), and 58. 

72. S.RCRN — URCN and (by), (di), and (e:). See (.17), 43, and 25. 

73. S.RCURN — URCN are (b:), (d:), (ds), (e:), and (e3). See (.17), 44, and 
26. 

74. S.RCN — URCN are (b:), (di), (de), (e1), and (e2). See (.17), 45, and 26. 

75. S.RC — URCN are (b,), (di), (de), (ds), (e1), (e2), and (es). See (.17), 
46, and 27. 

76. S.BURCRN — URCN are (b;), (da), (e:), and (e3). See (.17), 47, and 28. 

77. S.BURCN — URCN are (b;), (da), (e:), (e2), and (e3). See (.17), 48, 
and 28. 

78. S.BURC — URCN are (b;), (ds), (da), (e1), (€2), (es), and (e$). See (.17), 
49, and 29. 

79. S.BCN — URCN are (b;), (da), and (e4). See (.17), 50, and 28. 

80. S.BC — URCN are (b;), (ds), (da), (es), and (e4). See (.17), 51, and 29. 

81. S.B — URCN are (ds) and (es). See (.17), (.76), 52, and 30. 

82. S.URCRN — URCN are (b;), (b2), (di), and (e;). See (.17), 53, and 31. 

83. S.URCN — URCN are (by), (bz), (di), (e:), and (ez). See (.17), 54, and 31. 

84. S.URC — URCN are (b;), (bz), (di), (ds), (e1), (e2), and (es). See (.17), 
55, and 32. 

85. S.CN — URCN are (b:), (bz), (di), and (e4). See (.17), 56, and 31. 

86. S.C —- URCN are (b;), (b2), (di), (ds), (es), and (e4). See (.17), 57, and 32. 

87. S.UB — URCN are (bz), (ds), and (es). See (.17), (.76), 58, and 33. 

88. S.RCRN — BURCN are (ci), (di), and (e:). See 5, (.09), and 72. 

89. S.RCURN — BURCN are (e:), (di), (de), (e:), and (e3). See 5, (.09), 
and 73. 

90. S.RCN — BURCN are (cx), (di), (de), (e1), and (e2). See 5, (.09), and 74. 

91. S.RC — BURCN are (ex), (di), (dz), (ds), (e:), (€2), and (es). See 5, 
(.09), and 75. 

92. S.BURCRN — BURCN are (ce), (da), (e:), and (e3). See 5, (.09), and 76. 

93. S.,BURCN — BURCN are (c:), (da), (e:), (e2), and (e3). See 5, (.09), 
and 77. 

94. S.BURC — BURCN are (c:), (ds), (da), (e:), (e2), (es), and (e3). See 5, 
(.09), and 78. 

95. S.BCN — BURCN are (ce), (d4), and (e4). See 5, (.09), and 79. 

96. S.BC — BURCN are (c1), (ds), (da), (es), and (e4). See 5, (.09), and 80. 

97. S.B — BURCN are (c:), (ds), and (es). See 5 and 81. 

98. S.CN — BURCN are (e:), (c2), (di), and (e:). See 6, (.09), (.11), (.25), 
and 85. 





ee 
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99. S.C + BURCN are (c,), (cs), (di), (ds), (e1), and (e3). See 6, (.09), (.11), 
(.25), and 86. 

100. S.UB — BURCN are (c;), (2), (ds), and (es). See 6, (.11), and 87. 

101. S.RCRN — URCRN are (by), (di), and (@:). See (.33) and (43.1). 

102. S.RCURN — URCRN are (b;), (di), (de), (@:), and (83). See (.33) and 
(44.1). 

103. S.RCN — URCRN are (bi), (di), (de), (@:), and (@2). See (.33) and 
(45.1). 

104. S.RC — URCRN are (b:), (di), (ds), (ds), (&), (&s), and (&3). See (.33) 
and (46.1). 

105. S.KRURCRN — URCRN are (by), (ds), (e3), and (@). See (.33) and 
(47.1). 

106. S.BURCN — URCRN are (b:), (ds), (e3), (&), and (@:). See (.33) and 
(48.1). 

107. S.BURC — URCRN are (b,), (ds), (ds), (e3), (@:), (&), and (&s). See 
(.33) and (49.1). 

108. S.BCN — URCRN are (b,), (ds), and (&). See (.33) and (50.1). 

109. S.BC — URCRN are (b:), (ds), (da), (@s), and (@s). See (.33) and (51.1). 

110. S.B — URCRN are (ds) and (és). See (.33), (.43), (.39), (.15), (.13), 
(.73), (.19), (.76), and (52.1). 

111. S.URCRN — URCRN are (b;), (be), (di), and (&:). See (.33) and (53.1). 

112. S.URCN — URCRN are (b,), (be), (di), (#1), and (@2). See (.33) and 
(54.1). 

113. S.URC — URCRN are (b,), (be), (di), (ds), (@:), (@2), and (@3). See (.33) 
and (55.1). 

114. S.CN — URCRN are (b,), (be), (di), and (@). See (.33) and (56.1). 

115. S.C — URCRN are (b;), (bz), (di), (ds), (@s), and (@). See (.33) and 
(57.1). 

116. S.UB — URCRN are (bs), (ds), and (@s). See (.33), (.43), (.39), (.15), 
(.13), (.73), (.19), (.76), and (58.1). 

117. S.RCRN — BURCRN are (ce), (di), and (@). See 5, (.09), and 101. 

118. S.RCURN — BURCRN are (ei), (di), (de), (@:), and (@3). See 5, (.09), 
and 102. 

119. S.RCN — BURCRN are (ce), (di), (de), (#1), and (@2). See 5, (.09), and 
103. 

120. S.RC — BURCRN are (¢), (di), (ds), (ds), (&), (€2), and (é3). See 5, 
(.09), and 104. 

121. S.BURCRN — BURCRN are (ce), (da), (e%), and (&). See 5, (.09), 
and 105. 

122. S.BURCN — BURCRN are (c,), (da), (e3), (%:), and (&). See 5, (.09), 
and 106. 

123. S.BURC — BURCRN are (c), (ds), (da), (e3), (@), (@), and (&3). See 
5, (.09), and 107. 

124. S.BCN — BURCRN are (c,), (ds), and (@). See 5, (.09), and 108. 
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125. S.BC — BURCRN are (e;), (ds), (da), (@s), and (@). See 5, (.09), and 
109. 

126. S.B — BURCRN are (ce), (ds), and (5). See 5 and 110. 

127. S.CN — BURCRN are (e:), (cz), (di), and (&). See 6, (.09), (.11), (.33), 
(.25), (.35), and 114. 

128. S.C — BURCRN are (e:), (2), (di), (ds), (@:), and (@3). See 6, (.09), 
(.11), (.33), (.25), (.35), and 115. 

129. S.UB — BURCRN are (ce;), (€2), (ds), and (5). See 6, (.11), and 116. 

The proofs of * # 130-138 have already been constructed under the corre- 
sponding theorems for ure transformations, among # # 43-58, as examination 
of these proofs will show, in view of (.09), (.11), (.44), and (.49). Also, the form 
of the row-limits of the transform will be found there. 

130. S.RCRN — RC are (e;), (di), and (f;). 

131. S.RCN — RC are (e:), (di), (de), (f1), and (fe). 

132. S.RC — RC are (e:), (di), (de), (ds), (f1), (f2), and (fs). 

133. S.BCN — RC are (e:), (da), and (f,). 

134. S.BC — RC are (e:), (ds), (da), (fs), and (f,). 

135. S.B — RC are (c1), (ds), and (fs). 

136. S.CN — RC are (e1), (€2), (di), and (f;). 

137. S.C — RC are (e1), (€2), (di), (ds), (f1), and (fs). 

138. S.UB — RC are (c1), (2), (ds), and (fs). 

139. S.RCRN — RCN are (¢;), (di), and (f;). See (.17), 130, and 34. 

140. S.RCN — RCN are (e;), (di), (de), (£1), and (f2). See (.17), 131, and 35. 

141. S.RC — RCN are (ex), (di), (ds), (ds), (f:), (fe), and (fs). See (.17), 132, 
and 36. 

142. S.BCN — RCN are (c;), (da), and (f4). See (.17), 133, and 37. 

143. S.BC — RCN are (ce), (ds), (ds), (fs), and (f4). See (.17), 134, and 38. 

144. S.B — RCN are (e:), (ds), and (fs). See (.17), 135, and 39. 

145. S.CN — RCN are (c1), (¢2), (di), and (f:). See (.17), 136, and 40. 

146. S.C — RCN are (ex), (e2), (di), (ds), (f1), and (fs). See (.17), 137, and 41. 

147. S.UB — RCN are (e1), (cz), (ds), and (fs). See (.17), 138, and 42. 

148. S.RCRN — RCURN are (c:), (di), (@:), and (f;). See 130 and 117. 

149. S.RCURN — RCURN are (c,), (di), (ds), (), (@2), (f), and (f2). See 
131 and 118. 

150. S.RCN — RCURN are (¢1), (di), (de), (&:), (@2), (f1), and (f2). See 131 
and 119. 

151. S.RC — RCURN are (e:), (di), (de), (ds), (@:), (G2), (Gs), (fi), (fe), and 
(fs). See 132 and 120. 

152. S.BCN — RCURN are (¢,), (da), (@:), and (f4). See 133, (.49), (.35), and 
124. 

153. S.BC — RCURN are (e:), (ds), (da), (@:), (@s), (fs), and (f4). See 134, 
(.49), (.35), and 125. 

154. S.B — RCURN are (e;), (ds), (@:), and (fs). See 135, (.49), (.36), and 
126. 
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155. S.CN — RCURN are (e1), (¢2), (di), (@:), and (f;). See 136 and 127. 

156. S.C — RCURN are (c1), (ez), (di), (ds), (&:), (@s), (f1), and (f3). See 137 
and 128. 

157. S.UB — RCURN are (¢;), (cz), (ds), (@:), and (f5). See 138, (.49), (.36), 
and 129. 

158. S.RCRN — RCRN are (c:), (di), and (f1). See (.50), 130, and (43.1)." 

159. S.RCN — RCRN are (ce), (di), (de), (£1), and (f2). See (.50), 131, and 
(45.1)." 

160. S.RC — RCRN are (c:), (di), (de), (ds), (f:), (fe), and (fs). See (.50), 
132, and (46.1)." 

161. S.BCN — RCRN are (e;), (ds), and (f4). See (.50), 133, and (50.1). 

162. S.BC — RCRN are (e:), (ds), (da), (fs), and (f4). See (.50), 134, and 
(51.1). 

163. S.B — RCRN are (a:), (ds), and (fs). See (.50), (.57), (.43), (.39), (.15), 
(.13), (.73), (.19), (.77), 135, and (52.1). 

164. S.CN — RCRN are (c1), (¢2), (di), and (f;). See (.50), 136, and (56.1)." 

165. S.C — RCRN are (¢1), (¢2), (di), (ds), (£1), and (fs). See (.50), 137, and 
(57.1)." 

166. S.UB — RCRN are (a), (cz), (ds), and (fs). See (.50), (.57), (.43), 
(.39), (.15), (.13), (.73), (.19), (.77), 138, and (58.1)." 


§7. Conclusion 


It is now possible to write down at once S. conditions for the transformation 
{sk} — {om} in each of the 256 possible cases. For example, consider buren — 
cn. Now # # 25-33 of §6 state conditions S. that the transforms of various 
sequences {s,} be cn, and among them # 28 involves the most restricted class in 
these #* #(namely ben) including the type buren. (See diagram.) Hence 
S.buren — en are (b;) and (d,). 

The S. conditions thus secured will be found to be always N. for the transfor- 
mation concerned, under the added hypothesis of existence. Thus in the above 
example (b;) is N.buren — (en and e) by #3 of §5, and (d,) is N.buren — en by 
#15 of §5. (See diagram.) 

From the 8. conditions thus obtained it is possible to write at once conditions 
S. that the transform be also existent. This can be accomplished by adding (a:) 
if {s,} is rern, reurn, ren, re, burern, buren, burc, ben, be, or b, and by adding 
(a;) and (a2) if {s,} is urern, uren, urc, cn, c, or ub, as is shown by #* #1 and 2 
of §6. Also, * #1 and 2 of §5 show that these conditions are N. for existence. 
Hence the set of conditions thus obtained will be both N. and 8. It is to be 
observed that, by (.08) and (.10), the addition of (a;) is unnecessary when (c:) 
already appears, and the addition of (a2) is superfluous when (cz) is present. 
Thus N. and 8S. buren — (en and e) are (a;), (b;), and (d,). 

When || a,x || is row-finite, (a:) and (az) are automatically satisfied, so that 
the question of existence does not enter. Thus for row-finite (and, in particular, 


13 See remark preceding 130. 











TRANSFORMATIONS OF MULTIPLE SEQUENCES 59 


for triangular) matrices, N. and S. buren — en are (b;) and (d4), and ¢,, exists 
for each m. 

Stronger sets of S. conditions for any transformation can easily be secured, as 
suggested in §2. Thus conditions 8.U — v, where U, v represent classes of 
sequences, are S.buren — en whenever U > buren anden Dv. To find all of 
those conditions in §3 which are N. for a given transformation involves either 
applying the relations in §4 to the N. conditions already obtained, or collecting 
all relevant implications from the theorems in §5. Thus N.buren — (en and e) 
are (a1), (bi), (di), (de), (ds), (di), (de), and (d,). 

Conditions for convergence preservation, with preservation of the limit for 
null sequences, are also available in the several cases as combinations of the corre- 
sponding separate sets of conditions. In view of (.17), these may be simplified to 
yield the following table, which is here presented for convenience in possible 
future reference. 


RC —C (ay), (bi), (ds), (di), and (da). 
BURC —C (ai), (bi), (ds), and (d,). 

BC -—C (ai), (bi), (ds), and (d,). 

URC -—C (ai), (a2), (bi), (bz), (ds), and (di). 
C —>C (ax), (a2), (b:), (bz), (ds), and (di). 


RC -—BC (), (ds), (di), and (dy). 
BURC — BC (ce), (ds), and (d,). 
BC -—BC (ce), (ds), and (d,). 
URC -— BC (qa), (cz), (ds), and (di). 
C — BC (cx), (cs), (ds), and (di). 


RC — URC (a1), (bi), (ds), (d,), (d.), (e,), (es), and (es). 
BURC — URC (a), (bi), (ds), (di), (ex), (e2), (es), and (e4). 

BC -— URC (a), (bi), (ds), (ds), (es), and (e.). 

URC -— URC (a), (a2), (bi), (bz), (ds), (di), (ex), (€2), and (es). 
C — URC (a), (a2), (bi), (bz), (ds), (di), (es), and (e.). 


RC -BURC (ce), (ds), (di), (de), (ex), (es), and (es). 
BURC — BURC (¢:), (ds), (da), (ex), (€2), (es), and (e4). 
BC  -BURC (ei), (ds), (da), (es), and (e,). 

URC — BURC (c:), (ce), (ds), (di), (e:), and (es). 

C — BURC (cx), (cz), (ds), (di), (e1), and (es). 


RC -—RC (ce), (ds), (di), (de), (fi), (f2), and (fs). 
BURC — RC (e:), (ds), (da), (fs), and (f,). 

BC — RC (ci), (ds), (d,), (fs), and (f4). 

URC — RC. (ci), (¢2), (ds), (di), (fi), and (fs). 

C — RC (c1), (2), (ds), (d,), (fi), and (fs). 
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These sets are N. and S., existence being assumed. If (a;) and (a2) are deleted, 
it may be again remarked, existence is not necessarily implied (save when the 
transform is bounded). 

Now by examination of (9.1), (11.1), and (14.1), it is seen that in each case 
o = s-L. If, then, L = 1 in the formulation of (ds), N. and S. conditions for 
regularity are secured. 


Brown UNIVERSITY. 














THE GROUPS DETERMINED BY THE RELATIONS 
S'= T" = (S“T-18T)? = 1 
Part I 
By H. S. M. Coxerer 


In working out the commutator subgroups of the finite groups generated by 
reflections, I came across a group of order 288 having the abstract definition 


$= T = (S'T“ST)} = 1. 


When I sent this result to Dr. Sinkov, he replied that he was making a special 
study of such groups. So we agreed to write consecutive papers, his abstract 
treatment to follow my geometrical treatment. 


Groups of the form S' = 7” = (ST)" = 1, considered for the sake of analogy 


A triangle of angles x/l, /m, x/n can be drawn on a sphere, or in the euclidean 
plane, or in the hyperbolic plane, according as the number 1/1 + 1/m + 1/n is 
greater than, equal to, or less than unity. By reflecting this triangle in its sides 
repeatedly, we fill the whole sphere or plane with such triangles, which may be 
shaded or left white, according to their orientation. Dyck' showed that the 
white (or shaded) triangles correspond to the operators of the abstract group 


S' = T™ = (ST)*" = 1. 
It follows that this group is finite when 

1/l + 1/m + 1/n>1, 
and infinite otherwise. More precisely, its order is 


2 
1/l+1/m+1/n—-1 
whenever this number is positive, and is infinite otherwise. Miller® proved that 
each infinite group has an infinite number of finite factor groups. 
Very little is known about the infinite groups, save in the euclidean case 
1/l + 1/m + 1/n = 1. 


This case is manageable on account of the presence of self-conjugate subgroups 
generated by translations, whose quotient groups are obtained by identifying 





Received April 11, 1935. 

1W. Dyck, Gruppentheoretische Studien, Math. Ann., vol. 20 (1882), pp. 1-44. 

2G. A. Miller, Groups defined by the orders of two generators and the order of their product, 
Amer. Jour. of Math., vol. 24 (1902), pp. 96-100. 
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points of the plane that occupy corresponding positions in a network of period 

parallelograms. These quotient groups are as follows: 
S* = T* = (ST)* = (ST")(S"T) = 1, of order 3(b? + be + c*); 
St = T* = (ST)? = (ST")(S"T): = 1, of order 4(b? +c?) ; 
S§ = T* = (ST)? = (TST-'S“)*(ST-'S“"T)* = 1, of order 6(b? + be + c’). 
In particular (putting b = c in the first two cases, and b = 0 in the third), 


S§ = T? = (ST)* = (ST"S"T)? = 1 is of order 9p?, 
St = T4 = (ST)? = (ST“"'S""T)? = 1 is of order 8p?, 
S' = T¢ = (ST)? = (ST“'S"'T)? = 1 is of order 6p’. 


This suggests the following theorem, which is not strictly relevant to our main 
purpose, but we shall state and prove it, on account of the close analogy with 
Theorem 4 below (where the geometry is in three dimensions instead of two). 

THEOREM 1. In the group S' = T™ = (ST)" = 1 with 1/l1+ 1/m+ 1/n 31, 
the commutator of the generators is of infinite period.‘ 

Lemma 1.1. Ona sphere, or in the euclidean or hyperbolic plane, the continued 
product of the reflections in the sides of a triangle is an operation that leaves no point 
invariant§ 

Let Ri, R2, Rs denote the reflections in the sides of a triangle A,;AsA;. If 
possible, let the point P be invariant under the operation Ri R2R3, so that P = 
P.RR2R;, i.e., P-R;:R2R; = P. If P does not lie in the side A;Ag, let P-R; = P’, 
so that P’.R2R, = P. Since R2R, leaves A; invariant, A;3P’ = A;P. Hence A; 
lies on the perpendicular bisector of PP’, which is A,A2 (by definition of P’). 
On the other hand, if P lies in A,;A2, we must have P- R22; = P, which makes P 
coincide with A; (or its antipodes). In either case we are led to the absurd con- 
clusion that A; lies in A;A». Therefore P cannot exist. 

Lemma 1.2. For any finite set of (actual) points in the hyperbolic plane, we can 
define a unique “centroid”, which is invariant under all permutations of the points. 

Just as we may represent the points of the elliptic plane by concurrent lines in 
ordinary space, so also we may represent the points of the hyperbolic plane by 
time-like lines through a fixed point O of Minkowski three-space. There is, 
however, one important difference. In the latter case the representative lines 
are directed (in virtue of the “‘before-after” relation), and so can be replaced by 
points, equidistant from O, set off along them, either all “before” O or all 
“after” O. In other words, a “sphere” of time-like radius resembles a hyper- 
boloid of two sheets, and either of the sheets provides a (1, 1) mapping of the 


2 W. Burnside, Theory of Groups of Finite Order, Cambridge, 1911, p. 419. 

‘ This is a departure from the usual terminology. It seems desirable to speak of the 
order of a group, but the period of an operator. 

* Cf. Theorem 10 of Coxeter, Discrete groups generated by reflections, Annals of Math., 
vol. 35 (1934), p. 602. 
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hyperbolic plane. Let G denote the centroid of those points of Minkowski space 
which represent the given set of points of the hyperbolic plane. Then the 
required centroid of the given points is that point of the hyperbolic plane which 
is represented by the line OG. 

Proof of Theorem1. Consider the larger group 


(1.3) i = R} = R} = (R2R3)™ = (R3Ri)" = (RiR2)! = 1, 


in which the given group is a subgroup of index 2, generated by S = RR, 
T = R2R;. The generators Ri, Re, R; are reflections in the sides of a triangle of 
angles z/m, x/n, x/l (in the euclidean or hyperbolic plane, by virtue of the 
inequality). 

By Lemma 1.1, the operation RRR; leaves no point invariant. If this opera- 
tion were of finite period, its powers would transform any given point into a 
finite set, whose centroid would be invariant. Therefore Ri R2R; is of infinite 
period. But® (Ri R2R;)? = ST*S“"T. Hence’ ST—S~'T is of infinite period. 

The same result could have been obtained trigonometrically, by showing that 
the commutator is a rotation through y, where® 


cos? (y/4) = cos? x/l + cos? x/m + cos? r/n + 2 cos r/l cos r/m cos x/n. 


When 1/1 + 1/m + 1/n < 1, is pure imaginary, or rather it is a hyperbolic 
argument instead of an angle. 
By virtue of Theorem 1, we should expect a great variety of factor groups of 


S'= T™ = (ST)"=1 = (1/l+ 1/m +4 1/n < 1) 


to be obtainable by fixing the period of the commutator. Although some prog- 
ress has been made along these lines,’ the known results lack generality. There 
is, however, a general geometrical treatment for the case when we fix the period 
of the commutator but leave the product (S7’) unrestricted. In this respect, 
the product and commutator exchange réles in a remarkable manner. 


* Schwarz used this operation (R,R,R;)? to determine the triangle of minimum perimeter 
having one vertex on each side of a given triangle. Gesammelte math. Abhandlungen, 
vol. 2 (1890), p. 344. 

7 We often find it convenient to write the commutator in this form, instead of the ortho- 
dox S“'T-1ST. In statements of period, this clearly makes no difference, since each of 
these operators is conjugate to the inverse of the other. 

8 Putting 0 = 27/l, ¢ = 2x/mand cos \ = (cos r/l cos r/m + cos x/n)/sin r/l sin x/m 
in formula (1) of G. de B. Robinson, The real representation of the commutator S-'T—ST in 
four dimensions, Proc. Camb. Phil. Soc., vol. 26 (1930), p. 305. 

*°H. R. Brahana, Certain perfect groups generated by two operators of orders two and three, 
Amer. Jour. of Math., vol. 50 (1928), pp. 345-356. 

A. Sinkov, A set of defining relations for the simple group of order 1092, Bull. Amer. Math. 
Soc., vol. 41 (1935), p. 42. 

Burnside (op. cit., p. 422) gives the symmetric group of degree 5 in the form S? = T° = 
(ST)* = (ST-“'ST)* = 1; S? = TS = (ST)* = (ST ST)? = 1 is equally valid. 
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The case when T is involutory 


When n = 2, the group (1.3) becomes [I, m], the complete symmetry group of 
the regular polyhedron” {l, m}. The subgroup generated by RR: and R2R; 
is the rotation group [I, m]’. When mis even, the operators R,R:z and R; generate 
another subgroup (likewise of index 2), which we call [l’, m]. Writing S = R,R2, 
T = Rs, we find (since Ri, R; are commutative) S-'TST = (R2R;)*. Thus the 
generators of [l’, 2p] satisfy 


(2.1) S! = T? = (S"TST)? = 1. 


They may possibly satisfy other relations, independent of these, but we can 
assert that [l’, 2p] is at least a factor group of (2.1). 

To show that [l’, 2p] is in fact the whole group (2.1), we observe" that the 
operators s = S—', s’ = TST of (2.1) satisfy 


(2.2) s' = gs"! = (ss’)? = 1, 


Since the group (2.2) is invariant under 7, it is a subgroup of index 2 in (2.1). 
Similarly it is of index 2 in 


s =? = (st)? = 1 (s’ = tst). 


This last group is [l, 2p]’. 

Thus (2.2) is of index 4 in [l, 2p], and of index 2 in (2.1). But [l’, 2p] is of 
index 2 in [l, 2p], and is a factor group of (2.1). Therefore [l’, 2p] and (2.1) are 
the same group. 

Expressing this result in geometrical terms, we have 

THEOREM 2. The group [l’, 2p] has for fundamental region an isosceles triangle 
of angles 2x/l, x/2p,x/2p. It is generated by rotation through 2/1 about the apez, 
and reflection in the base.” Its abstract definition is 


S!' = T? = (S“TST)? = 1. 


In the figure on page 67, let CAA’ be this isosceles triangle, C’ the image of the 
apex C in the base AA’, and B the mid-point of AA’ (or of CC’). Then ABC isa 
fundamental region for [l, 2p], while CAA’ and ACC’ are alternative funda- 
mental regions for [l, 2p]’. T is the reflection in AA’, S or s“ is the rotation 
through 27/1 about C, s’ is the opposite rotation about C’, ss’ or S~'TST is the 
rotation through 27/p about A, and ¢ is the rotation through z about B. 

By evaluating the side BC of the triangle ABC, we obtain the following 

Coro.tutaRy. The group [l’, 2p] is generated by rotation through 2x/l about a 
point, and reflection in a line distant \ from this point, where 


sin r/l cos kX = cos 1/2p; 


1° Bounded by I-gons, m at each vertex. 

11 For this remark I am indebted to Dr. Sinkov. 

12 In other words, this group is generated by rotations about the centers of the faces of the 
polyhedron {l, 2p}, and reflections in its edges. 
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k = 1, 0 or i according to the sign of 2/l + 1/p — 1, the plane being spherical, 
euclidean, or hyperbolic, in the three cases. 
Thus the group is finite only when 


(2.3) 2/l+1/p>1, 


its order then being 4/(2/l + 1/p — 1). 

[l’, 2] is the equatorial group C’, (occurring in crystallography when | = 2, 3, 
4 or 6). It is the direct product of the cyclic group of order 1 with the group 
of order 2 generated by the equatorial reflection. 

[2’, 2p] is the dihedral alternating group Dé. When p is odd, this is the direct 
product of the dihedral group [2, p]’ with the group of order 2 generated by the 
central inversion. In particular, the rhombohedral group [2’, 6] can be obtained 
by adjoining the central inversion to the trigonal dihedral group. 

[3’, 4] is the pyritohedral® group T’. This is the direct product of the tetra- 
hedral group [3, 3]’ with the group of order 2 generated by the central inversion. 

If 2/1 + 1/p = 1, there are two groups illustrated on pages 66, 67. These 

are Pélya’s“ D{ and D3, Niggli’s Ci, and Cj,. In both cases, (S~'T)? and 
(ST)? are translations, and we have the finite factor groups" 


St = T? = (S“TST)? = (S“T)*(ST)* = 1, of order 8(8? + ¢); 


S$? = T? = (S“TST)* = (S“T)*(ST)* = 1, of order 6(b? + be + @). 


The subgroups generated by S~' and TST are two of Burnside’s groups men- 
tioned above. 


The general case 
Let [ki, ke, ks3] denote the group 
Rj = Ri = Ry = Ri = (RR) = (RaRs)* = (RRs) 
= (RiR;)? = (RR)? = (RR)? = 1. 


(3.1) 


Since every generating relation here involves an even number of generators, 
there must be a self-conjugate subgroup of index 2, say [ki, ke, ks]’, consisting of 


13 A. F. Mébius, Symmetrische Figuren, Gesammelte Werke, vol. 2 (1886), p. 672. 

4G. Pélya, Uber die Analogie der Kristallsymmetrie in der Ebene, Zeitschr. fiir Kristal- 
log., vol. 60 (1924), p. 281. 

18 P. Niggli, Die Flachensymmetrien homogener Diskontinuen, ibid., p. 291. 

16 These will be studied at greater length in Part IT. 
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all those operators of [ki, ke, ks] which are products of even numbers of R’s. 
[ki, ke, ks]’ is clearly generated by the operators 
T, — R,Ro, Ts — RRs, T3 = R;R,, 
which satisfy" 
T* = T* = T; = (1,T:2)? = (1T:T:2T3)? = (T2T3)? = 1. 
If ke is even, all these relations involve T2 an even number of times. There 
must then be a self-conjugate subgroup of index 2, say [ki, ke, ks]’’, consisting 


of all those operators of [ki, k2, k3]’ which involve 7; an even number of times. 
By repeated application of the relations 


TT, = T;'T;', T,T; = T;'T:', 
T.T{' T3T,T., T,T;' = T3T,T,, 


ll 


all T.’s that occur to an odd power (in the expression for any operator) can be 
collected in pairs; thus [ki, ke, ks]’’ is generated by 7, T;, and T}. 

Since T7'T,T,T;' = R,R,R,R,R,R,R,R, = R,R,R,R, = T}, the operators T, 
and 7’; generate [k,, ke, k3]’ or [ki, ke, ks]’’ according as kz is odd or even.” In 
the latter case they satisfy 

Tt; = T; = (T;'T3T,T;')™ = 1. 

Since we are chiefly concerned with the case when kz is even, it is convenient 
to write [l, 2p, m] instead of [k;, ke, ks]. We shall also write S for 7, and T for 
T;',sothat S = RR, T = R4R3. Since [l, 2p, m]’’ and [m, 2p, l]’’ are identical, 
we can assume that 1 = m. 

Geometrically,” [l, 2p, m] is generated by reflections in the faces of a “double- 
rectangular” tetrahedron, whose dihedral angles are 


This tetrahedron will generally have to be in hyperbolic space, but it will be in 
spherical or euclidean space when I, m, p are sufficiently small. 

Since the subgroup [l, 2p, m]’’ is of index 4, its fundamental region must be 
made up of four such tetrahedra. From one such tetrahedron, the other three 
are conveniently derived by reflecting in the faces 1 and 4. The whole funda 
mental region is then a tetrahedron in which two opposite edges are perpendicu- 


m/m, 


a/2. 


17Cf. J. A. Todd, The groups of symmetries of the regular polytopes, Proc. Camb. Phil. 
Soc., vol. 27 (1931), p. 217. 

18 Todd (ibid., p. 229) proved that it is impossible to generate [3, 4, 3]’ by two operators. 

19 Todd, ibid., pp. 214, 225. 
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lar, all the others being equal. Calling the faces 2, 3, 2’, 3’, the dihedral angles 
are 


(2 2’) = 2x/l, (3 3’) = 2x/m, 
(2 3) =(2’ 3) =(2 3’) = (2’ 3’) = x/2p. 
D 








AEE RR 


o 


In the diagram / is BDD’, 2 is ADD’, 2’ is A’DD’, 3 is AA’D, 3’ is AA’D’, 
4is AA‘C. 

The generators are rotations around the two perpendicular edges DD’, AA’: 
S carries face 2’ into the position previously occupied by 2; T carries 3’ into the 
position previously occupied by 3. If we associate the original fundamental 
region with the operator 1, the surrounding regions, beyond faces 2, 3, 2’, 3’, 
correspond to the operators S, T, S~', T—', respectively. (Face 2 of tetrahedron 
1 is face 2’ of tetrahedron S, and so on.) 

We have already seen that the rotations S, T satisfy the relations 


(3.2) S' = T* = (S"T-'ST) = 1, 


and that they suffice to generate the whole group [l, 2p, m]’’.. We shall next 
prove that every relation satisfied by these rotations is a consequence of {3.2). 

To find the operator that corresponds to a given region (i.e., that transforms 
region 1 into the given region), we consider any path from a point within region 
1 to a point within the given region. Since we are taking products of operators 
from left to right,” the successive steps along the path have to be written from 
right to left. (E.g., we pass through face 3 of region 1 into region 7, through 
face 2’ of region T into region S~'T, through face 3 of S“'T into TST, and 
through face 3 of TS“'T into T?S“T.) 

Any relation satisfied by the generators provides two different symbols for one 
region, and so corresponds to a closed path. The situation is easily visualized by 


20 Cf. Annals of Math., vol. 35 (1934), p. 599, where, unhappily, I adopted the opposite 
convention. 
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thinking of the path as an elastic string, threaded through a network of rigid 
wires forming the edges of all the tetrahedra. The path can be shrunk to a 
point by allowing it to slip through these edges, one at a time, the corresponding 
relation being simplified, at each stage, by means of the generating relation 
S' = lor T™ = 1 or (S'T-"ST)? = 1, according to the type of edge through 
which the path slips. The sufficiency of these generating relations thus follows 
from the simple connectivity of the spherical, euclidean, or hyperbolic space, and 
we have 

THEOREM 3. The group [l, 2p, m]’’ has for fundamental region a tetrahedron 
with dihedral angles 21/1, 2x/m at two opposite edges, the four remaining dihedral 
angles being r/2p. It is generated by rotations about the two special edges." Its 
abstract definitionis S' = T™ = (S*T“ST)? = 1. 

The distance between the opposite edges DD’, AA’ of the tetrahedron AA’DD’ 
is just the length of the edge BC of the double-rectangular tetrahedron ABCD. 
Evaluating this by spherical trigonometry, we obtain the following 

Coro.titary. The group [l, 2p, m]’’ is generated by rotations through 27/l, 
22/m about two perpendicular lines, distant \ apart, where 


sin +/l sin r/m cos kX = cos x/2p; 


k = 1,0, or i, according to the sign of sin x/l sin +/m — cos 2/2p, the space being 
spherical, euclidean, or hyperbolic, in the three cases. 
Thus the group is finite only when” 


(3.3) sin r/l sin r/m > cos x/2p. 


When m = 2, this condition reduces to (2.3); in fact [l, 2p, 2]’’ ~ [l’, 2p]. When 
p = 1, we have the direct product of cyclic groups of orders 1, m: [l, 2, m]’’~ [I]’ 
X [m]’. The remaining case, when! = m = 3 and p = 2, appears to be a new 
discovery : [3, 4, 3]’’ is of order 288, since [3, 4, 3] is of order 1152. (Actually, it 
is the commutator subgroup of [3, 4, 3].) 

There are no possibilities in the critical case when sin 7/1 sin x/m = cos x/2p, 
save such as have m = 2. When m = 2 and 2/1 + 1/p = 1, the fundamental 
region becomes a baseless prism whose cross-section is the isosceles triangle con- 
sidered in the two-dimensional representation. In other cases where m = 2, 
the fundamental region has a pair of antipodal vertices (ideal, in the hyperbolic 
case”), 

The trigonometry involved in proving the above corollary takes no account of 


21 In other words, this group is generated by rotations about the edges of the two recip- 
rocal polytopes {m, 2p, l}, {l, 2p, m}. For the theory of infinite regular polytopes, see 
Coxeter, Proc. Camb. Phil. Soc., vol. 29 (1933), pp. 1-7. 

22 This is, of course, the condition for the polytope {l, 2p, m} to be finite. 

23 FE. Goursat, Sur les substitutions orthogonales et les divisions régulitres de l’espace, 
Ann. Sci. de l’Ecole Norm. Sup., (3), vol. 6 (1889), p. 87. 

** The fundamental region has two ideal vertices whenever 2/1 + 1/p <1. All four 
vertices are ideal if in addition 2/m + 1/p < 1. 
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the rationality of 1, m, p. We can therefore state that the commutator of rota- 
tions 6 and ¢ about perpendicular lines distant d apart is a rotation y, where 


cos (¥/4) = sin 6/2 sin ¢/2 cos kx. 


Translating this result (in the spherical case) into terms of euclidean four-space, 
the commutator of pure rotations @ and ¢ about (not absolutely) perpendicular 
planes inclined at angle \ is a pure rotation y, where™ 


cos (y/4) = sin 6/2 sin ¢/2 cos X. 


THEOREM 4.% Jn the group S' = T™ = (S“T“ST)? = 1 with sin r/l sin r/m 
<S cos 1/2p, the product of the generators is of infinite period. 

Lemma 4.1. Jn spherical, euclidean, or hyperbolic space, the continued product 
of the reflections in the faces of a tetrahedron is an operation that leaves no point 
invariant. 

This is analogous to Lemma 1.1, from which it easily follows. 

There is also a precise analogue of Lemma 1.2 in hyperbolic three-space 
(proved by considering concurrent time-like lines in Minkowski four-space). 

Proof of Theorem 4. We have seen that the generators Ri, Re, R3, Ry of 
[l, 2p, m] are reflections in the faces of a tetrahedron. By Lemma 4.1, the opera- 
tion RiR2:RsR; (=ST) leaves no point invariant. Hence, the space being 
euclidean or hyperbolic (in virtue of the inequality), ST is of infinite period. 

Clearly this holds also for ST—. 

The same result could have been obtained trigonometrically,” by showing 
that ST (or ST") is a double rotation of angles x, x’, where cos? (x/2), cos? (x’/2) 
are the roots of the equation 


(x — cos? r/l)(x — cos* x/m) = x cos*® x/2p. 


Elliptic space 


TuHeoreM 5. Let G denote the dihedral alternating group of any odd degree, or 
the pyritohedral group, or the new group [3, 4, 3]’’. Then G has a central of order 
2, generated by the central inversion® (ST), where h is the period of ST. 

The only case that presents any difficulty is the last. In [3, 4, 3], the central 


28 Cf. G. de B. Robinson, Proc. Camb. Phil. Soc., vol. 26 (1930), p. 309. On replacing 
vy by y + 2x, we see that our formula is equivalent to his sin*(y/4) = sin*(@ /2) sin*(y/2) 
(1 — P?, — P%,) with Py = 0 and Py = sind. 

% Cf. Theorem 1. 

27 In the notation of F. N. Cole, On rotations in space of four dimensions, Amer. Jour. of 
Math., vol. 12 (1890), pp. 205-208, the product of pure rotations @ and ¢ about planes 
(0, 0, 1, 0, 0, 0) and (Pes, Px, 0, Pis, Pas, Psa) is a double rotation of angles x, x’, where, since 
a = tan (0/2), b= c=f=g=h=8= 8’ =0, D =1, B” = BB’, and 8” = af’, 
sec (x/2) see (x’/2) = sec (0/2) see (¢/2), tan (x/2) tan (x’/2) = Py tan (6/2) tan (¢/2). 
It follows that cos (x/2) cos (x’/2) = cos (@/2) cos (v/2) and cos? (x/2) + cos* (x’/2) = 
cos? (0/2) + cos? (v/2) + (1 — Pi.) sin? (6/2) sin? (g/2) = cos*® (0/2) + cos® (v/2) + 
cos? (y/4). For this calculation I am indebted to Dr. Robinson. 

28 Coxeter, Annals of Math., vol. 35 (1934), p. 606. 
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inversion can be expressed in the form” (R,R2R4R3)*; this is the operator (ST)* 
of [3, 4, 3]’”. 

The central quotient groups, 3G, can be regarded as operating in elliptic space. 
Abstractly, they are given by inserting the extra relation (ST)” = 1. Goursat® 
has enumerated all the crystallographic groups in elliptic three-space. Among 
these we easily pick out XX as the central quotient group of [3,4,3]’’. (It has 
the right order: 144.)* 

We thus have the following simple isomorphisms: 


3[2, 2p, 2)" _ 3[2’, 2p) — (2, p)’ (p odd; h= 2p) 
3[3, 4, 2)’ — 3[3’, 4] _ [3, 3)’ (h = 6), 
3[3, 4, 3]’’ ~ [8, 3)’ X [8, 3)’ (h = 12). 


An infinite group in which both product and commutator have specified 
periods 
Let (l, m, n; p) denote the group S' = T™ = (ST)" = (ST“'S"T)” = 1. This 
is not altered by permuting 1, m, n, since it can be put into the symmetrical form 
S'= T™ = U" = STU = (SUT)? = 1. After comparing Theorems 1 and 4, 
it is natural to wonder whether (1, m, n; p) is necessarily finite. We shall show 
that this is not so, since in fact (6, 6, 2; 2) is infinite.® 


THEOREM 6. The group S* = T* = (ST)? = (ST-)* = (ST“S"T)? = 1 
is of order 96 q’. 
Lemma 6.1. The group S" = T™ = (ST)? = (ST-')" = (ST“S"'T)? = 1 


is a subgroup of index 2 in st = t™ = (st)? = (st)?? = (st“'s-4)" = 1. 

This is easily proved by writing S = st = st's"', T = t"', so that 
ST = #?, ST = st"s-% and ST*S“T = st's“t#t"'s%—_ = (st)? . 

Lema 6.2. The group st = t® = (st)? = (st-')* = 1 is infinite, and has a 
representation in euclidean three-space in which the operation (st!s~'t)’ is a 
translation. 

We know that the group [4, 3, 4] (defined in (3.1)) is the complete symmetry 
group of the cubic lattice in ordinary space.* It has an involutory auto- 
morphism R3, such that 


R: = Ri RR, , R, = RRR, . 


29 Ibid., p. 608, (vi). 

3° Loc. cit., p. 66. 

31 Dr. Sinkov will clinch the matter by proving abstractly that } [3, 4, 3]’’ is the direct 
product of two tetrahedral groups. He will also consider other factor groups of [3, 4, 3]’’, 
obtained by assigning a smaller period for ST’. 

32 This result is of special interest in view of the fact that (7, 6, 2; 2) is finite (of 
order 2184). Another example is given, in effect, by H. R. Brahana, On the groups generated 
by two operators of orders two and three whose product is of order eight, Amer. Jour. of 
Math., vol. 53 (1931), p.901. His results show that (3, 2, 8; 6) isinfinite. Elsewhere, we 
shall prove that there are infinitely many infinite groups (l, m, n; p). 

33 Coxeter, The densities of the regular polytopes, Proc. Camb. Phil. Soc., vol. 27 (1931), p. 
202 (§3). 
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(Geometrically, this is the rotation through 7 about the line joining the points 
3,3, 0) and (0, {, 3); it interchanges two reciprocal polytopes {4, 3, 4}.) By 
adjoining R, to [4, 3, 4] we derive the group 


Ri = R,° = Rj = (RiR2)* = (R2R3)* = (RRs)? = (RR, RsRy)* = 1. 
Writing s = R,R}, t = RRs, we obtain a subgroup of index 2: 
# = & = (st)? = (st) = 1. 
Clearly st—'s-4@ = (RiRjR;)? = RRR. 


We now make use of Theorem 13 of Discrete groups generated by reflections,™ 
which tells us that, of the cycles in which the operation R,R2R,R; of (ki, ke, ks) 
permutes the vertices of the polytope \ki, ke, ks}, one ts the cycle of vertices of a 
Petrie polygon. Now, the Petrie polygon of the net of cubes {4, 3, 4} is a helical 
polygon, whose sides take the three principal directions in turn, proceeding 
(say) from the origin to the points (1, 0, 0), (1, 1, 0), (1, 1, 1, (2, 1, 1), 
(2,2, 1),---. Hence the operation R,R2R,R; or st~'s“t is a trigonal screw. 

Proof of Theorem 6. The translation (st~'s~'t)* and its conjugates generate a 
three-dimensional lattice-group. The quotient group s* = f = (st)? = (st-)* = 
(st's-4)’ = 1 is of order 192 (by direct calculation). By taking a longer 
translation, we see that the group s* = f = (st)? = (st) = (st's-¥4)* = 1 
is of order 192 g*. The theorem now follows from Lemma 6.1. 

In a somewhat similar manner, using the infinite groups 


1 = Ri = Rj = Ri = (RiR3)* = (ReR,)? 

(RiR2)* = (R2Rs)® = (RsR,)* = (RiRi)* = 1, 

t® = (st)? = (st-)* = 1, 

(ST)? = (ST-)" = (ST“S“'T)? = 1 


sf 


we may prove that the group St = T® 
is of order 192 q’. 


Trinity CoLLeGE, CAMBRIDGE. 
34 Coxeter, Annals of Math., vol. 35 (1934), p. 605. 


35 By virtue of Lemma 6.1 we need only verify that the subgroup S* = 7* = (ST)? = 
(ST-)3 = (ST“S“T)? = 1 is of order 96. 














THE GROUPS DETERMINED BY THE RELATIONS 
S' = T° = (S"' T— ST)? = 1 


Part II 
By ABRAHAM SINKOV 


1. Introduction. The purpose of the second part of this paper is to present 
a uniform abstract treatment of the spherical and euclidean groups satisfying 
the conditions given in the title. These have already been considered by Prof. 
G. A. Miller in four different papers._ The methods given here, however, are 
quite different from those which he used, yielding more general results and a 
number of properties of the groups not considered by him. In addition, an 
error is indicated in Miller’s results for the case 1, m, p = 3, 3, 2. He finds the 
largest group possible under these conditions to be of order 144. It will be shown 
in what follows (and has already been shown? in Part I) that this number should 
be 288. 


2. Conditions for finiteness. It can be shown very simply by abstract 
methods that the only finite groups determined by the relations in question are 
those given by the solutions of the inequality sin 7/1 sin x/m > cos x/2p. This 
is accomplished by making use of the known results regarding finiteness in the 
case of the relations ce = r“ = (er) = 1. Thus, suppose p >3. Then, since 
the subgroups {S-', TST} and {7-', S~’T'S} correspond to the cases L,M,N = 
l, l, p and m, m, p respectively, neither 1 nor m may exceed two. Similarly, if 
p = 2, neither 1 nor m may exceed three. Hence, except the case p = 1, in 
which the groups are abelian, the only finite groups correspond to the cases 
l,m, p = 2, 2, p; 3, 2, 2; 3,3, 2. These will be considered first. 


3. The case 1, m, p = 2, 2, p. Since the defining relations reduce to 
S? = T? = [(ST)?|" = 1, the period of ST is a divisor of 2p. It is exactly 2p 
if p is even and either p or 2p if pis odd. Hence 

TueoreoM 1. If p is even, the conditions S? = T? = (S'T-'ST)” = 1 deter- 
mine the. dihedral group of order 4p. If p is odd, these conditions determine 
either the dihedral group of order 4p or that of order 2p, according as the period of 
ST is 2p or p. 


4. The case |, m, p = 3,2,2. The subgroup A = {S~', TST} is generated 
by two operators of period three whose product is of period two; it is therefore 


Received April 11, 1935. 

? Proceedings of the National Academy of Sciences, vol. 18 (1932), p. 665; ibid., vol. 19 
(1933), p. 199; Téhoku Mathematical Journal, vol. 38 (1933), p. 1; Journal of the Indian 
Mathematical Society, vol. 20 (1933), p. 145. 

? Cf. the discussion of the group [3, 4, 3) following the corollary to Theorem 3. 
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tetrahedral. Each of its generators is transformed into itself by 7. It follows 
then that G is of order 24 and is the direct product of the tetrahedral group with a 
group of order 2. This group of order 24 is the most general group satisfying 
the initial conditions; in it ST is of period 6. 

That (ST)*® = 1 is a consequence of l, m, p = 3, 2, 2 may be verified rather 
neatly as follows. The commutator subgroup of G is the non-cyclic group* of 
order 4. Therefore 


STS“T.TS“TS = TS“TS-.STS“T, STSTS = TS“*TS“TS“T, (ST)* = 1. 


As a consequence, in any smaller group which satisfies the conditions 1, m, p = 
3, 2, 2, ST must be of period 3 or 2. If (ST)* = 1, G is the tetrahedral group. 
(ST)? = 1 is impossible, since S* = T? = (ST)? = 1 defines the non-cyclic group 
of order 6, in which the commutators are of period 3. 

THEOREM 2. The conditions S* = T? = (S-'T-'ST)*? = 1 generate either the 
direct product of the tetrahedral group and the group of order 2, or the tetrahedral 
group, according as ST is of period 6 or 3. 


5. A general property of different groups corresponding to given /, m, p. 
The larger group of the two obtained in Theorem 2 is the direct product 
of the smaller one and a cyclic group. The same statement is true for the two 
groups obtained in Theorem 1 when p is odd. Such a property occurs in other 
cases where more than one group results for given 1, m, p, and it is of interest to 
study general conditions under which it occurs. Suppose that, in one of the 
groups G in question, S*7® is of period r, and (S*7'*)’’* is invariant in G. The 
latter operator will generate a cyclic invariant subgroup of order s. If in the 
quotient group K the operators ¢ and r correspond to S and 7’, then K is defined 
by the relations o = 1" = (o7' tr! or)? = (0% r)"/* = 1. 

We now inquire under what circumstances G will be the direct product of K 
and a cyclic group of order s. Obviously a necessary and sufficient condition is 
that it be possible to select one operator from each co-set of G as regards 
{(S#7'*)"/*} in such a way that the totality of operators thus selected will form 
a group. Now each operator in the co-set containing S is of the form S(S*7T*)'""*; 
each operator in the co-set containing T is of the form T(S*T*)"'*. The com- 
mutator of two such operators is S“'T"'ST. If we let o; and r, represent par- 
ticular operators in the co-sets containing S and 7’, they satisfy the relations 


op = tt =(oi' ri! oan) =1, 
and will generate a group simply isomorphic with K if (ef rf)"* = 1. Now 
of r? _ (S*7'8)'+ratsB/s ’ 


+A. Sinkov, A set of defining relations for the simple group of order 1092, Bull. Amer. Math. 
Soc., vol. 41 (1935), p. 240. 











76 ABRAHAM SINKOV 
If it is of period r/s, 
r ry. p 
‘(1 + “fia + ial) = 0 (mod r), 


1 + * (ia + j8) = 0 (mod s). 


This congruence will have a solution if and only if r/s and s are relatively prime. 
We thus get the following 
THEOREM 3. If ina group G defined by the relations 


S' = T° = (S"T"ST)? = (S*T*) = 1 


the operator (S*T*)"'* is invariant, the quotient group K of G by the cyclic group 
{(S*T'*)"'*} is defined by the relations 


S'= T = (S?T- ST)? = (SeT*)"'* = 1. 


A necessary and sufficient condition that G be the direct product of K and the cyclic 
group of order s is that r/s and s be relatively prime. 


6. The case 1, m, p = 3,3,2. The subgroup A = {S~', TST} is generated 
by two operators of period 3 whose product is of period 2, and is therefore tetra- 
hedral. It is conjugate, under T7,, to the two subgroups B = {S-', TST} and 
C = {|T"“ST, TST}. Hence D = {A, B, C} is generated by three operators 
S-'!, T“ST, TST, any two of which generate a tetrahedral group. For pur- 
poses of convenience, these three generating operators will be designated a, b 
and c respectively. 

It is possible to determine the order of D very readily by enumerating its co- 
sets with respect to A. In the notation given below, each operator represents a 
right co-set; for example, c represents all the operators obtained by multiplying 
each operator of A on the right by c. With this convention, it follows that 
every operator of D is contained in the eight co-sets 1, c, c?, cb, c®a, cab, cab*, 
abc. Numbering these co-sets from 1 to 8, the operators a, b and ¢ are repre- 
sented as a = (1) (2,3, 5) (4, 7, 6) (8), b = (1) (2, 4, 3) (5, 6, 7) (8), e = (1, 2, 3) 
(4) (5) (6,8, 7). It is thus seen that D is of order 96, if no further restrictions 
are placed on S and T. Since D is invariant under 7, the order of G is at most 
288. 

This result is at variance with that obtained by Miller,‘ and it is thought ad- 
visable to demonstrate it by his method, which helps to establish further proper- 
ties of the groups. 

The powers of the generators S and T give rise to four possible distinct 
commutators: o, = S'T"ST, o. = TS'T"'S, o; = ST"S"'T, og = TST“'S“' 
generating the commutator subgroup® H of G. If H contains neither S nor T, 


* Second reference, footnote 1, p. 200. 
*G. A. Miller, On the commutator groups, Bulletin of the American Mathematical Society, 
vol. 4 (1897), p. 136. 
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it yields a quotient group generated by two operators of period 3. This quotient 
group is abelian, and therefore the index of H under G is at most 9. 

Now, as was shown by Miller, o; and o, are each commutative® with both 
o:and 3. The subgroup generated by o2 and gs is therefore invariant in H, and 
is of course dihedral. Again S~'e2S = o204, S-'o3S = o;. Hence S{o2, o3}S = 
{o2e4, 01}. This new dihedral group E = {o204, 0} is also invariant in H. It 
therefore contains the transform of o20, by «3; and the product of this transform 
by o204, i.€., o204-030204073 = o2030203. If the order of E exceeds 4, the operator 
(e203)? must be contained in its invariant cyclic subgroup. In addition, it is 
transformed into itself by o:. Therefore o203 is at most of period four.’ Now 
o.3 = TS?“T“S°T“S“T“T = T(S“T-)T—, and it follows that the 
period of ST divides 12. 

Suppose o2¢; is of period 4. Then 2040; is also of period 4. But oz and oy; 
are commutative, and as a result oo; is of period 4. Hence, the two permutable 
dihedral groups {o1, 74} and {o2, o3}, which taken together generate H, are each 
of order 8. Now oio20304 = TS“'TS“'T-'S“"T and is of period 2. Therefore 
(o104)? = (e203). The cross-cut of the two dihedral groups being considered 
is thus of order 2. Hence H must be of order 32 and G is of order 288. 

That this group of order 288 actually exists can be verified by direct calcula- 
tion.’ By means of this calculation a representation of G is obtained which is 
transitive on 32 letters. The subgroup on which this representation is based, 
i.e., the subgroup which keeps the element 1 unchanged, is F = {S8,(T?STST?)*} ; 
it is a non-cyclic group of order 9. 


S = (2, 3, 5) (4, 7, 11) (6, 9, 15) (12, 19, 18) (8, 13, 21) (10, 17, 14) (20, 16, 24) 
(27, 32, 30) (22, 31, 23) (25, 28, 29) (1) (26) 


T = (1, 2, 4) (3, 6, 10) (7, 12, 20) (5, 8, 14) (11, 18, 27) (9, 16, 25) (19, 17, 26) 
(13, 22, 32) (15, 23, 24) (21, 30, 29) (31) (28). 


This group G of order 288 is the most general group obtainable from the given 
initial conditions. 

We wish next to determine what further groups are possible if additional re- 
strictions are placed on S and T. Obviously these restrictions must be placed 
on the period of ST and the only possibilities to be considered are the divisors 
of 12. 

Suppose first that (ST)® = 1. In determining the resulting group, it will first 
be shown that (S7')* is invariant in G. For (e203)? = T(ST)*T"' = T-“(ST)*T 
is the only invariant operator in H besides the identity. It is therefore invariant 
inG. Since it is transformed by T~ into (ST')*, the latter is invariant in G, 
whose central is consequently of order 2. 


® Cf. the reference given in note 4, p. 199. 

7 Apparently Miller overlooked this possibility. Cf., also, the first paper mentioned in 
Note 1, p. 668. 

* Performed by Dr. Coxeter in connection with his geometric approach. 
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By Theorem 3, the central quotient group K, of order 144, is defined by the 
relations S* = T*? = (S“'T-'ST)? = (ST)* = 1. It is not contained inG. In 
it o1, 72, os and a, are all permutable, since oo, and o2c3 are both of period 2. 
Its commutator subgroup is therefore an abelian group of order 16 and type 
(1, 1, 1, 1). The subgroup U = {(ST)*, (7S)~*} is tetrahedral, since 
(ST)?.(TS)? = (ST-)', an operator of period 2. The automorphism ob- 
tained on replacing S by S' and leaving T unchanged replaces U by a second 
tetrahedral group V = {(S“'7’)?, (T7S-')?}. Each of the generators of V is per- 
mutable with both generators of U. Therefore {U, V} is the direct product 
of two tetrahedral groups, and, being of order 144, must coincide with K. 

A pair of generating permutations for K is readily obtainable from those for G 
by adjoining to F the operator (ST)*. The resulting subgroup W which is used 
as the basis of the new representation is of order 18, and the largest invariant 
subgroup of G contained in it is of order 2. On obtaining (ST)* by direct calcu- 
lation, and equating each pair of elements in each of its cycles, we get 


S = (1) (2, 3, 5) (4, 7, 11) (6, 9, 15) (8, 13, 21) (22, 31, 23) 
T = (31) (1, 2, 4) (3, 6, 11) (7, 5, 8) (9, 13, 22) (15, 23, 21). 


Suppose (ST)* = 1. Then, to find the resulting group, it is first necessary to 
determine the smallest invariant subgroup Q of G which is generated by (ST))* 
and its conjugates. Since (ST7')‘ is transformed by S into (7'S)*, Q contains the 
operators (ST')* and (T'S)* = (S-'T-*)*. These two operators are of period 3; 
their product (ST)*.(S-'T-’)* = (TS) is of period 4. Hence Q is at least® of 
order 24. It follows that the group determined by the relations S* = T* = 
(S“T-ST)? = (ST)* = 1 is at most of order 12. If we agree that the period 
of ST must be exactly four, then it is obvious that no group exists for the 
above set of relations. 

In the group of order 144 the operator (ST7')* and its conjugates generate a 
4-group. The quotient group, of order 36, is defined by S* = T* = (S'T-"'ST)? = 
(ST)* = 1, and is the direct product of the tetrahedral group and a group of 
order 3. 

Finally, if (ST)? = 1, the resulting group is tetrahedral. 

TueoreM 4. The relations S* = T* = (S“'T“'ST)* = 1 define only four differ- 
ent groups according as ST is of period 12,6,30r2. They are of orders 288, 144, 
36, and 12, respectively. 


7. The case l,m, p = 4,2,2. We now wish to consider the euclidean groups, 
viz., those for which sin x/l sin x/m = cos x/2p. The only solutions this equa- 
tion has are l, m, p = 4, 2, 2 and 3, 2,3. Let us study the case 4, 2, 2 first. 
The subgroup A = {S~', T-'ST'| is generated by two operators of period 4 whose 
product is of period 2. These relations of = r* = (er)? = 1 were first studied 


*G. A. Miller, Groups generated by two operators of order 3 whose product is of order 4, 
Bull. Amer. Math. Soc., vol. 26 (1919-20), p. 361-369. 
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in detail by Burnside,” who found that the most general additional restriction 
that could be imposed on o and r is of the form (e~'r)’ (or~')* = 1, and that the 
resulting group is of order 4(b? + c*?). The four relations ot = r* = (er)? = 
(o~—'r)’ (or~')* = 1 imply that the common period of the commutative operators 
or and o~'r is d(bj + cj), where b; and c; are relatively prime, and b = db, 
c = de. If we set a = d(bj + cj), the largest group in which (o~'r)* = 1 is 
of order 4a’. 

A isinvariant underG. For T transforms each of the generators of A into the 
inverse of the other. Since the adjunction of T to A will generate G, A is of 
index 2 or 1 under G, according as T is or is not contained in A. Suppose T is in 
A. Then since the commutator subgroup of A is abelian 


S“TST.TSTS“ = TSTS“.S“TST , 
SoTSTS? = TSTS.TSTS“ = STST, (TS*)* = 1. 


Now o*7? = TS*TS?. Therefore T is contained in A only when the period of 
or? divides 2. These exceptional cases will be considered later. 

In every other case, A is of index two under G, so that the order of G is 
8(b? + c*). Since o'r = (ST)? and or = (S“T)?*, the additional relation 
takes the form (ST)* (S“'T)* = 1. This relation, together with the initial 
conditions St = T? = (S-'T-'ST)*? = 1, implies that the period of (ST)? is a. 
If a is even, then ST is of order 2a. If @ is odd, there are apparently two 
possibilities for the period of ST. But one of these leads to a contradiction, 
for if ST is of odd period, the subgroup A which contains (ST)* will contain 
ST, and hence T. This is impossible, since A is of index two under G and it 
follows that ST is always of even period. 

The largest group in which ST is of period 2a is now seen to be of order 8a’, 
and such a group exists for every value of a. A pair of generating permutations 
for the general group may be obtained as follows. First a pair of permutations" 
are set down which will generate the subgroup A of order 4a: 


S = (1, 2, 3, 4) (5, 6, 7, 8) --- (4a — 3, 4a — 2, 4a — 1, 4x) 
TST = (3, 4, 5, 6) (7, 8, 9, 10) --- (4a — 1, 4a, 1, 2). 


It is now desired to find a permutation 7’, of period 2, which will transform each 
of the above substitutions into the inverse of the other. This is relatively simple; 
if it is supposed only that T replaces 1 by 2, it follows that 


T = (1, 2) (3, 4a) (4, 4a — 1) --- (Qa + 1, 2a + 2). 
This permutation, together with 
S = (1, 4, 3, 2) (5, 8, 7, 6) --- (4a — 3, 4a, 4a — 1, 4a — 2), 
1© W, Burnside, Theory of Groups of Finite Order, Cambridge, 1911, p. 416. 


"W. E. Edington, Abstract group definitions and applications, Transactions Amer. 
Math. Soc., vol. 25 (1923), p. 198. 
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generates the required group of order 8a’. It is easy to verify that S'T-'ST 
is of period 2 and that ST is of period 2a. 

Consider now this group G of order 8a. Let P be an operator in the group 
of the form (ST)* (S-'T)*, where bj + cj divides a. Then the period of P is 

i +c}. If we form the invariant subgroup U generated by the complete set 
of conjugates containing P, and then determine the quotient group V of G 
with respect to U, then V is defined by the relations St = T? = (S“'T-'!ST)? = 
(ST)* (ST) = 1, which imply (ST)** = 1. These relations are known to 
yield a group of order 8(b? + c?), which is the quotient of 8a? by b} + c?. 
Therefore U is the cyclic group generated by P and we have the following 

THEOREM 5. In the group of order 80? defined by the relations l, m, p = 4, 2,2 
and the additional condition (ST)** = 1, any operator of the form (ST)® (S-'T)* 
for which b} + cj divides a is conjugate to powers of itself only. 

Since the quotient group of G by {P} is of order 8(b? + c*), it follows from the 
existence of a group G for every a that a group K of order 8(b? + c?) exists for 
every pair of numbers b and c. 

To complete the investigation, it is now necessary to consider the exceptional 
cases (o?7?)? = 1. In each of these cases, it is possible to select two operators, 
viz. o~' and or, which are of period 4 and 2 respectively, have a commutator of 
period 2 and generate the entire group. As a result, the exceptional cases are 
found to coincide with the groups of order 8a? and 8(h? + h?) fora = landh = 1. 

In any group K the operator (ST’)* can be shown to be conjugate to its own 
powers only. Hence, the quotient group of K by {(ST)*} is of order 8d? and 
is defined by the relations St = T? = (S-'T-!ST)? = (ST)* = 1. Since it is 
one of the groups G, there exists a reciprocal relation between the groups G and 
the groups K, which is best expressed as follows. For every number w of the 
form b] + c? there exists an infinite family of groups K of order 8d2(bj + c7) 
which has with the groups G the reciprocal relation that a group in either 
family is obtainable as a quotient group in the other. 

Let us consider the special case when K is the central quotient group of G. 
In order to do this we first determine the invariant operators of G. Since 


T-(ST)® (S7T)*T = (TS)*(TS-)* = (ST)-* (S“T), 


a necessary condition is given by b = ¢c = a/2. This implies, of course, that a 
iseven. Now 


[(ST)]22 [((S“T)2]22 = (S°T)« = (0?r?)a/2 


is known to be the only invariant operator in A. Since G may be generated by 
adjoining S*T to A, it follows that the condition is also sufficient. The central 
of G is therefore of order 2 provided a is even, and in that case the central 
quotient group K is of order 16(a/2)? = 47. If we set a = 2h, there is a group 
of order 16h? for every h. 
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Now the group G of order 8h? is obtained from the above group by taking the 
quotient group with respect to {(S7)*}. Obviously, then, (ST) is invariant 
in K. Furthermore, the central of any group G is at most of order 2. There- 
fore the central of K is at most of order 4. It will be of order 4 only if the 
two operators (S*7')", (ST)™ (S?T’)’ are separately invariant. But S-(S°T)?S = 
(ST)*(TS?)?, so that S-*(S*T)*S = (ST)* (S?T)*. The central of K is therefore 
of order two, and the groups G are also central quotient groups. It is thus 
seen that the reciprocal property previously mentioned becomes in this special 
case a reciprocal relationship between central quotient groups.” 

It is a consequence of what has been demonstrated up to this point that the 
period of ST in any group defined by the relations St = T? = (S“T-'ST)? = 1 
is an even number. The assumption of an odd number for the period of ST 
requires A to coincide with G, and hence leads to a contradiction, so that there is 
no group to correspond to the relations St = T? = (ST)**! = (S"T"ST)* = 1. 

THeoreM 6. The most general relation that may be adjoined to conditions 
St = T? = (S"'T-'ST)? = 1 is (ST)* (S"'T)* = 1. The four relations define 
a group of order 8(b? + c*), and such a group exists for every pair of numbers 
band c. In any one of these groups the period of ST is 2d(bj + cj), where b, and 
c, are relatively prime and b = dh; c = de;. No group is possible in which the 
period of ST is an odd number. 

A procedure for obtaining a pair of generating permutations for any one of the 
groups of order 8(b? + c?) from those already given for the groups G of order 8a? 
will now be outlined, the case b = c being selected for purposes of illustration. 
The representation obtained for G is transitive and of degree 4a. Therefore, 
the subgroup B which keeps the element 1 unchanged is of order 2a. It is non- 
invariant in G and involves no invariant subgroup of G. Since S~'T is of period 
2a and keeps the element 1 unchanged, B = {S“'T}. Suppose now that the 
operator (ST)*(S“T)¢ is adjoined to B, yielding a subgroup C of order 4a. If 
the largest invariant subgroup of G contained in C is of order two, the central 
quotient group K of G will be obtained by using C as the basis of a new transi- 
tive representation. The actual mechanics would involve the calculation of 
(ST)*(S“T)« and the subsequent equating in both S and 7 of each pair of 
elements which occur in the individual cycles of (ST)*(S“'T):«. 

Now it happens that C involves an invariant subgroup of order four, viz., 
{(ST)*, (S"T)*}. Moreover, this is the largest invariant subgroup of G con- 
tained in C. Hence, in order to get a representation for A, it is necessary to 
obtain first a representation of G, in which the basic group B is replaced by a 
subgroup B,, which does not contain (S“'T)*. If a is divisible by 2” but not by 
2”+!, this may be accomplished by setting B, = {(S“'T)*}, where 8 = 2?*. 
When this has been done, the new representation for G will involve 2°**a let- 


12 These two infinite families of order 8a? and 16h*, corresponding to the cases c = 0 and 
b = c, are the two families obtained by Miller in his study of the same case. 
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ters. To get the corresponding generators, we replace each cycle (a, b) of T by 
the 2?+! cycles 


(a, b) (4a + a, 4a + b) --- ((2°+* — 4) a + a, [27+* — 4Ja+ bd). 

Each cycle (4k — 3, 4k, 4k — 1, 4k — 2) of S is replaced by the 27+! cycles 

(4k — 3, 4a + 4k, 4k — 1, 4a + 4k — 2) (4a + 4k — 3, 8a + 4k, 

4a + 4k — 1, 8a + 4k — 2) --- ((2°** — 4Ja + 4k — 3, 4k, 
[2°+3 — 4Ja + 4k — 1, 4k — 2). 
The operator (ST)*(S—'T)« is now found to be 
(1, 2a + 1) (2, 2a + 2) --- (2a, 4a) (4a + 1, 6a + 1) --- (6a, 8a) 
++ ((2°+3 — 4) aw + 1, [27+* — 2) a 4+ 1) --- (([2?+* — 2] a, QP). 


Equating each of these pairs of numbers in the new forms just obtained for S 
and 7’, we get a representation for K which is transitive on 2?+**a letters. 

In general, if @ is divisible by (6? + c7)? but not by (6? + c?)?+, the above 
procedure will yield a transitive representation on 4(b{ + c7)a letters for the 
group of order 8(b? + c’). 


8. The case |, m, p = 3, 2,3. The subgroup A = {S~', T-'ST} is generated 
by two operators of period 3 whose product is of period 3. These relations 
o* = r* = (or)* = 1 were also studied by Burnside.“ The most general addi- 
tional relation that can be imposed on ¢ and r is of the form (o~'r)’ (¢7~')* = 1, 
and the resulting group is of order 3(b? + be + c*). The four relations imply 
that the common period of the commutative operators o'r and or is 
d(b} + bic: + c7), where d, b; and c, have the same meaning as before. 

A is invariant in G and of index 2 or 1 according as T is or is not contained in A. 
Suppose T isin A. Then, since the commutator subgroup of A is abelian, 


S“TST-.TSTS“ = TSTS“*.S“TST, S*TS“TS™ = TSTSTST, (ST)® = 1. 


But o'r = (ST)*; therefore (o'r)? = 1. Setting these cases aside for the 
moment, we see that G is of order 6(b? + be + c?). In it the additional relation 
is of the form (ST)” (S“T)* = 1. 

The treatment from this point follows the same lines as the case l, m, p = 
4, 2, 2. The additional relation implies that ST is of period 2a; the largest 
group G in which (ST')** = 1 is of order 6a. Such a group exists for every value 
of a and a pair of generating permutations can be set up for the general case. 


S = (1, 3, 2) (4, 6, 5) (7, 9, 8) --- (a — 2, 3a, 3a — 1) 


T = (2, 32) (3, 3a ~ 1) (4,30 ~ 2) --- ([8] 41,3042 -[*']). 


1* Burnside, loc. cit., p. 414. 
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In any such group G, an operator P = (ST)* (S“T)*, for which b} + bic: + c7 
divides a, is conjugate to its own powers only, and therefore a group K of 
order 6(b? + be + c*) exists for every pair of numbers band c. A pair of gen- 
erating permutations for K can be obtained from those for G. The representa- 
tion is transitive and involves 3(b3 + bic: + ci)?~'a@ letters whenever a is 
divisible by (67 + bic: + c7)” but not by (67 + bic: + cj)?**. The exceptional 
cases when A and G coincide are found to be contained in the groups of order 
6(b? + be + c’). 

The groups G are obtainable from the groups K as quotient groups with respect 
to the cyclic group {(ST)*}, so that the same kind of reciprocal relationship is 
obtained. However, the property of reciprocal central quotient groups does 
not enter. For c = 0 and b = c we have two infinite families of groups," G of 
order 6a? and K of order 18h?._ The former are central quotient groups of the 
latter, but the converse is not true. The operator (ST)**/* (S~'T')** is conju- 
gate to its inverse in G. 

THeEoREM 7. The most general relation that may be adjoined to the conditions 
S§ = T? = (S"T-'ST)® = 1 is (ST)® (S“T)* = 1. The four relations define 
a group of order 6(b? + be + c*), and such a group exists for every pair of numbers 
bandc. In anyone of these groups the period of ST is 2d(b? + bie: + cf); no 
group is possible in which the period of ST is an odd number. 


WasuinerTon, D. C. 


4 Here again, these two infinite families, corresponding to the special cases c = 0 and 
b = c, are the only groups obtained by Miller in his study. 














EXTENSIONS OF THEOREMS OF DESCARTES AND LAGUERRE TO 
THE COMPLEX DOMAIN 


By I. J. ScHOENBERG 
§1. Introduction and statement of results 


1. Much attention has been devoted to the important problem of finding 
limitations for the absolute values of the zeros of a polynomial in terms of the 
absolute values of the coefficients of the polynomial. Much less is known about 
the arguments of the zeros when only the arguments of the coefficients are taken 
into account. Regarding this latter problem in its full generality, I can 
mention only an interesting article by A. J. Kempner.! The classical rule of 
Descartes is a contribution to this problem for real equations as far as real roots 
are concerned. Obreschkoff’s extension of this rule (Theorem I below)? to 
those roots of real equations which lie in a certain angular neighborhood of the 
real axis points the way to a new extension of Descartes’ rule which will take care 
of all the real or complex roots of real or complex equations (Theorem II and 
subsequent remarks in section 4). Furthermore, it will be shown that a theorem 
of Laguerre (Theorem V) actually applies to the roots in rather extended domains 
of the complex plane (Theorem VI). All these results are derived by means of 
the fruitful idea used by Obreschkoff in proving his Theorem I. It consists in 
letting the original theorems (of Descartes and Laguerre respectively) extend 
themselves, so to speak, to complex roots in certain domains by means of a 
classical theorem of Cauchy applied along the boundary of the corresponding 
domain. 


2. The following extension of the rule of Descartes is due to N. Obreschkoff 
(loc. cit.). 
TueoremM I. Let 


(1) f(z) = da + ar + ox? + .:-- + a,2" = 0 


Received August 24, 1934, by the Editors of the Annals of Mathematics, accepted by 
them, and later transferred to this journal. 

1 A.J. Kempner, Uber die Separation complezer Wurzeln algebraischer Gleichungen, Math. 
Annalen, vol. 85 (1922), pp. 49-59. Using systematically the fact that a sum of complex 
numbers with positive real parts can not vanish, Kempner shows how to divide the entire 
plane into consecutive infinite sectors S,, a a, Be, Bu Z ++» with a common vertex at 
the origin such that there are no zeros within the sectors S,, S:, S;, «++ , while each of the 
sectors S;,S;, Si, --» contains at least one zero of the given polynomial. See footnote 
3 below. See also his recent comprehensive article On the complez roots of algebraic equa- 
tions, Bull. Amer. Math. Soc., vol. 41 (1935), pp. 809-843. 

2N. Obreschkoff, Sur un probleme de Laguerre, Comptes rendus de l’ Acad. des Sciences, 
vol. 177 (1923), pp. 102-104. 
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be an algebraic equation with real coefficients. The number v, of variations in the 
sequence of its coefficients is not merely an upper bound for the number Z(x > 0) 
of its positive roots (rule of Descartes), but also an upper bound for the number of all 
of its roots which lie inside the sector | arg x| < m/n, #.e., 


(2) Z{\ argz| < x/n} S tu. 


Substituting x = —z into (1) we also find an estimate for the number of roots 
within the opposite sector  — (x/n) < argz < a+ (x/n). What about Z for 
some sector with the vertex 0 and containing, say, the upper half of the imaginary 
axis in its interior? If we try to use Theorem I after performing the rotation 
x = iz, we shall find in general a new equation in z with complex coefficients to 
which Theorem I does not apply. 


3. This remark suggests the desirability of extending Obreschkoff’s theorem 
to equations (1) with complex coefficients. This obviously necessitates an exten- 
sion of v, to complex numbers ao, a1, --- ,@,. A natural extension of this notion 
is as follows. Let us mark in the complex a-plane the points ao, a, --- , Gp». 
Through the origin O of the a-plane we draw any two straight lines A and A’ 
dividing the plane into four consecutive sectors A, B, C, D with the following 
property: one of the two pairs of opposite sectors, A and C, say, shall not contain 
any of the points a, in itsinterior. Call y (we assume y + 0, hence 0 < ¥ S 7) 
the common aperture of A and C. The points a, are thus assumed to lie inside 
or on the boundary of the remaining opposite sectors B and D. For this reason 
we say that the angles A and C form a separating double sector (of aperture ¥) 
for the coefficients a,. In the process of going through the sequence of points 
Qo, a, --- , @, we count the number of times we have to pass from the sector B 
to D or vice-versa, and call this number v,(S) [= number of variations of our se- 
quence with respect to the separating double sector S = (A, C)|. In counting these 
variations, points a, which are at the origin O are to be disregarded. If all the 
a, are real, we can take y = =z and thus get the usual number p, . 

With this definition we can now state 

TueoreM II. Let (1) be an algebraic equation with real or complex coefficients. 
In the complex a-plane mark the points a, and draw a separating double sector S of 
aperturey (0 < YS). Then 


(3) Zi\argx| < y¥/n} s v,(S). 


For ¥ and n fixed the sector | arg x | < ¥/n is the largest domain for which (3) 
always holds, for this inequality may fail to hold if we add to the sector even a single 
point on its boundary. 

If the equation (1) is real, we can take ¥ = x, and (3) reduces to Obreschkoff's 
inequality (2). 


4. Now if we wish to find an estimate for the number of roots within some 
sector whose bisector makes with the positive axis the angle @, we substitute 
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x = ez into (1) and denote by S’ a separating double sector of aperture y’ for 
the coefficients a) of the transformed equation 


(4) f(e%z) = ao + ae®z + --- + a,e2" =a, +aiz+--- +a4,27=0. 
We obtain by Theorem II* 


ze ~- v <argxz < 64+ ¥\ < v,’(S’). 
Thus Theorem II gives information about the distribution of the arguments of 
the roots of (1) by means of the arguments of the coefficients of this equation. 
Additional information may be obtained by changing the origin of the z-plane. 

In Theorem II it will in general be possible to find for the same equation (1) 
a considerable number of separating double sectors S with different v,(S) corre- 
sponding to them. It is desirable, in order to make the inequality (3) more effec- 
tive, to have y as large and v,(S) as small as possible. Disregarding »,(S) for 
the moment, we can always choose y so as to exceed a certain constant due to 
the following elementary theorem. 

TuHeoreEM III. It is always possible to find for the coefficients of an equation 
(1) of degree n a separating double sector of aperture Y = xr/(n + 1). The con- 
stant r/(n + 1) is here the largest possible for a given degree n. 

Hence Theorem II will give an upper bound for the number of the roots of (1) 
within any sector with vertex at the origin and of aperture 2x/[n(n + 1)], for 
any rotation x = ez will give a new equation (4) to which a separating sector 
of aperture = 7/(n + 1) can be found according to Theorem III. 

The difference of the two sides of the inequality (2) is in any case an even 
number. Easy examples will show that this is not always true for the general 
inequality (3). 


5. Laguerre has extended Descartes’ theorem to exponential polynomials 
(5) F(x) = ae” + ae” + .--. + a,c" = 0 (No <A < +--+ <A,), 


with a, 2 0, andfound that Z{— «0 <2 < ©} <»,. Anextension of this result 
similar to Theorem ITI follows. 

TueoreM IV. Let (5) be an exponential equation with real or complex coeffi- 
cients and real monotonically increasing exponents. In the complex a-plane mark 
the points a, and draw a separating double sector S of aperture y (0 < ¥ S zx). 
Then 


(6) Z{ | 3r| < ¥/(An — ro)} S v0(S). 


3 A “planetarium’”’ as suggested by Kempner (loc. cit., p. 54) would greatly facilitate the 
determination of S’, y’ and V,/(S’) for arbitrary @ in any numerical case. It is a watch-like 
instrument with n hands (vectors) capable of rotating with angular velocities of ratios 
1:2:3: --- :n and capable of starting from arbitrarily assigned positions (corresponding to 
the arguments of a, a2, --- , a, if a2 = 1). Thus an ordinary watch whose hands can be 
moved into any initial positions will take care of any equations of the type 1 + az + 
Qy_2'? = 0. 
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Note that Theorem II is a special case of Theorem IV for \, = v and the new 
variable z = e*. M. Marden‘ has previously shown that | $r| < ¥/(An — Ao) 
is a zero-free region of F(x) provided v,(S) = 0. Just as Theorem II could 
be applied to various angles by rotation of the z-plane, so can Theorem IV 
be applied to various horizontal strips by vertical translation of the z-plane. 
Theorem III insures the applicability of Theorem IV to any horizontal, strip® of 
at least the width 27/[(n + 1) (An — o)]. 

It is of interest to note that the constant ¥/(A, — Ao) of Theorem IV is a 
function of the difference \,, — Ao, and hence independent of the number of terms 
of the exponential sum (6). For this reason it can readily be extended to integral 
functions of the type 


F(z) = i * @ (A) dd. 


6. Let us consider now a rational function of the form 


(7) F(x) = & + & Hived Gn 
r— a 


r— a r— Ay, 





(ao > a1 > --+ > anja, real ¥ 0). 





It is well known that if a, and a,,; have the same sign, then F(x) has an odd 
number of zeros inside the interval (a,4:, a,). Denoting by v. the number of 
variations in the sequence do, a, --- , @,, we conclude that F(x) has at least 
nN — Uq zeros in a, < x < ao and therefore at most v, zeros in the complex plane 
outside the interval a, < x < ao; in particular, we thus get Z{ao << xz < ©} Sm. 
Let us denote by v(ao + a; + --- + an) (= the number of variations in the sum 
a + a; + --- + a,) the number of ordinary variations in the sequence of 
partial sums ao, do + ai, do + @; + G2, ---,@ +--- + a,. A better estimate 
of Z {ao < x < @} is furnished by the following theorem of Laguerre.® 


* Morris Marden, On the zeros of certain rational functions, Trans. Amer. Math. Soc., 
vol. 32 (1930), p. 662. 

5 A theorem on more general exponential polynomials (a, = a polynomial in z) proposed 
by G. Pélya as a problem and proved by Obreschkoff, Jahresbericht der Deutschen Math.- 
Vereinigung, vol. 37 (1928), pp. 82-83, Lésung der Aufgabe 24, when specialized to poly- 
nomials of type (5) with apa, ¥ 0 gives for any finite interval (a, 8) the interesting inequali- 
ties 
6’) Oa = WIE ~ 9 9 5 Blas 3258) = M—WE- 9 5g, 


Results (6) and (6’) do not imply one another. If we apply (6’) to the case of ordinary 
polynomials f(z) = a + az + --+ + a,z" by writing 4, = » (v = 0,1, ---,n), z = e*, 
0 < B — @ < 2, we get the result 


n(@=* 1) 5 Zia s ares 5 6) so(*=*+1), 








2 2x 


which is trivial, for the lower bound is < 0 while the upper bound exceeds n. This was to 
be expected, for (6’) does not utilize the arguments of the coefficients of (5). I am indebted 
to Harry Matison for calling my attention to Pélya’s theorem. 

6 E. Laguerre, Oeuvres, vol. 1, p.41. See footnote 7 below. 
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THeoremM V. The number of real zeros of the function F(x) defined by (7) which 
are greater than ao does not exceed the number of variations in the sum ay + a, + --- 
a 

Moreover, if >>; a, ¥ 0, the difference v(ap + --- + an) — Ziag <a < ~} is 
aneven number. Our extension of this theorem to the complex roots of (7) is 
as follows. 

Tueorem VI. Jf 5°} a, ¥ 0, in Laguerre’s Theorem V v(ao + --- + Gn) is 
not merely an upper bound’ for the number Z{ao < x < «} of real zeros of (7) 
which are greater than a, but also an upper bound for the number of its zeros of 
real part = aw, 7.€., 


(8) Z\Rx > ao} < v(ao + a1 + --+ + Ay). 


If >>, a, = 0, the same inequality holds for Z{Rx > ao} instead of Z{Rx = ao}. 
The property concerning the evenness of the difference remains valid also in 
the half-plane (provided >>} a, ¥ 0), for possible complex zeros appear in pairs. 
By means of a simple homographic transformation, Laguerre (loc. cit., pp. 
42-47) derived from Theorem V the following elegant estimate for the number 
of zeros of (7) within the interval § < x < a, (a41 < — < a@,): 








a, ay-1 ao an 
(9) parr" s(7*o+ Sot {<0 f-- 
at — G41 
Tice 1 + an) 


Similarly Theorem VI shows that the right side of (9) is an upper bound for Z 
within the circle with (¢, a,) as diameter and, if F(¢) + 0, we may even count 
the zeros on the boundary of this circle. 


7. Laguerre’s result in its extended form is of importance, because of the ease 
with which the quotient of a polynomial f(x) of degree n by (x — ao) (x — a1) 
-++ ( — a@,) can be decomposed into partial fractions and thus put in the form 
(7). Thus from the formula 


f(x) “ROC 1)’ (7) fer f(a + vh) 


(x —a)(x-~a—h)---(tx—a—nh) nth z—a—vh 





7 The inequality v, S v(a9 + --- + a,) shows that Laguerre’s upper bound for Z{a) < z 
< «} is more accurate than the upper bound v, derived at the beginning of section6. How- 
ever, the weaker (i.e., larger) bound v, holds for all zeros of (7) outside the real interval 
(an, a), While Laguerre’s bound holds for the zeros in a smaller domain, namely the half- 
plane Rz = a only, according to Theorem VI. Similar remarks will throw new light on 
the results of the author’s recent.paper, Zur Abzdhlung der reellen Wurzeln algebraischer 
Gleichungen, Math. Zeitschrift, vol. 38 (1934), pp. 546-564. The weaker upper bound is 
definitely worse as long as we restrict ourselves to real zeros. This is no longer true as soon 
as complex zeros are also taken into account, for in general weaker upper bounds hold for 
larger domains in the complex plane. 
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we get immediately, taking into consideration Theorem VI, the following corol- 
lary (see Laguerre, loc. cit., p. 157). 

TuHeoreM VII. The number of those zeros of a polynomial f(x) of degree n 
which lie in the half-plane Rx < a does not exceed the number of variations in the 
sum 


fla) — (7) 40 +H +($) sa 42m) —--- + (Dflat ni) a> 0), 


and the difference between this upper bound and the actual number of zeros is even. 
This last theorem can be applied with particular ease to polynomials defined 
by interpolation, i.e., when the values f(a + vh) are given. It shows that if 
f(a) is sufficiently large, we are sure to have no zeros with Rz < a. 
Theorem VI may also be applied to locate the zeros of the derivative of a ra- 
tional function f(z) with only real zeros and poles. This is best shown by an 
example. Let the rational function be 


x*(x — 2) 
(x + 1)*(x — 1)?” 


Besides the obvious multiple zeros z = 0 and x = 2, f’(x) will admit the same 
ZeTOS as 





(10) f(z) = 








f(z) 6 ,6 2 3 
ieee ° hein? a tat os at Ps 


Laguerre’s extended theorems readily give the following results. There are no 
zeros with Rz < —1 and there is exactly one zero with Rz > 2. Furthermore, 
there are no zeros within the circles of diameters (— 1, 0) and (1, 2), while there 
are two zeros or none within the circle on (0, 1). A direct solution is possible 
and gives the zeros 


x = 3andz2z = (11 + iv/23)/18, 


the complex roots being actually in the last mentioned circle. By a previous 
remark it becomes apparent that if z = ais the smallest pole or zero of a rational 
function f(x) of the form (10), then f’(x) will have no zeros with Rz S a (x ¥ a), 
provided the order of the pole or zero x = a is sufficiently high. 


8. It might be of interest to point out finally that Theorem VI can be extended 
by usual continuity considerations to infinite series and integrals of the type 





F(z) = >) —* (m=0<n<--- <p>), 
an rtp 
Fiz) = [204 


giving estimates for the numbers of zeros with Rz > 0. 
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Concerning series of the type (11), the following results are readily derived 
from classical theorems of Abel. If the series converges in one point of the 
plane, it converges in the whole plane except the points x = 0, —pi, —p2, --- , 
and it converges uniformly in every closed and finite domain which is free of 
these points. The case of convergence occurs if and only if p aes (a,/p,) con- 
verges. From these results and Theorem VI the following is readily derived. 

TuHeoremM VIII. The number of zeros of the sum F(x) of the convergent series 


oe 


F@) = 5 (p =O0<p<--- <p>) 





v=0 


within the half-plane Rx > 0 does not exceed the number of variations in the infinite 
suM A + 4, + A2+---. 

If, for example, >>> a, = s ¥ 0, then v(ao + a; + --- ) is certainly finite and 
therefore Z{Rz > 0} is finite also. In the case s ¥ 0, it is again seen by Abel’s 
theorem that F(x) has the sign of s for sufficiently large x. Hence in this case 
(doFa, ¥ 0) the difference v(ay + ai + ---) — Z{Rx > O} is always finite and 
an even number, for F(+0) has the sign of a, while F(x) has the sign of 
s = )- a, for large x > 0, as already remarked. 

The two series* 


‘edt = Sy (-1) [oeu _ sv 1 (-) 
Li-2S. | eae as situa 


are examples with v(ao + a, + ---) = 0. 











§2. Proofs of Theorems II, III and IV extending the theorem of Descartes 


9. A rotation of the a-plane about its origin does not change equation (1); 
hence we may assume that the imaginary axis bisects the separating double 
sector S of aperture y (0 < ¥ S x). The remaining double sector (containing 
the points a,) is now bisected by the real axis, and its aperture is 2n = 7 — y 
(= 0). Hence ¥/n = (x — 2n)/n, and we have to prove the inequality 


(12) Z{\ arg x| < ( — 2n)/n} S »,(S). 


The number of zeros of f(x) inside the sector D: | arg z| < (x — 2n)/n is the 
same as for the function® 


(13) F(x) =e? x ? f(z). 
Let us decompose F(x) into its real and imaginary parts using polar codrdinates. 


8 The zeros of these two functions which are connected with the Gamma function were 
investigated by many authors and finally located by T. H. Gronwall, Trans. Amer. Math. 
Soc., vol. 28 (1926), pp. 391-399 and Annales de l’Ecole Norm. Sup., vol. 33 (1916), pp. 
381-393. 

® Obreschkoff proved Theorem I by applying Cauchy’s theorem directly to f(z). In 
proving Theorem II by this method, the additional factor z~"/? seems to be essential. 
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Let 


(14) a, = pe”, | a, | =1, p= 0 (v = 0, 1, needs »n), z= rev (r>0). 
From (1) and (13) we get 


ll 

? 

oO 

4 
— 
tol 3 

| 
tb! 

RS 

+ 

4 

a 

& 
Nee” 

“t 


(15) a 
+4 >) osin(§—Setwta)r did 
v=0 
= P + iQ. 


There is obviously no restriction in assuming aa, ~ 0. Let us furthermore 
assume for the moment that we have not only | a, | S n, but also 


(16) | a, | <4, 


and that f(x) does not vanish on the two half-lines bounding sector D: | arg x| < 
¥/n. 

Draw about the origin two circles of radii R and ¢ (very large and very small 
respectively) which cut the boundary of D in the points A, B, A’, B’. Consider 
the finite domain D’ which is bounded by the straight segments A’A and BB’ 


and the circular ares AB and A’B’ and whose boundary is described counter- 
clockwise in the order ABB’A’A. Wemay assume F(z) ~ 0 along the boundary 
of D’. 

Let x describe the boundary of D’ counter-clockwise and consider the variation 
of the real function Q/P along this boundary. By a theorem of Cauchy we know 
that 2Z{D’} is equal to the number of zeros of Q/P in which Q/P passes from 
negative to positive values minus the number of its zeros in which this function 
passes from positive to negative values. 

Let us investigate the number of zeros of Q along the boundary of D’. Along 
the arc AB we have —(x — 2n)/n S ¢ S (x — 2n)/n; hence 9 S (ng/2) + 
(x/2) S * — n, and finally 


(0 <) n+ an S r/2+ m¢/2+ a, 547 — 1+ a, (< 7), 


on account of (16). Within this range of values of ¢ cot (7/2 + ng/2 + a,) is 
a finite and continuous function of ¢, and (15) shows that 


lim P/Q = cot (x/2 + ng/2 + an) 

10 If F(z) were a polynomial, Cauchy’s theorem stated above would be a rather simple 
consequence of the fundamental theorem of algebra as shown by Ch. Sturm, Journal de 
Mathématiques, vol. 1 (1836), pp. 290-294. Our function (14) differs from a polynomial! by 
the factor z-"/? only. The origin being outside of D’, Sturm’s elementary proof applies 
also to our case. 
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uniformly in ¢ within this range. Hence P/Q is finite and continuous along AB 
provided R is sufficiently large. In particular, Q will not vanish along AB. 
Similarly 

lim P/Q = cot (x/2 — ng/2 + ap) 


r—0 
uniformly in ¢ for —(x — 2n)/n S ¢ S (x — 2n)/n. This shows that Q will 
not vanish along A’B’ if ¢ is sufficiently small. 
We want to show now that the sine coefficients of Q, namely sin (7/2 — ng/2 + 
ve + a,) are all positive along the boundary of our sector D, i.e., for g¢ = 


+(x — 2n)/n. For taking in each term of the sum 3x + (—}n + v)o +a, 
respectively the smallest and the largest possible value we have 





_® na — 2n ra n 7 naw—2n ns 
0-5 Bt <2 a(S rleta<sts = +n=T 
fory = 0,1,---,n. By applying Descartes’ rule to the polynomial in r 

n/2 — ~ ; r_ y 

r Q= Dd) msin (5 gtwtelr 


for ¢ = +(x — 2n)/n, we see that Q can not vanish along the sides BB’ and 
A’'A more than 2v, = 2v,(S) times. By a proper choice of « and R we may 
assume that Z{D’} = Z{D}, and combining our results with Cauchy’s theorem 
we get 


2Z{D’'} = 2Z{| argxz| < (x — 2n)/n} S 2v,(S). 


This proves the inequality (12). 

Our additional assumptions (16) are now easily removed by slightly increasing 
n (decreasing y). For now (16) certainly hold, and the increase of » (if suffi- 
ciently small) will remove possible zeros of f(x) along the boundary of D without 
decreasing Z{D}. Now (12) is proved as before and » may be decreased to its 
original value. . 

In order to show that (3) may fail to hold if we add to the boundary of D the 
point e~’/™‘, say, consider the equation 


(17) eri +2" = 0 (0 < "7 < r). 


The two lines through the origin containing the points x = 1 and x = e‘*-¥* 
define a separating double sector S of aperture y with v.(S) = 0. Hence 
Z\\ argz| < ¥/n} = 0. This relation will become false if we add the point 
x = e~%/™i to D, for this point is a zero of the equation (17)." 


11 It should be remarked that D ceases to be the largest domain for which (3) holds if, 
besides y and n, v,(S) also has a prescribed value > 0. Thus for real equations (y = 7) 
Obreschkoff really proved more than Z{| arg z| < +/n} S va, namely, Z{| arg z| < x/ 
(n — ve)} S ve, in his paper Uber die Wurzeln algebraischer Gleichungen, Jahresbericht der 
Deutschen Math.-Vereinigung, vol. 33 (1924), p. 61. It would be interesting to improve 
the inequality (3) accordingly. 
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10. In proving Theorem III there is obviously no loss of generality in as- 
suming all the coefficients a, ~ 0. Take a regular pencil = of n + 1 straight 
lines through 0 which divide the plane in 2n + 2 equal angles of aperture 
m/(n + 1). Let one of the lines of the pencil pass through a. The pencil 
defines n + 1 double sectors (each composed of a pair of opposite sectors) and 
the n segments 0a, Oae, --- , Oa, can obviously occupy the interiors of at most 
n of these double sectors. Hence at least one of them will be a separating double 
sector of aperture r/(nm + 1) for the pointsa,. Note on the other hand that the 
points a, = e’*/(+) (»y = 0,1, --- , n) do admit separating double sectors of 
aperture z/(n + 1) but none of greater aperture. Thus Theorem III is proved. 


11. The proof of Theorem IV resembles that of Theorem II. It suffices to 
remark that Cauchy’s theorem is applied to the function 


&(z) = e és ae * (2) 


along the boundary of the rectangular domain ! Rz | < A (A very large), 
| $x | S ¥/(n — Ao). 


§3. Proof of Theorem VI extending the theorem of Laguerre 


12. Let us first consider the case when >> a, ~ 0. We may assume ay = 0, 
0 


and hence a, < @n1 < --- < ai < a = 0. For the rational function F(z) 
defined by (7) we have to prove the inequality 

(18) Z{Rx => 0} S v(ao + a1 + --- + a,). 

Take « > 0 so small that F(z) does not vanish on the line Rz = —e and 


ZiRz > —e} = Z{Rxz = 0}. Draw twocircles about the origin of radii 2e and 
R (very large) which intersect the line Rz = —e in the points A’, B’ and A, B, 
respectively, and consider the domain D’: Rz = —e, 2e S |x| S R, whose 
boundary is described counter-clockwise in the order ABB’A’A. For « suffi- 
ciently small and R sufficiently large we have Z{D’} = Z{Rzx = 0}. 

Let 


(19) x = re'*, r— a, = r,e'*r, DP» = —a, = | a, |. 


From (7) and (19) we get 


(20) F(z) = DE o ei > cs er i> a, me = P+ iQ. 


v=0 v=0 


We wish to apply Cauchy’s theorem to F(x) in D’, and for this purpose let us 
investigate the zeros of Q/P along the boundary ABB’A’A of D’. 

(i) If R is sufficiently large, along the circular are AB Q/P will have the sign 
of —tan ¢ and will therefore vanish only once, for ¢ = 0, and pass there from 
positive to negative values. 
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(ii) Similarly along B’A’ Q/P will have, if ¢ is sufficiently small, the sign of 
—tan ¢ and will therefore vanish once, for ¢ = 0, and pass there from negative 
to positive values. 

(iii) Let us now move the point z along BB’. If we put x = —e + it, 
from the triangle (a,, —«, z) we get sin y, = t/r, or sin g,/r, = t/r? = t/[@ + 
(p, — ©)?]. Hence 


ao ay, an 
Q= GS atargcet  tarece) 

Hence if 2 < (pi — ¢)?, as we may assume, and if we take ? as the new variable, 
Laguerre’s Theorem V shows that Q has at most v(a) + a; + --- + a@,) zeros 
along the half-line z = —e + it (¢ > 0), which implies that Q/P has at most 
v(ao + --- + a,) zeros along BB’. 

(iv) The above result holds along A’A as well because Q(—«e — it) = 
—Q(—e + it). 


By Cauchy’s theorem, therefore, we have 
2Z{Rx =O} = 2Z{D’'} s (ayo + --- + an) +1 —1 = B(aq + --- + ,). 


This proves (18). 
Assuming now that 23 a, = 0, let us prove that 


(21) Z{Rx > 0} S v(a + --- + 4,). 


Let >>3~'a, be positive. A sufficiently slight increase of a, will not change 
ZiRz > 0}. Now > a, > 0 and (18) holds. Hence (21) holds A fortiori. 
In conclusion, let me point out that if ao = 0 and all the other constants occur- 
. ring in (7) are variable (with >°}.a ¥ 0), then Rx = 0 is the largest domain for 
which the inequality (8) holds. This inequality may become false if we add to 
Rr = O even a single point with Rx <0. This is best shown as follows. With 
Pr = —a, by = (Psi — Pr) Doo Ge (Past = Pn + 1), we have 
bo by baa bn 
F(x) = eee 
22)" = rat) tGtmetm + + etpwaetp) tet p’ 
(0 < pr <--- < pn). 
If all the b, are = 0, we know from (8) that Z{Rz 20} = 0. By a certain type 
of argument used by J. v.S. Nagy and Morris Marden” it is readily shown 


that if b, = 0 and all the constants occurring in (22) are variable, then Stz < 0 
is the geometric locus of the zeros of F(x). This proves our last remark. 











INSTITUTE FOR ADVANCED Stupy. 


2 J. v. S. Nagy, Uber die Lage der Wurzeln von linearen Verkniipfungen algebraischer 
Gleichungen, Acta Szeged, vol. 1 (1923), pp. 127-138, and M. Marden, loc. cit. 














CONNECTIONS BETWEEN DIFFERENTIAL GEOMETRY AND 
TOPOLOGY 


II. CLOSED SURFACES 
By SumMNER Byron Myers 


1. Introduction. This paper deals with closed 2-dimensional Riemannian 
manifolds, for brevity designated closed surfaces. The properties of a funda- 
mental locus which we call the minimum point locus with respect to a point A, 
studied in a previous paper' by the author for the case of simply-connected 
analytic surfaces, are determined here for the general class of closed surfaces. 
The locus in question is defined as the locus m of points M on geodesic rays issu- 
ing from a point A, which are the last points along these rays such that the arc 
AM furnishes an absolute minimum (proper or improper) to the length of arcs 
joining A to M. In the case of a closed analytic surface S the principal result 
is that m is a linear graph (i.e., a finite connected 1-dimensional complex) whose 
one-dimensional Betti number equals the one-dimensional Betti number mod 2 of 
the surface. A study is made of the parametrization of m by means of @, the 
angular coérdinate of the geodesic rays through A. It is found that this depends 
on the orientability of S, and also that the number of values of @ yielding one 
point of m equals the order? of that point in m. 

A brief study is also made of non-analytic surfaces. Here we assume, for 
example, that S is a closed regular manifold of* class 5 with a Riemannian line 
element of class C*. The locus m turns out to be a continuous curve (not neces- 
sarily a linear graph) with the same relation as in the analytic case between the 
one-dimensional Betti number of m and the one-dimensional Betti number of S, 
and similar relations among the orientability of S, the parametrization of m by 
means of @, and the order of points of m. 

In both analytic and non-analytic cases, if we subtract the locus m from the 
surface S, the result is a single 2-cell with m as its singular boundary, simply 
covered (except at A) by the geodesic rays through A. Thus is solved the prob- 


Received October 4, 1935; presented to the American Mathematical Society April 19, 
1935. The author is National Research Fellow. 

1 See Myers, Connections between differential geometry and topology, I. Simply connected 
surfaces, this Journal, vol. 1 (1935), pp. 376-391. This paper will be referred to as (I). 
An abstract containing the results of that paper, as well as the results of the present paper 
in the analytic case, appears in the Proc. Nat. Acad. Sci., April, 1935, under the same title. 

2 By the order of a point P of a continuous curve C we mean the number of 1-cells con- 
tained in C which issue from P and are such that no two of them have any point in common 
but P. 

3 See Veblen and Whitehead, The Foundations of Differential Geometry, p. 81. 
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lem of finding the largest domain of the geodesic polar coérdinates and normal 
coérdinates with A as pole. 

The locus in question was originally introduced by Poincaré,‘ who considered 
only closed simply connected surfaces of positive curvature. Poincaré called the 
locus “‘lignes de partage”. J. H.C. Whitehead® considers the same locus in a 
recent paper. He calls it the “cut locus’, and considers it on complete n- 
dimensional Finsler spaces. He obtains the theorem that such a space can be 
decomposed into an n-cell and the cut locus, which forms the singular boundary 
of the n-cell, but he does not study the topological nature of the locus itself. 
The results that we obtain in the present paper for 2-dimensional Riemannian 
spaces can readily be extended to 2-dimensional Finsler spaces. It is probable 
that analogous equalities between the Betti numbers of the space itself and the 
Betti numbers of the locus exist in n dimensions. However, it seems difficult 
to prove that the locus in the analytic n-dimensional case is homeomorphic to a 
finite (n — 1)-dimensional complex. 


2. The analytic case. We assume a knowledge of the definitions and 
terminology of (I). We recall that on a closed (compact) surface every pair of 
points can be joined by a geodesic of class @, i.e., a geodesic every segment of 
which furnishes an absolute minimum, proper or improper, to the length of ares 
joining its end points. Lemmas 1-11 and Theorems 1-3 of (I) hold here as well 
as in the simply connected case. 

Theorem 4 of (I) is replaced by the following 

THEOREM 1. A surface is closed if and only if there exists no geodesic ray of 
class @ through any point on it. 

For Rinow has proved* that from every point of an open surface issues a 
geodesic ray of class @. Furthermore, on a closed surface there can exist no 
infinite set of points without limit point, and on a geodesic ray of class @ such a 
set exists. 

Thus on every geodesic ray through A on S there is a minimum point with 
respect to A. It follows from Lemma 8 of (I) that the locus m of minimum 
points with respect to A is the continuous single-valued image of a circle; i.e., a 
continuous curve. 

Now if we cut off each geodesic ray from A at its minimum point with respect 
to A, the region S — m — A is simply covered by these truncated geodesic rays. 
For if two such geodesic rays intersected before m, the absolute minimum 
property would have stopped on at least one of them at or before the intersection 
with the other, as follows from Lemma 4 of (I). Hence we have 

THEOREM 2. If the minimum point locus m with respect to A is deleted from S, 
the result S — m is a 2-cell o with m as its singular boundary. a is simply covered 


4 Trans. Amer. Math. Soc., vol. 6 (1905), p. 243. 

5 Annals of Math., vol. 36 (1935), pp. 679-704. 

6 W. Rinow, Ueber Zusammenhdnge zwischen der Differentialgeometrie im Grossen und im 
Kleinen, Math. Zeitschrift, vol. 35 (1932), p. 522. 
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(except at A) by the geodesic rays issuing from A, and hence forms the largest domain 
of the geodesic polar coérdinates and normal coérdinates with A as pole. 

No set C of closed curves on m can bound on 8S. For C would separate S into 
two regions R, and R, such that no point in R, could be joined to a point in R, 
without crossing C. But A would be in either R; or Re, say R,, and could be 
joined to any point in Rz by a geodesic of class @. This geodesic would cross C, 
thus contradicting the fact that it is of class @. Let B, be the one-dimensional 
Betti number mod 2 of S. Then we have shown that the one-dimensional Betti 
number of m, i.e., the number of independent sets of closed curves in m, is at 
most B;. On the other hand, the one-dimensional Betti number of m is at least 
B,. Otherwise a non-bounding closed curve independent with respect to ho- 
mology of those of m could be drawn on S. This would contradict Theorem 2. 
Thus the one-dimensional Betti number of m is exactly equal to B,. Hence 

THEOREM 3. The minimum point locus m with respect to any point A of a 
closed surface S is a continuous curve whose one-dimensional Betti number equals 
the one-dimensional Betti number mod 2 of S. 

According to a theorem of curve theory, m is locally a tree? because its one- 
dimensional! Betti number is finite. We shall prove now that in the analytic 
case the number of end points (i.e., points of order 1) of m is finite. From 
this it will follow that m is a linear graph,’ i.e., a finite connected 1-dimensional 
complex. 

By a minimum point of order n with respect to A we mean a point of m which 
can be joined to A by just n geodesics of class @. We prove first that every 
end point P of m is a minimum point of order 1. 

Suppose A could be joined to P by two geodesics of class Q, g:: 6 = 6, and 
g2: 8 = 62. We could find a neighborhood o of P of radius 6 which would be 
divided into two 2-cells a; and o2 by these two geodesic rays. By Lemma 8 of (I) 
we can find an ¢ so small that the geodesic rays for 6; — « < 6 < 0, + € all have 
their minimum points with respect to A within the neighborhood ¢. The rays 
for 0, — ¢< 6 < 6, (if € is small enough) all remain close to g;, by Lemma 1 of (I), 
and all enter one of the regions o; or o2, say o;. The minimum points on these 
geodesic rays are all in o;, for none of these rays can cross g; or gz and remain of 
class @. Similarly, geodesic rays from A whose 6-coérdinates lie between 6 
and @, + ¢ have their minimum points in ¢2. Thus the locus m has two distinct 
continuous curves issuing from P, or the rays for 6; — « < @ < 6 + eall have 
their minimum points with respect to A at P, in which case the complete locus m 
is the single point P, as shown in (I) on p. 387. Either of these cases contradicts 
the assumption that P is an end point of m. Hence every end point of m is a 
minimum point of order 1. 

But there is only a finite number of minimum points of order 1 with respect to 


7™See Menger, Kurventheorie, p. 323. 

8 This follows from Menger, loc. cit., p. 266. Itis easily shown that if a continuous curve 
has a finite number of end points and a finite number of closed curves, it has only a finite 
number of points of order greater than 2. 
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Aon 8S. This is proved as in the simply connected case® in (1) by means of 
Lemmas 9 and 10 and Theorems 2 and 3 of (1). Hence m has only a finite num- 
ber of end points, and m is a linear graph. This linear graph is already partly 
triangulated by means of the end points and branch points, but it may be neces- 
sary to subdivide the closed curves of m in order to make m a complex in the 
technical sense. 

From Lemma 11 of (I) it follows that any are of m containing no points conju- 
gate to A and no interior points of order > 2 is a regular analytic arc. 

TuHeEoreM 4. The locus m on a closed analytic surface is a linear graph. The 
end points of m are conjugate to A, and are cusps turned toward A of the locus of 
first points conjugate to A. Anarc of m containing no points conjugate to A and no 
interior points of order > 2 is a regular analytic arc. 

In the proof of Theorem 4, we have shown that every end point of m is a 
minimum point of order 1. We now show, conversely, that every minimum 
point of order 1 is an end point, thus identifying the points of m of order 1 
with the minimum points of order 1. More generally, we wiil show that the 
order of a point of m equals its order as a minimum point with respect to A. 

Let P be an arbitrary minimum point of order 1 with respect to A on the 
geodesic ray @ = 6. Draw a geodesic circle y of radius 6 about P so small that it 
and its interior o are simply covered by the geodesic rays from P. If ¢ is chosen 
small enough, all the geodesic rays from A for 6 — « < 6 < 6+ eremain close to 
6 = 6 and by Lemma 8 have their minimum points with respect to A in o. 
Hence none of them have points conjugate to A before or when they reach 7, 
so that they form a field F in the neighborhood of 6 = 6 up to and including a 
portion of y. It is easily seen that if two geodesic rays from A in the interval 
6—«¢«< 6 < 64 e intersect again in c, they bound a 2-cell lying in F + co. 

Since the number of minimum points of order 1 is finite, there exists an interval 
6:42 containing @ but no other value of @ which furnishes a minimum point of 
order 1 with respect to A. Each value of @ except 6 in this interval furnishes a 
minimum point of order > 1, and hence to each such value of @ corresponds 
another value of 0, say 6’, which furnishes the same minimum point P. If 6 is 
close enough to 6, then P is very close to P, by Lemma 8 of (I), and from Lemma 
5 and the fact that P is of order 1 we conclude that 6’ is very close to 6. Thus if 
6 is close enough to 6, both 6 and @’ lie in 6,62 and also in the interval 6 — « < 
6<6+.. But according to the conclusion of the previous paragraph, the rays 
6 and @’ bound a 2-cell on S, and the same reasoning as that used in (1), p. 388, 
shows that the interval 06’ must include a value of 6 furnishing a minimum point 
of order 1. Hence 0; << 06<6< 0’ < Bort; < 6 << 8<6< 6. But in 
(I), pp. 388-389, it was proved that if the rays @ and 6’ bound a 2-cell containing 
just one minimum point P of order 1, that point P is an end point of m. Thus 
every minimum point of order 1 is an end point of m, and the set of minimum 
points of order 1 with respect to A is identical with the set of end points of m. 

We now prove by induction that the order of a point of m equals its order as a 


® See (I), p. 387 (top) and p. 388 (bottom). 
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minimum point with respect to A. Assuming the proposition true for all inte- 
gers n < q, we shall prove it for n = g. A point P of m of order q cannot be a 
minimum point of order w > q; for it can be shown that from the latter type of 
point issue w distinct 1-cells of the linear graph m in the same way that we 
showed that from a minimum point of order 2 issue two distinct 1-cells of m. 
Furthermore, P cannot be a minimum point of order < q, for part of our induc- 
tion hypothesis is that a minimum point of order z < q is a point of order z of m. 
Thus the order of a point of m equals its order as a minimum point with respect 
to A, and hence we have the following 

THEOREM 5. The order of a point of m equals its order as a minimum point with 
respect to A. In particular, the end points of m are identical with the minimum 
points of order 1 with respect to A. 

On the basis of this theorem, we see that if we parametrize the continuous 
curve m in terms of @, the order of a point of m must equal the number of values of 
6 furnishing that point. As @ ranges from 0 to 2m each 1-cell of m is traced out 
twice, while each 0-cell of m is covered a number of times equal to its order. 

The linear graph m is the singular image of a simple closed curve y: r = r(@) 
around the pole in the euclidean (r, @)-plane. The whole surface S can be got 
topologically by considering y and its interior with identification of certain pairs 
of 1-cellsony. But it is well known” that if an orientation is given to a polygon, 
the manifold constructed by considering the polygon and its interior with identifi- 
cation of pairs of sides of the polygon is non-orientable or orientable according as 
to whether or not the identification of at least one pair of sides of the polygon is 
made with the same orientation of the two sides concerned. Hence we have 

THEOREM 6. If S is orientable, as @ increases from 0 to 2x every 1-cell of m is 
traced out twice, once in each sense. If S is non-orientable, at least one 1-cell is 
traced out twice in the same sense. 

Thus the connectivity and orientability of the 2-dimensional manifold S, 
and hence the complete topology of S, can be determined from a knowledge of 
the 1-dimensional Betti number of m and the way in which m is traced out. 
Another way of stating this follows. The number of independent closed curves 
in m determines the number of generators in the fundamental group G of S, while 
the manner in which m is traced out determines the generating relation of G. 
Conversely, a knowledge of the topology of S determines the one-dimensional 
Betti number of m and to some extent the way in which m is traced out. 


3. The non-analytic case. We now consider briefly the case where the surface 
S satisfies certain differentiability conditions, but is not necessarily analytic. 
Suppose that S is a closed regular 2-dimensional manifold of class 5 with a line 
element of class C‘. 

Lemma 1 of (I) holds here with the change that x(r, @) and y(r, @) are no longer 
analytic, but only functions of class C*. In Theorem 1 of (I) the function f(r, 6) 
continued in the same way as in the analytic case becomes a function of r, @ of 


10 See, for example, Seifert and Threlfall, Topologie, p. 135. 
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class C? for all @ and allr > 0. This is also true of the function K(r, @). As for 
Theorems 2 and 3 of (1), the locus of first conjugate points to A is again a single 
point, a closed curve, or a set of one or more open curves, parametrizable as 
functions of class C? of 6. There may now be an infinite number of cusps of the 
locus even on a finite segment of it, but the number of curves in the locus can still 
be infinite only if a curve of the set can be found all of whose points are arbi- 
trarily far from A on the geodesics on which they are conjugate to A. Since the 
values of @ furnishing singular points of the locus form a closed set, such values 
cannot be dense on any interval J in 0 S @ S 27 unless all the values of @ in J 
furnish the same conjugate point, as seen from (3.10) of (I). Lemmas 4-9 of 
(I) hold unchanged. Lemma 10 must be modified to allow the possibility that 
M is a limit point of cusps of the locus of first conjugate points to A. In Lemma 
11, the are d now becomes a regular arc of class C*. 

Theorems 1, 2, and 3 of the present paper in no way depend on analyticity. 
Hence they hold for the more general class of surfaces. Thus the minimum point 
locus with respect to a point A is a continuous curve whose one-dimensional Betti 
number equals the connectivity number mod 2 of S. Hence it is locally a tree, 
and therefore a regular curve in the sense of curve theory." Theorem 4, how- 
ever, does not hold in the non-analytic case, for the proof of the finiteness of the 
number of end points, which has as a consequence that m is a linear graph, 
depends essentially on the analyticity of S. We can, however, give some restric- 
tions on the number of end points of m. 

In the first place, the method used in the proof of Theorem 4 can be used here 
to show that if A can be joined to P by two geodesics of class @, then either two 
distinct 1-cells of m issue from P or else a whole interval of values of @ furnish 
geodesic rays of class @ joining A to P. In the latter case we shall say that P is 
a minimum point of order J. Thus every end point P of m is either a minimum 
point of order 1 or a minimum point of order J. In the first case, P is conjugate 
to A on the unique geodesic of class (@ joining it to A, by Lemma 9 of (I), while 
in the second case P is conjugate to A on each geodesic ray of the whole closed 
interval of rays of class @ joining A to P. Furthermore, such a point P is al- 
ways a singular point of the conjugate point locus C; in fact, it is either a cusp of 
C or a limit point of cusps of C. From the reasoning used a few paragraphs back, 
the set of values of 6 furnishing end points of m cannot be everywhere dense in 
0 < @ < 2z, nor can it be dense on any subinterval unless all values of 6 on that 
subinterval furnish just one end point. 

According to our modification of Lemma 11 of (I), an are of m containing no 
point conjugate to A and no interior point of order > 2 is a regular arc of class C*. 

Theorem 5 also is not exactly true for the non-analytic case. We have already 
seen that every end point of m is either a minimum point of order 1 or of order J. 
A minimum point of order J may have any order whatever as a point of m. 
However, we can still prove that a minimum point of order 1 is an end point of 
m. In the first place, this is true for an isolated minimum point of order 1 by 


11 See Menger, loc. cit., p. 96. 
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exactly the same proof as in the analytic case. For a general minimum point of 
order 1, we can give the following proof. 

Let P be the minimum point of order 1, and suppose that P is of order > 1 as a 
point of m. Then from P issue at least two distinct arcs e; and e2 contained in 
m. If the neighborhood o of P used in the proof of Theorem 5 is small enough, 
e, and e, together divide ¢ into two parts o; and ¢2. One of these, say o;, contains 
part of the geodesic ray @ = 8 on which P is a minimum point with respect to A. 
For 6 — « < 6 < 6+ «, the minimum points with respect to A lieine. Further- 
more, since the geodesic rays from A in this interval remain close to 6 = 8, all 
these minimum points lie in o; or on e; + é2, for the rays cannot cross e; + é2 
without losing their minimum property. Thus any geodesic are from A for 
6—«< 6 < 64 e joining A to a point in cz cannot be of class @. Consider a 
sequence of points P; in o2 approaching P, and join them to A by geodesics 
@ = 0;of class@. By Lemma 6 of (i), 0; > 6. This gives a contradiction, and 
hence P cannot be of order > 1 as a point of m. Hence P is an end point of m. 

Following the same induction method used in the analytic case, we can prove 
that a point of order n (n finite) of the locus m is either a minimum point of order 
n with respect to A or a minimum point of order n — q + qJ, where q S n; in 
other words, a point of finite order n of m has as its inverse image n connected 
pieces of the @ axis. If P is a point of” order w of m, it is a minimum point of 
order M + NI, where M + N =>. P cannot be a point of order > w of m, 
for m is a regular curve. 

As for Theorem 6, by a rather complicated proof it can be shown that the 
following is true in the non-analytic case. As @ increases from 0 to 2z, if S is 
orientable, every 1-cell contained in m which has no cyclic branch" is traced out 
twice, once in each sense. If S is non-orientable, at least one 1-cell in m is 
traced out twice in the same sense. 


4. Examples. As a first example, let us consider the projective plane of 
constant positive curvature. This manifold is obtained from a sphere by identi- 
fying diametrically opposite points. Let A be any point on a sphere S of radius 
a, A’ the opposite pole. Then since A and A’ become identical on the projective 
plane p obtained from S, every geodesic through A on p is closed, and of length 
xa. The minimum point with respect to A on a geodesic ray g issuing from A on 
p is at a distance of ra/2 from A along g. Thus the minimum point locus with 
respect to A on p consists of a circle, traced twice in the same sense as @ increases 
from 0 to 2x. The only point conjugate to A on pis A itself. 

On the ordinary torus in 3-dimensional euclidean space the situation is more 
complicated.“ Let A be a point on the outer equator. The minimum point 


12 See Menger, loc. cit., p. 100. 

13 By a 1-cell contained in m we mean a subset of m homeomorphic to a 1-cell. 

14 A 1-cell e contained in m is said to have no cyclic branch if every closed curve in m 
containing any point of e contains the whole of e. 

18 For a study of the geodesics on a torus, see Bliss, Annals of Mathematics, vol. 4 (1902- 
3), pp. 1-21. 
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locus with respect to A consists of (a) the inner equator, (b) the meridian. circle 
through the point A’ diametrically opposite to A on the outer equator, and (c) 
two ares of the outer equator issuing in opposite directions from A’. Roughly 
speaking, the locus consists of two closed curves intersecting in one point plus two 
branches issuing from a point of one of the closed curves. The minimum point 
locus with respect to any point whatever of the torus contains two non-bounding, 
non-homologous closed curves; the locus with respect to any point of the inner 
equator consists entirely of two such closed curves, since the points on the inner 
equator are perfect poles (i.e., points without conjugate point). 

On any orientable surface of genus p > 0, the locus with respect to any point 
contains 2p closed curves, which form a basis for the 1-dimensional homology 
group mod 2 of S as well as a set of generators for the fundamental group of S. 
With respect to a perfect pole of such a surface, the locus consists entirely of 2p 
closed curves. Thus on a closed orientable surface of zero curvature or constant 
negative curvature, where every point is a perfect pole, the locus with respect to 
any point whatever consists entirely of 2p closed curves. This has an obvious 
relation to the well-known method of representation of closed orientable surfaces 
as polygons of 4p sides and their interiors in the euclidean (p = 1) or hyperbolic 
(p > 1) planes with identification of pairs of sides in the proper manner. 
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CONVEX POLYHEDRA AND CRITERIA FOR IRREDUCIBILITY 
By Casper SHANOK 


1. Introduction. This paper gives an application of Minkowski’s' theory of 
convex polyhedra to the construction of irreducibility criteria for polynomials 
in several variables, thus generalizing the results of Dumas? for polynomials in 
one variable obtained by the use of convex polygons. The results of Minkowski 
referred to concern the nature of the least convex polyhedron determined by the 
set of points {p} where each point p is of the form }> s;p;, each s; a constant > 0 
and each p; a point of a given polyhedron K;. As a special case, these results 
include the case of the polyhedron K determined by two given polyhedra K, 
and K, with s,; = s;=1. Thisis the case which is of interest to us for the results 
that follow. For reasons obvious later, we shall term K the product, rather than 
the sum, of K, and K, and shall denote this relation by the notation K = K,. Ko. 


2. Decomposability. If now we consider the converse problem, namely, 
given K to determine K, and Ky such that K,-K, = K, we find first of all that 
we must assume we are dealing with polyhedra whose vertices have integral 
coérdinates, for otherwise K, and K, can be chosen in an infinity of ways, i.e., 
the problem has no meaning. We likewise impose the further restriction that 
neither of the factors of K shall be a point, since this would simply amount to a 
translation of K with no accompanying change in its shape. If, under these 
conditions, it is possible to determine two polyhedra K, and K¢ (either or both 
may be lines or polygons) such that K,-K, = K, we say that K is decomposable. 
We proceed to set up necessary conditions for the decomposability of K by 
considering its projections on the coérdinate planes. 

First projecting the vertices of K on the zy-plane, we determine the least 
convex polygon containing this set of points. This polygon we name the zy- 
boundary polygon of K and denote by b.,. We have at once the result that for 
K to decompose it is necessary that each of the three boundary polygons of K 
decompose.’ For if K decomposes into K, and Ke, the product of the zy- (or 


Received by the Editors of the Annals of Mathematics July 2, 1934, accepted by them, 
and later transferred to this journal; presented to the American Mathematical Society 
April 14, 1933. The author wishes to express his gratitude to Prof. O. Ore for his assistance 
in the preparation of this paper, which is an abstract of a dissertation presented for the 
degree of Doctor of Philosophy in Yale University. 

1H. Minkowski, Gesammelte Abhandlungen, vol. 2, pp. 131-229. 

2G. Dumas, Sur quelques cas d’irréductibilité des polynémes a coefficients rationnels, 
Journal de Mathématiques Pures et Appliquées, (6), vol. 2 (1906), pp. 191-258. 

’ By extending Dumas’ result in the plane to cover the entire least convex polygon, it is 
possible to show that the decomposition of a plane polygon amounts to the division of the 
sides of the given polygon into two sets such that the sides of each set, translated, wherever 
necessary, form a closed convex polygon, each side keeping its outer normal unchanged. 


103 














104 CASPER SHANOK 


xz or yz) boundary polygons of K, and K¢ is the ry- (or xz or yz) boundary poly- 
gon of K. 

The condition stated above is, in fact, even more stringent, for not only is it 
necessary that b,,, for example, decompose, but further that to each point a 
of the decomposition set‘ in question there correspond a lattice point on the side 
or sides of K whose projection contains a. 

To obtain a stronger necessary condition, we next project the faces of the 
lower (or of the upper) part of K on the ry-plane, getting a network N of non- 
overlapping convex polygons filling up the interior of 6,,. Now assuming that 
K decomposes into the product of A, and K2 and forming the two similar net- 
works N, and N» for K; and Ke, respectively, let us determine the relation 
between these networks. In the first place, as stated above, the product of the 
xry-boundary polygons of K,; and Kg is identical with that of K. In the second 
place, it follows from Minkowski’s work® that every elementary polygon’ of N 
is either an elementary polygon of A, or Ke translated, or the product of parts of 
two elementary polygons, one of K, and the other of Ky. This leads to the 
second necessary condition for the decomposability of K, namely, that it must be 
possible to decompose N (and each of the other five similarly determined net- 
works) by decomposing each of its elementary polygons and piecing the decom- 
positions together to form two similar networks. 

In concluding our discussion of decomposability, mention should be made of 
the exceptional, though somewhat trivial, case that K has as a factor a line seg- 
ment parallel to one of the axes, say the z-axis. Due to the fact that such a 
factor has a point as its projection on the ry-plane, the fact that the xy-boundary 
polygon (or that either of the two networks bounded by the zy-boundary poly- 
gon) is indecomposable (i.e., is indecomposable except for a possible point 
factor) need not imply that K is indecomposable, but only that K is indecomposa- 
ble except for a possible line factor parallel to the z-axis. 


3. The general criterion for the case of three variables. Now applying the 
results above to the construction of criteria for irreducibility, let us first con- 
sider the case of polynomials of three variables with coefficients belonging to 
some fixed field. Letting F(z, y,z) = }> Aasyx*y®27 be such a polynomial, we 
plot the set of points {(a, 8, y)} and determine the least convex polyhedron 
containing these points. Regarding this polyhedron, which we shall term the 
polyhedron of F, we proceed to prove the following 

THEOREM 1. Let the reducible polynomial 


F(z, Y) z) = F\(2, Y; z)- F(z, Y; z). 
Then K = K,- Ko, where K, K, and Kz are the polyhedra of F, F, and F2 respectively. 


‘ By a decomposition set we mean the set of lattice points containing (1) all the vertices 
of b,, and (2) such lattice points as mark the points of division of those sides of b,, which 
are broken up in the decomposition of b,, in question. 

5 Loe. cit., p. 186. 

6 By an elementary polygon we mean the projection of a face of K. 
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Proof. Let Fi(z, y, 2) = DY, Adige ty" 27" 


i do de Pr ale ye’ 27". 


and F(x, y, 2) 
Then 


(1) F(z,y,2= > . A aiprys Aegrigerg ee LUTE yB 48" gy! ty” 
a Bey! a By” 


In the expanded product (1) let there be m terms’ containing x*y®z7. Setting S 
equal to the sum of these terms, we have 


(2) S = (bh + --- + by)rty®2? = Ags yx*y* 2”. 


We note here that S may drop out if m = 2, but not ifm = 1. 

A study of the above relations shows that proving K = K,- Kz amounts to 
proving that the sets {(a, 8, y)} and {(@’ + a’’, B’ + B”’, y’ + y’’)} determine 
the same polyhedron. From relations (1) and (2), it follows at once that every 
member of the set {(a, 8, y)} is a member of the set {(a’ + a’’, B’ + BY’, y’ + 
y’")}, ie., K is actually contained in or equal to Ki-Ky. Noting that the con- 
verse statement, that every member of the set {(a’ + a’’, B’ + B’’,y’ + y"’)} 
is a member of the set {(a, 8, y)}, is not necessarily true since b; + --- + dD» 
may equal zero if m = 2, it remains to show that every member of the set 
{(a’ + a’’, B’ + BY’, y’ + y’")} which is likewise a vertex of K,- Kz is a member 
of the set {(a, 8, y)}, thus eliminating the possibility that K is actually con- 
tained in K,-K». But this follows at once from the fact that each vertex of 
K,- Kis uniquely determined,’ i.e., if the point (a, 8, y) is a vertex of K, there is 
just one term Aj.,, yr t® y®' 27’ of Fi(z, y, 2) and just one term , ze’ yh" 20" 
of F(z, 4 y, z) such that (a’ + a’’, B’ BY, y + y"’) = (a,B,y). Since a y 
and Foe y ¥ 0 by hypothesis, A. a’s'y" -An agry 0, ie., (a’ + a’’, B’ + 8”, 
vy’ + y’’) isa member of the set {(a, 8B, y)}. This cumaletes the proof. 

From this theorem it follows immediately that for F(z, y, z) to be reducible,’ 
it is necessary that its polyhedron decompose. 


4. The general criterion for the case of two variables and one prime. We 
next proceed to obtain a criterion of a similar nature for polynomials of two 
variables with coefficients in the field of rational numbers. Letting F(z, y, z) be 
such a polynomial, we write it in the form 


F(z, y) = D0 Aas,p?z*y*, paprime, Aas, 4 0 (mod p). 


Before establishing the criterion, let us first examine the nature of the product 
of two polynomials of the above form. Letting the reducible polynomial 


F(z, y) as F,(z, y)-F.(z, y), 


7m may be equal to 1. 

8 H. Minkowski, loc. cit., p. 181. 

® Factors which are powers of z and y are excluded, as such a factor corresponds to the 
excluded case that K hasa point factor. Such a factor can, of course, always be determined 
by inspection. 











106 CASPER SHANOK 


where F(z, y) = +h Agigry! py x’ y*®, howe # 0 (mod p) and F,(z, y) = 
De Av regrrge PU ee 'y®", A Qrigryr ¥ O (mod p), then 


(1) F(z, y) — 7 Z. Bia gpa prt” geiter afte” | 
a By’ al By” 

Now in the expanded product (1) let there be m terms’ containing x*y*, and let 

px be the lowest power of p occurring in these m terms. Setting S equal to the 

sum of these terms, we have 


S= (bi p* +o + bm p*i) ey? = Agsypr*y’, Aasy ca 0 (mod p). 


Noting that if there is just one term containing p’*, y = p,, and that if there 
are two or more terms containing p’*, y = p,, we see that unless S drops out, 
S is always replaced in the contracted product by a single term in which the 
power of p= is at least as large as the lowest power of p occurring in S, i.e., the 
point (a, 8, y) coincides with or lies above the point (a, 8, px). 

With this in mind, we turn to a consideration of the least convex polyhedra 
K, K, and K, determined as before by the sets of points { (a, 8B, y)}, {(a’, 8’, y’)}, 
and {(a’’, B’’, y’’)}, respectively. From the fact that one point (a, 8, y) may 
replace several different points of the form (a@’ + a’’, B’ + B’’, 7’ + 7’’) as 
indicated above, it is quite clear that the statement K = K,-K, need not be true. 
To obviate this difficulty, we must replace these finite polyhedra by certain 
polyhedra infinite in the direction of the positive z-axis. These polyhedra, 
which we shall term newtonian polyhedra and which we shall denote by N, Ni, 
and Nz respectively, are formed by constructing prism-like figures extending 
upwards indefinitely and having the lower parts of the original polyhedra as 
bases. A set of points defining any one of these may then be got by adding to 
the vertices of its base the infinite set of lattice points lying on those of its edges 
which are parallel to the z-axis. The product of Ni and Nz is then defined as 
before, i.e., if N; and Nz are defined by the sets of points {(ay,, Bw,, yw,)} and 
{ (aw, By, Yx.)}, respectively, their product is defined by the set of points 
(aw, + awn, By, + Bvy Yx, + ¥w.)}. By extending Minkowski’s work, it then 
follows that N,-N2 as defined above is identical with the newtonian polyhedron 
formed on the base of K,-Ke. We are now ready to establish the criterion 
referred to above, by proving the following 

THEOREM 2. Let the reducible polynomial 


F(x, y) = Fi(x, y)-Fe(2, y). 


Then N = N,-No, where N, Ni and N:2 are the newtonian polyhedra of F, F; and F2, 
respectively. 

Proof. We may prove this by showing that each vertex of the base of Ni-N2 
is a point of the set { (a, 8, y)} and that the remaining points of the set { (a, 8, y)} 
lie on or above the base of Ni-N2. 

We start, therefore, by showing that if A = (a, 81, y:) is a vertex of the base 
of N-Ne, it belongs to the set {(a, 8, y)}. Since A is a vertex of the base of 
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N,-N2, it is a member of the set {(a@’ + a’’, B’ + B’’, y’ + y’’)}, ie., (1) con- 
tains a term b;p"x“y*. Further, since A is also a vertex of K,-Ko, A is de- 
termined uniquely, i.e., of the terms of (1) containing z*'y*', only one contains 
p”'. Moreover, of the powers of p contained in these terms, 7; is the smallest, 
since A belongs to the base of Ni-N2. Now applying the results obtained above, 
we see that A must be a member of the set {(@, 8,y)}. This completes the first 
part of the proof. 

Turning now to the second part of the proof, we saw above that each point 
(a, 8, y) coincides with or lies above the point (a, 8, px), where (a, 8, px) is a 
member of the set {(a’ + a’’, B’ + B”, y’ + y’’)}, ie, is a point of Ki-Ke 
and hence lies on or above the base of Ni-N»2. Therefore, a fortiori, the point 
(a, 8, y) lies on or above the base of Ni-N». This completes the proof. 

A direct consequence of this theorem is the necessary condition we sought to 
establish, namely, that for F(z, y) to be reducible (excluding factors which are 
powers of x and y as before) it is necessary—but not sufficient—that its newto- 
nian polyhedron decompose." 


5. Special criteria. We next proceed to combine the general criteria 
obtained in §§3 and 4 with the necessary conditions for the decomposability of 
polyhedra obtained in §2 to get certain special classes of algebraic polynomials 
concerning whose factorability we can formulate definite conclusions. 

TuHeoreM 3. Let 


f(x,y) = a pra y® + ay pte y* + Dla: ps am ys (i # 1,2), 


where the line joining the points A = (a, Bi, r) and B = (as, Be, 8) has no lattice 
points, where, further, the points (a;, B;) lie on or within a triangle having the line 
joining the points a = (a, B:) and b = (ae, Be) as a side, and finally, where for all i 
such that (a;, B:) lies on the line ab, i.e., such that (a; — a)/(8i — Bi) = 
(a2 — a) / (Be aes Bi), 





werti+| oe (a: — a) |. 
ae—- Q 
Then, if f has no factor which is a power of x and y, f is irreducible. 

Proof. In this case b,, has as one side the line ab. Moreover, the remaining 
vertex or vertices lie on or within a triangle with ab as a side." Now by hy- 
pothesis, a is the projection of a point A of P and b of a point Bof P. From the 
condition on the y’s, it then follows that no points lie directly below AB, i.e., AB 


10 As before, to decompose a newtonian polyhedron means to find two newtonian 
polyhedra whose product is identical with the original polyhedron. Further, in order that 
a newtonian polyhedron decompose, it is necessary that its ry-boundary polygon decom- 
pose and that the network formed by projecting the lower part of the newtonian polyhedron 
on the zy-plane decompose. 

11 The reader is asked to construct the figures. 
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is a side of the lower part of P. Moreover, from the condition that AB has no 
lattice points, it follows that ab is indecomposable. Hence b., is indecompos- 
able, and the theorem follows. 

In connection with this theorem, it is interesting to note that in its present 
form it includes as a special case a certain theorem of Glenn" for the case of three 
homogeneous variables. Glenn’s theorem" applies to a homogeneous poly- 
nomial in n variables, and it is easily seen that even our general theorem may 
be extended to this case. It should be noted, however, that the specialization 
of our theorem to Glenn’s case gives a more precise result, since Glenn’s theorem 
contains an extraneous condition." 

TuHeoreM 4. Let 


f(z, y) = a p" am y” +. ae p* gmta yrth + a3 p* gmti: yr tis + a a; pr zai yi 


(t1, te = 0) ’ 


where each point (a;, B;) lies on or within the parallelogram formed by the lines 
joining the points a = (m,n), and d = (m + % + %2, nm + ji + Je) to the points 
b = (m + ti, n + ji), and c = (m + tn, n + je). Then if (r — 8, 1, fi) = 
(t — r, ts, j2) = 1, af, further, for all i such that the point (a;, B;) lies on the line ab 





werti+ [2S wm], 


t 


and finally, if for all i such that the point («;, B;) lies on the line ac 
vi ee¢i14 [4S @—m], 
2 


1220. E. Glenn, Theorems on reducible quantics, Annals of Mathematics, (2), vol. 14 
(1912-1913), p. 30. 

13 To state his theorem, Glenn first defines normal order as follows: Two sets of p numbers 
(ki, kz, +++ , kp), and (Ax, Ax, «++ , Ap) Occur in normal order if the set first to show, when 
read from right to left, a number greater than the number in the corresponding position in 
the other set occurs farthest to the right. Then assuming that the terms are arranged so 
that the subscripts of the coefficients are in normal order, Glenn states the following 
theorem: A set of necessary conditions that a form f all of whose coefficients within the 
interval 


p—p ul p—p—v—l ute 
ee a ee ae 
\°O---O mtn 0---0? » CD. Omtn OO 


are divisible by a prime gq be reducible in the absolute field is given by 


a we 
Spee =O(modq) (¢=yn,H+1,---,u+¥). 


(The starred number is not included in the interval but only shows the upper limit of the 


interval.) 
14 The extraneous condition referred to is the condition that the coefficients of the terms 
combining the letters 21, 22, +--+ , Zp—y—»-1 With the letters Zp_yo41, Zp—p_v4zy *** » Tp—usi be 


divisible by g, a condition which follows from the fact that these terms are included in 
the interval J by the definition of normal order. One way of showing that this condition 
is extraneous is by noting that these terms do not affect the indecomposable side of bz,, 
and hence f is irreducible even if these terms are not divisible by q. 
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then, factors which are powers of x and y being disregarded, either f is a product of 
the form 


f(z, y) = ( 2, ts” y's) ( D as x ris) 
a=0 B=0 

(only such values of a and B being taken which make aj;/i; and Bj2/iz integral) 

or f is irreducible. (f may be a product of the indicated form only if f has a term 

containing x™ tits y™ it), 

Proof. Under these conditions b,, has as two sides the lines ab and ac. 
Furthermore, the remaining vertex or vertices lie on or within the parallelogram 
abdc. But ab and ac, each being the projection of an indecomposable side of 
the lower part of P, are likewise both indecomposable. Now let us note that a 
parallelogram with two adjacent sides indecomposable decomposes uniquely 
into the product of two line segments, one equal to one of these sides, and the 
other equal to the other side, and that any other convex polygon having two 
adjacent sides in common with this parallelogram, and its remaining vertices 
on the sides or in the interior of the parallelogram, must be indecomposable. 
It then follows that if b., has a vertex at (m + % + 72, n + ji + je), bey decom- 
poses uniquely into the product of the two line segments joining (0, 0) to (%, j:) 
and to (%2, j2); otherwise b,, is indecomposable. In the second alternative, of 
course, f is irreducible. In the first alternative, the factors of P corresponding to 
the line segments joining (0, 0) to (7, j:) and to (72, je) are polygons situated 
entirely in the planes y = jiz/t; and y = jor/te, respectively, i.e., f may have 


‘1 t2 
factors of the form >> ag 2% y®/* and >> ag x8 ySis!*, Then since the decom- 
a=0 s=0 


position of b,, is unique, factors which are powers of x and y being disregarded, 
either f is a product of the form 


f(x, y) = (= Ae x* y's) (> a3 7 y's) ’ 


or f is irreducible. This completes the theorem. 
TuHeoreM 5. Let 


S(x,y) = ar pce ys + ag pram ys + Dla py am ys = (i # 1,2), 


where the line joining A = (a, fi, r) to B = (ae, Bs, 8) has no lattice points, where 
the points a = (a, Bi) and b = (ae, Be) lie on b.,, where the vertices of bz, on each 
side of ab lie on or within a triangle with ab as a side, where further the line AB is 
an edge of P, and where finally for all i such that (a;, 8;) lies on the line ab 





vi zr+i+| — (a — a) | 
ae- GQ) 
Then, if f has no factor which is a power of x and y, f is irreducible. 
Proof. In this case, from the condition on the y’s, it follows that there are 
no points of P directly below the line AB, i.e., AB is a side or a diagonal of a face 
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of the lower part of P. But, by hypothesis, AB is an edge, i.e., its projection ab 
appears in the network formed by projecting the lower part of P. Moreover, 
since AB is indecomposable, ab is likewise indecomposable. Now let us note 
that if a network satisfies the conditions (1) an indecomposable side ab of an 
elementary polygon is a diagonal of its boundary polygon b, and (2) on each 
side of ab the remaining vertices lie on or within a triangle with ab as side, the 
network must be indecomposable. It then follows that the network formed 
by projecting the lower part of P is indecomposable. Hence it follows that, if 
f has no factor which is a power of x and y, f is irreducible. 

We note here that Theorems 3, 4, and 5 are equally valid (with one exception 
noted below) if the fixed prime p is replaced by the variable z. And, what is 
more, they are equally valid (with this same exception) if the conditions de- 
signed to make certain lines part of the lower part of the polyhedron are replaced 
by similar conditions designed to make these lines part of the upper part of 
the polyhedron and p is replaced by z. Thus Theorem 4, for example, is valid 
(with this same exception) if, in either or both conditions on the y’s, the sign = 
is replaced by the sign <, when p is replaced by z. The exception mentioned 
is that f(x, y, z) may have a factor of the form a; + agz + --- + a,2". This 
possibility arises from the fact that the indecomposability of b., does not pre- 
clude the possibility of a point factor of b.,. But such a point factor of b., 
may be the projection of a line factor of the polyhedron, parallel to the z-axis, 
and may thus give rise to a factor of f(z, y, z) of the form indicated. However, 
the existence or non-existence of such a factor can always be determined by 
elementary methods without resort to polyhedra. 


6. The case of one variable and two primes. We conclude this paper 
by a few remarks on the treatment by Fujiwara" of a problem of a similar nature, 
namely, the application of the polyhedra method to polynomials in one variable 
written in the form f(z) = >> Aas, p* g® x7, p and q primes and Ags, # 0 
(mod p,q). After defining K’(f) to be that part of the surface of the poly- 
hedron of f(x) for which the direction of the inner normals lies within the domain 
(x = 0, y = 0) and assuming that A(x) = f(x)-g(x), Fujiwara states the fol- 
lowing result: If K’(f) and K’(g) have no parallel faces, then K’(h) contains 
every face of K’(f) and K’(g), and only these faces, unchanged as to shape and 
direction and only changed as to position; and furthermore, that if K’(f) and 
K’'(g) have a pair of parallel faces 7(f) and z(g), then K’(h) has a face r(h) 
parallel to these faces and the boundary of r(h) is the product polygon of the 
boundaries of r(f) and x(g). By the application of these results to polynomials 
f(x) so constructed that K’(f) shall consist only of a single triangle, Fujiwara 


16M. Fujiwara, Uber Kriterien fiir Irreduzibilitat ganzzahliger algebraischer Gleichungen, 
Tohoku Mathematical Journal, vol. 17 (1920), pp. 10-17. 
16 Loc. cit., p. 14. 
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then derives the result’ that 
J (x) = aor’ + ay pq? xt + a2 pg’ x* + as pig? x? + ay pg x + as p*g', 
where all the a’s ¥ 0 (mod p, q), is irreducible. 

However, in view of the fact that the congruence properties of p and q are 
entirely independent, it appeared to us A priori unlikely that results of so general 
a nature could be true. Indeed, by taking f(r) = aor? + ayp*gz® + arp*x? + 
asp*q* and g(x) = bor + bip*gx* + beq*x*? + bsp*g?, where the a’s and b’s are all 4 0 
(mod p, gq), and assuming that abo + acb; # 0 (mod p, gq), we find that K’(h) 
does not contain the faces of K’(f) or of K’(g), in contradiction to Fujiwara’s 
general result above. In fact, this example illustrates the more or less complete 
breakdown of the polyhedra method in this case, for not one of the faces of the 


polyhedron of f(x) or of g(x) appears in the polyhedron of h(x). Furthermore, 
if we now take 


f(x) = 40,755,50425 — 26-5-11** — 696-5-117%2* — 54.1152? — 5®. 11x — 
3, 261-58-115, 
we find that f(z) is reducible into the product of f’(z) and f’’(x), where 
S'(x) = 40,755,50425 — 1,630,346-5*- 112? + 81,514-5°.1122 — 3,261.57-11° 
and 
f"@) =245-llr+5-1P, 


in contradiction to Fujiwara’s special criterion above. 

In conclusion, we wish to state that whether or not the above results can be 
supplemented by results of a positive nature for this case remains to be de- 
termined. However, the above considerations seem to indicate that there is 
little, if any, possibility of extending the polyhedra method to this case. 


YALe UNIVERSITY. 


17 Loc. cit., p. 17. 














FOURIER SERIES CONVERGENCE CRITERIA, AS APPLIED TO 
CONTINUOUS FUNCTIONS 


By J. A. CLarKson AND W. C. RANDELS 


Whether or not there exists a continuous function whose Fourier series di- 
verges everywhere, or almost everywhere, or on a set of points of positive 
measure, remains an unsolved problem. If a local condition is known which is 
sufficient to insure convergence of the Fourier series at a point, one is naturally 
led to raise the same question about the local condition itself: do there exist con- 
tinuous functions which violate it at every point? For the criterion of Jordan, 
for example, the answer is clearly yes; for the more recent and more delicate 
criteria the question presents greater difficulty. Mazurkiewicz' and Kaczmarz* 
have shown that the answer is also affirmative in the case of the Dini criterion. 
It is the purpose of this note to answer this question for several more general 
convergence criteria. 

Given any continuous function f(z), which is periodic with period 27, we 
define 


of; 2; = ot) =f@+) +f@— db — Fe) 


ait = > (- » (Hot +0), 


j=0 


and 


We first consider the condition 


(Ly) lim / : | A‘ g(t) | dt = 0 
sso Js f 

for a fixed integer k; any condition L, insures convergence of the Fourier series, 
and the conditions are increasingly general; that is, L; implies Li,1. Ly, is the 
familiar Lebesgue criterion. 

Let C be the space of continuous functions, periodic (27), with the customary 
norm. We first prove 

THEOREM 1. For any positive integer k, the subset A C C of functions such that 
for each x we have 
lim | *| abe |dt=4« 

3 


6—+0 


is of the second category in C, and its complement is of the first category. 


Received January 8, 1935 by the Editors of the Annals of Mathematics, accepted by them 
and later transferred to this journal. The second named author is Sterling Research 
Fellow at Yale University. 

1 Studia Mathematica, vol. 3 (1931), p. 114. 

2 Ibid., p. 189. 
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We shall show that Theorem 1 is a consequence of the following theorem of 
Banach 3° 

THEOREM A. [Banach]. Let the operation U(f, x, 5), which makes correspond 
to every element f of a Banach space B, every number x (—x S x < 1) and every 
number 6 (0 <6 S 1) areal non-negative number U, satisfy the following conditions: 

(i) For 6 fixed, U(f, x, 5) is continuous in f and x. 

(ii) UC, z, 6) sa U(-f, zt, 6) ’ 

U(f + 9, x, 6) S U(f, x, 5) + UG, z, 6) . 

(iii) There exists an everywhere dense set H C B such that for every weH, 
U (w, x, 6) ts a bounded function of x and 6. 

(iv) Given r, M > 0, there exists an element geB, || g || < r, with 

sup U(g, z,5) > M(—-r S 2 <7). 
é 


The set D, of elements f in B such that for all x 
lim U(f, 2, 6) =+, 


6-—+0 


is of the second category in B, and a complement of a set of the first category. 
Let C be the Banach space in question; clearly the operation 


U2 = [abe la 


satisfies conditions (i) and (ii) of Theorem A, and condition (iii) is satisfied by 
taking for H the set of trigonometric polynomials. Since U(ef, x, 6) = | c| 
U(f, x, 6), condition (iv) will be satisfied if there exists in C a sequence {g,(zx)} 
with || g, || S 1, and lim [inf sup U (gn, 2, )] = + «. We proceed to show 


that g,(z) = | sin nz | is such a sequence. 
By virtue of the periodicity (x/n) in x of U(gn, x, 6), we need only establish 

that 

lim [ inf sup U(g,,2,6)]=+°. 

no LOSzSr/n 6 
Let 2» be that solution of 2sin z — +/2 cos x = 0 which lies in the interval 
(0, r/4). Let I} ( = 1,2,--- ,8;n = 1, 2,3, --- ) bethe intervals (0, 2/n), 
(xo/n, /4n), [x /4n, (x — 2x9)/2n], [(x — 2xo)/2n, r/2n], ete. We shall show 
that corresponding to each »v there are three constants a,, 6,, c, (0 S a, S 7/4, 
0 < 6, S r/2,c¢, > 0) such that‘ forz eI*, 


| A5 in en(a,/n) | > ¢, (n = 1, 2, 3, --- ). 


If we assume this to be true, since 5 {Az n Gn(t)} = O(n) uniformly in z and 
t, it will follow that there will exist an interval (a/, a) containing a,, of length 


3 Banach, Uber die Bairesche Kategorie gewisser Functionenmengen, Studia Mathematica, 
vol. 3 (1931), pp. 174-179. The changes in statement are non-essential. 
‘In the following we employ the notation ¢(g,, z, t) = ¢n(t). 
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h,, such that for a{/n < t < a//n and rel” we have [Asin ¢n(t) | > c,/2 
(n = 1, 2,3,---). There is such an interval within each interval 


[mx/n, (m + 1)x/n] (m = 1,2,---,n—1), 
whence we have, for all x eJ*, 


n-1 


” 1 
bas tab, ne ld z >) [ 
. m=1 ™ 


U(gn, x, 6,/n) = | 


(m+1)4/n 


1 ?, 
7 | AS, in en(l) | at 


x/n 


n—-1 n—-1l 


Cy n h, eh, 


=L12 (m+ir'n aeest 





Thus, uniformly for z «I? , U(gn, x, 6,/n) + © as n — «, and the desired prop- 
erty of the sequence will be established. It merely remains, then, to show that 
the constants a,, 4,, c, can be chosen as stated. 

For x eI}, we choose a; = 0, 6; = 7/2. Then 


k 


At /2_ ¢a(0) = >) (— 1) (‘) eal jx/2n) 


(2 (3 
-> be a — > (0; 1) e0(n/2n) 


= ; [en(0) — ¢n(w/2n)] > (‘) = (cos nz — sin nz) > (‘) : 


7=0 
This clearly exceeds some c¢; > 0 for x eI}. 
For z eI} we take az = 52 = 7/4. Then we have 


k 


At jan on(w/4n) = >) (— 1) (') ” (F + x) 


Li] ew 
o 2 *) on(r/4n) — 2 Fis :) ¢n(/2n) 
(F] wed 
+2 (4; 4 2) eG /40) - 2, (4543) 
[3] (F) 


= (+/2 cos nz — 2 sin nz) vt (*) — 2(cos nx — sin nz) 2 i ) 


7=0 
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(F) 
” . k 
< (\/2 — 2) cos nz 2, Bea 1): 
This again is bounded away from zero uniformly for z e/>. 

The remaining six cases are handled in similar fashion. We omit the compu- 
tation, but supply the following table showing, for each value of v, the values 
of a, and 6, and the value of | Aj, yn n(@,/n) | or a function which is less than 
the latter in absolute value as in the second case above. 








Interval | a, é, | ai 7 ¢n(a,/n) | 
 (/k 
0, xo/n 0 x/2 | = (cos nx — sin nx) ( ' 
: 2 ; 





Xo/n, r/4n 1/4 1/4 > (2 — V/2) cosnzx 2; ( A ) 





m/4n, (r — 2x9)/2n r/4 | r/4 | = (2 — V2) sin nz b> Pha :) 





(x — 2x)/2n, r/2n 0 r/2 | = (sin nz — cos nz) » (‘) 








a/2n, (x + 220)/2n 0 r/2 | = (sin nz + cos nz) p> (‘) 
7=0 
ers 
(x + 22x0)/2n, 34/4n r/4 | 2/4 | => (2 — V2) sin nz 4j + :) 

















x 
= k 
32 /4n, (x — 2o)/n r/4 | 2/4 | = (V2 — 2) cos nz ) 
/ 2 4j+1 
~\ (k 
(x — 2o)/n, r/n 0 r/2 | = — (sin nz + cos nz) pF (‘) 
7=0 





The sequence {g,(x)} has, then, the required property; Theorem A may be 
applied, and our result follows at once. 
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Gergen has given a complete analysis of the various known criteria for con- 
vergence of a Fourier series, and suggested certain generalizations. In par- 
ticular he shows that a sufficient condition is given by 


(Gi) lim im f 1) At g(t) |dt = 0. 
t+0 30 Jeol 
As in the case of the L;, G; implies G.41, and G, is implied by all of the previously 
known criteria mentioned by Gergen. Our theorem allows us to infer at once 
THEOREM 2. For any positive integer k, the set D C C of functions f such that 
for each x we have 


lim im | 1) Ak oft) | dt = + x 
gto so Jes l 


is of the second category in C, and its complement is a set of the first category. 
Proof. By virtue of Theorem 1, it will be sufficient to show that at a fixed 
point x the condition 


(1) fim [" 1) abe |at= + = 
s+o Ja ft 
implies 
lim Jim [Flatow |a = +o. 
E—+00 6—-+0 J 


Assume (1) to be true; then for any fixed — > 1 we have 
™ 1 k “ 1 k ” 1 k 
— | Aj g(t) | dt — ~ | A; g(t) | dt = —| A; g(t) | dt, 
sil ral s lt 
which, as | Aj g(t) | is less than some bound B, is 


£6 
<B | © = Bloge. 
6 


Thus for any fixed £, lim | ; | A5 g(t) | dt = + «©, and Theorem 2 follows. 
£8 


+0 
It may also be noted that the total additivity of the set property of being of the 
first category allows us to combine the above results and state finally 
THEOREM 3. The set E C C of functions which at no point x satisfy any of the 
conditions G, or Ly (k = 1, 2, --- ) is of the second category in C, and its comple- 
ment is of the first category. 


INSTITUTE FOR ADVANCED Stupy AND YALE UNIVERSITY. 


5 Gergen, Convergence and summability criteria for Fourier series, Quarterly Journal of 
Mathematics, (Oxford Series), vol. 1 (1930), p. 252. 














ON LOCAL BETTI NUMBERS 
By H. E. Vauauan, Jr. 


1. Introduction. Several types of local Betti numbers have been introduced 
recently by Alexandroff' and by Cech.2 The local invariants introduced in this 
paper were discovered during an attempt to define edge and kernel points of a 
compact metric space. Incidentally, they give a direct generalization of the 
notion of the order, at a point, of a 1-dimensional set.’ 

Section 2 consists of a list of theorems, a knowledge of which is necessary in 
the later sections. In §3 the numbers 8‘(a, M), i = 0, are defined for each point 
a of a compact metric space M, and this definition is illustrated in §4 by exam- 
ples. In §5 are given several definitions of edge and kernel points which lead 
to simple necessary conditions that a compact metric space be imbeddable in 
the compact euclidean space of the same dimension. §6 is devoted to the deter- 
mination of the Borel class of the set of all points of M for which the numbers 
B*(a, M) satisfy certain inequalities. 

In §7 the numbers 8‘(a, M) are related to the local connectedness of the set 
M, and also to that of its complement when M is considered as a subset of a 
euclidean space. In order to extend these theorems, certain auxiliary theorems 
on the addition of irreducible membranes are required, and these are given in §8. 
Their immediate consequences are then developed in §9. 

There exist in the literature numerous characterizations of the plane, the 
closed 2-cell and 2-manifold. The majority of these are purely set-theoretic, 
excepting certain definitions of Whitney and van Kampen‘ which make use of 
mixed methods. We give below, in §10, a characterization of the 2-manifold 
in terms of the numbers 8‘(a, M). In §11 it is shown that a similar character- 
ization can be given for the closed 2-cell and, in fact, for any 2-dimensional set 
obtained from a 2-manifold by the omission of a finite number of open 2-cells. 
In §12 necessary and sufficient conditions are given that every point of a locally 
compact metric space have a neighborhood homeomorphic with a 2-cell, and 
these are applied to give characterizations of the open 2-cell (or euclidean plane) 
and of the class of cylinder-trees.5 The characterizations mentioned in this 
paragraph are of a purely combinatorial nature. 


Received June 17, 1935; presented to the American Mathematical Society, April 19, 1935. 

1 On local properties of closed sets, Annals of Mathematics, vol. 36 (1935), pp. 1-35. 

2 Sur les nombres de Betti locaux, Annals of Mathematics, vol. 35 (1934), pp. 678-701. 

3 Menger, Kurventheorie, p. 96. 

‘van Kampen, On some characterizations of 2-dimensional manifolds, this journal, vol. 1 
(1935), p. 87. 

5 Zippin, On continuous curves and the Jordan curve theorem, American Journal of Mathe- 
matics, vol. 52 (1930), pp. 331-350. 
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In §13 some properties of Alexandroff’s local Betti numbers are proved and 
an inequality is shown to exist between these and the local invariants of the 
present paper. 

Section 14 contains a list of unsolved problems. 

I wish to take this opportunity to express my indebtedness to Professor R. L. 
Wilder who has supervised this investigation and who has aided me constantly 
by giving many valuable suggestions. 


2. Theorems used in the succeeding sections. In this paper R* will always 
denote the compact n-dimensional euclidean space. 

Derinition. If A is any metric set and S is any system of open subsets of A, 
the S-regular part of A is the set of all points of A each of which is contained in 
arbitrarily small sets of the system S. The S-irregular part of A is the comple- 
ment with respect to A of the S-regular part. 

TuHeorEM A. For every metric set A and every system S of open subsets of 
A, the S-regular part of A is a G; in A, the S-irregular part of A an F, in A.® 

TuHeorEeM B. If M is a compact metric space and p‘(M) is finite, there exists 
an 7 > 0 such that every complete 7-cycle of diameter < » bounds on M. If 
M is a compact metric space which is locally j-connected, 0 < j S 7, then p‘(M) 
is finite.’ 

TueoreMm C. Let C be the sum and C*" the intersection of two closed sets 
of points, A and B,in R*. Then every k-cycle L‘, k < n — 1, of R* — C which 
bounds a chain L‘*' of R" — A and a chain L‘* of R= — B must also bound in 
R" — C provided the chains L‘*' and L‘*' may be so chosen that L4*! + Li* 
bounds in Rt — C*". This is true even for k = n — 1, unless C* is vacuous.’ 

TueoreM D. Let F’, F’’ be two closed subsets of R™ such that F’F”’ carries 
a complete r-cycle which fails to bound on F’F” but which bounds on F’ and 
on F’’. There exists an (m — r — 2)-cycle in R™ — (F’ + F’’) which bounds 
in R” — F’ and in R™ — F”, but not in R™ — (F’ + F”)! 

TuHeoreM E. Let F’, F” be two closed subsets of R™, and y"-"~ a cycle in 
R™ — (F’ + F”) which bounds in R™ — F’ and in R" — F” but not in R™ — 
(F’ + F”). Then F’F” carries a complete r-cycle which bounds on F’ and on 
F”’ but not on F’F’’8 

TuHeorem F. Let M be a compact metric space which is the irreducible car- 
rier of an essential complete m-cycle and K a closed subset of M such that 
p™-(K) =k. Then M — K has at most k + 1 components.” 

TueoreM G. A locally 0-connected compact metric space is homeomorphic 


® Menger, Kurventheorie, p. 103. 

7 Wilder, On locally connected spaces, this journal, vol. 1 (1935), pp. 543-555. 

8 Alexander, A proof and extension of the Jordan-Brouwer separation theorem, Trans- 
actions of the American Mathematical Society, vol. 23 (1922), p. 342. 

® Alexandroff, Untersuchungen wiber Gestalt und Lage abgeschlossener Mengen beliebiger 
Dimension, Annals of Mathematics, vol. 30 (1928), p. 178. 

10 Wilder, Domains and their boundaries in E,, Mathematische Annalen, vol. 109 (1933), 
p. 281. 
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with a 2-dimensional manifold if it contains irreducibly a 2-cycle and is sepa- 
rated by each simple closed curve of diameter less than 6 > 0.‘ 

TuHeorEeM H. A necessary and sufficient condition that a locally 0-connected, 
locally compact metric continuum be a cylinder-tree is that it be cut by every 
simple closed curve but by no are." 

THeorEeM I. A necessary and sufficient condition that a locally 0-connected, 
locally compact metric continuum S be a cylinder-tree is that it be cyclically 
connected and, if K is any simple closed curve of S, every point of K is a limit 
point of S — K and S — K is the sum of precisely two components.” 

THeEorEM J. A necessary and sufficient condition that a locally 0-connected, 
locally compact cyclically connected metric continuum be homeomorphic with 
a subset of a spherical surface is that it do not contain a primitive skew curve. 


3. Definition of 8‘(a, M). Let M be a compact metric space, a a point of 
M, and k = dim, M. There exist arbitrarily small neighborhoods of a whose 
boundaries are (k — 1)-dimensional compact metric spaces. For every non- 
negative integer 7 and real number ¢ > 0, let 8‘ (a, M) be the smallest integer b 
such that there exists a neighborhood G of a such that 6(@) < «, dim(G — G) = 
k — 1, and“ p(G@ — G) = b. Then Bi(a, M) is defined for every « > 0 and, 
as ¢ approaches zero, is a monotone, non-decreasing function. Consequently it 
approaches a limit, which is a non-negative integer or ~. In case the limit is 
finite it is denoted by B‘(a, M). If the limit is infinite, two cases arise: (1) 
Bi(a, M) is finite for all « > 0, in which case B‘(a, M) = w; (2) for sufficiently 
small values of ¢, 8i(a, M) = «, in which case B‘(a, M) = &). In the definition 
of 6°(a, M) a 0-cycle is defined as an even number of points. The coefficient 
domain is, of course, arbitrary, but in the present paper it will always be as- 
sumed finite, i.e., mod m = 2, for reasons of convergence. 

Remarks. I. From the fundamental properties of complete i-cycles it follows 
that Bi(a, M) = Ofori > k — 1. 

II. It is immediately evident from the definition that B*(a, M) is a local 
topological invariant of M and, in particular, is independent of any space in 
which M may be considered to be imbedded. 

III. Although the hypothesis that the 8‘(a, M) have definite values makes it 
possible to choose, for each value of 7, an arbitrarily small neighborhood G satis- 
fying the conditions dim(G — G) = k — 1, p‘(G — G) = B*(a, M), if this number 
is finite, or p‘(G — G) finite if B‘(a, M) = «, it is not in general possible, as can 
be shown by examples, to choose a single neighborhood G which satisfies these 


1 Zippin, loc. cit., p. 341. 

12 Zippin, loc. cit., p. 348. 

13 Claytor, Topological immersion of peanian continua in a spherical surface, Annals of 
Mathematics, vol. 35 (1934), p. 832, and Zippin, On semi-compact spaces, American Journal 
of Mathematics, vol. 57 (1935), p. 339. 

4 Vietoris, Uber den héheren Zusammenhang kompakter Raume und eine Klasse von 
z enhangstreuen Abbildungen, Mathematische Annalen, vol. 97 (1926), pp. 454-472. 
When we speak of complete cycles we refer to the Vietoris ‘‘“Fundamentalfolgen’”’. 
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conditions for all values of 7. Of course, by a proper modification of the def- 
inition, this could be done, but as yet it has not appeared desirable to add this 
extra complication. 

IV. If the set M is considered as lying in a second space, R*, for instance, the 
definition of B‘(a, M) can be given in terms of neighborhoods of a in this space. 
It has not, however, been shown, and indeed seems unlikely, that similar topo- 
logical invariants can be defined in terms of any particular class of neighborhoods 
in the imbedding space. This is due to the impossibility, in general, of extend- 
ing a topological transformation. It can be easily shown that definition in 
terms of spherical neighborhoods would not give topological invariants. In this 
connection it is to be noted that while the local invariants introduced by Alexan- 
droff are defined in terms of spherical neighborhoods, a double limit process is 
required. 

4. Examples. 1. For any point a of a compact metric space such that 
dim,.M = 1, we have 6*(a, M) = ord,M — 1, if the order is finite, B°(a, M) = 
ord, M, if the order is w or No, and B°(a, M) = Nb, if the order is No or c. 
Bi(a, M)= 0, if i > 0. 

2. For any point a of an n-dimensional (combinatorial) manifold, or interior 
point of an n-cell, B‘(a, M) = 0, (0 Si <n — 1), andp""(a, M) = 1. Forany 
boundary point a of a (closed) n-cell B‘(a, M) = 0,7 = 0. 

3. Let M be a set consisting of n 2-cells with a common edge. If a is a point 
of this edge, then 6°(a, M) = 0, B'\(a, M) = 0 or n — 1 according as a is or is not 
an end point, and 8‘(a, M) = 0, if i > 1. 

4. Let M be a set consisting of n 2-cells having only an interior point a in 
common. Then #*(a, M) = n — 1, (a, M) = nand Bi(a, M) = 0, if i > 1. 


5. Definition of edge point and kernel point. The usual definitions of 
boundary point and interior point of a set M imbedded in R” are as follows. 
The point a is a boundary point of M if every sphere with center at a contains 
a point of M anda point not of M. The point a is an interior point of M if there 
exists a sphere with center at a which is entirely contained in M. These def- 
initions are taken to define edge points and kernel points respectively in the case 
of an n-dimensional closed subset of R", and the purpose of this section is to give 
them an invariant formulation in terms of the local Betti numbers. 

First, if M is an n-dimensional closed subset of R", n # 0, and B""'(a, M) = 0, 
then a is a boundary point of M. For, since M is not 0-dimensional, a is a limit 
point of M, while inside any sphere with center at a there is a neighborhood (in 
R") whose boundary intersects M in a set whose (n — 1)-th Betti number is zero 
and hence is a proper subset of this boundary. Consequently, interior to the 
sphere there is a point of this boundary not belonging to M. This proves the 
statement. 

Conversely, if a is a boundary point of M, 8"-"'(a, M) = 0. For there exists an 
arbitrarily small sphere having a as center whose boundary is not contained in M. 

These remarks lead to the following definitions, which will be further justified 
later. 
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DerriniTion. The point a is called a k-edge point of the compact metric space 
M if dim,M = k and p*"(a, M) = 0. 

Derinition. The point a is called a k-kernel point of the compact metric 
space M if dim,.M = k and 8*"(a, M) > 0. 

DeriniTion. The point a is called an ordinary k-kernel point of the compact 
metric space M if dim,M = k and p*"(a, M) = 1. 

Derinition. The point a is called a regular k-kernel point of the compact 
metric space M if dim, M = k and B‘(a, M) = 0,0 Si< k—1, 6*"(a, M) = 1. 

The preceding discussion shows that a necessary condition that it be possible 
to imbed a compact metric space M in R" is that 8"-\(a, M) < 1 for every point 
a of M, while for every point such that 8""(a, M) = 1 it is necessary that 
Bi(a,M) = 0,0 Si<n-—1. This may be restated in the following 

THEOREM 1. A necessary condition that a compact n-dimensional metric space 
M be imbeddable in R” is that every point a satisfying dim,M = n be either an 
n-edge point or a regular n-kernel point. 

The above condition is naturally not sufficient. In fact, it is not even suf- 
ficient for “local imbeddability”’, as may be shown by the example of a sphere 
with infinitely many “handles”. In this case a point a exists such that no neigh- 
borhood of a can be imbedded in R?. Moreover, it is possible to construct a 
2-dimensional set, every point of which is a regular 2-kernel point but which 
contains no open subset which can be imbedded in R?. 

Several other definitions of kernel points and edge points have been given, 
including two by Alexandroff."" By a result of his it follows that every n- 
dimensional set contains an n-kernel point as defined above. 


6. Closure properties of certain sets. It is possible to apply Theorem A 
to the solution of this problem. To do so, let S, be the class of all open subsets 
G of M such that dim(G — G) < k — 1, let S2 be the class of all open subsets G 
of M such that dim(G — G) < k — 1, p'(G — G) S p, let Ss be the class of all 
open subsets G of M such that dim(G — G) s k — 1, p(G — G) finite. Using 
these for S in Theorem A and letting n = dim M,k S n, the following results 
are obtained."® 

1. The set of points a of M for which 


dim.M < kisaG; 


>k F, 
<k G; 
2k F, 
= k Gs, Gio, Fy 
=n F,. 


(These relations, due to Menger, are well known.) 


8 See footnote 1, p. 27, and Dimensionstheorie, Mathematische Annalen, vol. 106 (1932), 
pp. 161-238. 

16 If A represents a class of sets, the symbol A, represents the class consisting of those 
sets which may be obtained as the difference of two sets of the class A. See Menger, 
Kurventheorie, p. 105. 








122 H. E. VAUGHAN, JR. 


2. Those points a of the S.-regular part of M for which dim,M = k have 
Bi(a, M) S p(< p + 1), and the S,-regular part of M contains only points a 
such that dim,.M < k. Those points a of the S:-irregular part of M for which 
dim.M = k have B‘(a, M) > p( 2 p+ 1). 

3. Those points a of the S;-regular part of M for which dim,M = & have 
B'(a, M) finite or w, and the S;-regular part of M contains only points a such that 
dim.M < k. Those points a of the S;-irregular part of M for which dim,M = k 
have B‘(a, M) = ®o. 

From these remarks several results may be deduced, of which the following 
are the more important. 

The set of points a of M such that dim,M = k and 


Bi(a, M) S & form a G, 


= N Gio 
zw Gin — Gigs 
=> W Gs, —_ G50 
Sw Gi, 
<a Gipe 
> Pp Gop 
= Pp Grp 
= p Gio 
=< P Gs, 
<. P Gs, ° 


THEOREM 2. The set of k-edge points of a compact metric space is a G;,. 

THEOREM 3. The set of k-kernel points of a compact metric space is a G5». 

TuHeoreM 4. The set of ordinary k-kernel points of a compact metric space is 
a Gispo- 


7. Local connectedness. In the examples of §4 we have seen that the 
local Betti numbers give a measure of the ramification of the compact metric 
space M. In the present section we give some theorems relating the numbers 
B*(a, M) to the local i-connectedness properties of M, and, in case M is imbedded 
in R", to the uniform local i-connectedness of R" — M. 

Derrnition. If a is a point of the compact metric space M such that to 
every « > 0 there corresponds a 6 > 0 such that every complete i-cycle carried 
by S(a, 5) bounds on S(a, e), then M is said to be locally i-connected at the point a. 
If M is locally i-connected at each of its points it is said to be locally i-connected. 

The following theorem shows the relation between local 7-connectedness and 
the local Betti numbers. 


— 


TueoreM 5. Let M be a compact metric space and aa point of M such that 
B'(a, M) is finite or w. Suppose further that one of the following three conditions 
is satisfied. 

1. p*(M) is finite. 

2. There exists a real number n > 0 such that every complete i-cycle carried by 
S(a, n) bounds on M. 
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3. p(a, M) = 0." 

Then M is locally i-connected at the point a. 

Proof. By theorem B, condition 1 implies condition 2. Consequently it is 
sufficient to prove the theorem for each of the conditions 2 and 3. The proof of 
the first case follows the lines of the proof of Lemma 3‘ of the paper cited in foot- 
note 7 and will not be reproduced here. For the second case, suppose condition 
3 is satisfied. Let « > 0 be arbitrarily given. There exists an 7 > 0 such that 
every complete i-cycle on S(a, «) mod [M — S(a, «)] bounds on M mod 
[M — S(a, n)]. Let G be a neighborhood of a of diameter < 7 and satisfying 
the conditions dim(G — G) = dim.M — 1, p(G — G) = m, finite. The 
proof then proceeds as in the preceding case. 

Remark. The preceding theorem is true even in the case B‘(a, M) = Nb, if 
the neighborhood G may be chosen so that its diameter is < ¢« and < 7 and 
such that every complete i-cycle on G — G bounds in S(a, «). 

As a corollary to the preceding theorem, we have the following well known 
result. 

Coro.tiary. If M is a compact metric continuum and a is a point of M such 
that ord,M is finite or w, then M is locally 0-connected at a. 

Derinition. A domain D of the compact euclidean space R” is called uni- 
formly locally i-connected (u.1.i-c.) if, for every « > 0, there exists a 5 > 0 such 
that every i-cycle in D of diameter < 6 bounds a chain in D of diameter < «. 

THEOREM 6. Let K be a closed subset of R", D a domain of R" — K such that 
the following conditions are satisfied. 

1. If aisa point of D — D, then 8"-*(a, K) < 1; 

2. If K cuts R” locally at a, one of the local domains is a subset of a domain 
D, of Rn — K distinct from D and such that D + D, is a subset of a domain com- 
plementary to some relative neighborhood of a in K. 

Then D is uniformly locally 0-connected. 

Proof. Suppose D is not u.l.0-c. There exists a point a of D — D and an 
e > 0 such that, for every ¢ > 0, S(a, c)D contains a 0-cycle which fails to 
bound in S(a, «)D. Let G be a relative neighborhood of a with respect to K 
which satisfies 2, is contained in S(a, ¢) and is such that p""*(G — G) < 1. Let 
o« > 0 be chosen so that S(a, c)K is contained in G. Let x} + x} be a 0-cycle 
of S(a, c)D which fails to bound in S(a, =D. Let y{ be a point of S(a, «)D. 
Then there exist chains L} and L} in S(a, c) such that L} — 2} + y?, L} — 
zy +y! in R* — (K + F(a, e) — G). Let L = Li} + L}. Then L' > 
xz! +2}. Since D is connected, there exists a chain L} in D such that 
Li 2} +2} in R" — G. Then L' + L} is a 1-cycle in R" — (G — G). If 





17 See footnote 1, p. 2. 

18 For simplicity the proofs of the following theorems are stated in terms of mod 2 topol- 
ogy, i.e., the coefficient domain consists of the integers mod 2. They may be modified to 
hold for any finite coefficient domain. Compare with proofs in R. L. Wilder’s paper, A 
converse of the Jordan-Brouwer separation theorem in three dimensions, Transactions of 
the American Mathematical Society, vol. 32 (1930), p. 635. 
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L' + L} ~ Oin R" — (G — G@, it follows from Theorem C that x? + 2} ~ 0 
in Rn — (K + F(a, e)) and, consequently, in S(a, «)D, a contradiction. 

From 2 there exists a chain L} in R" — G such that L} — 2} + y!. Then 
Li + L} is a 1-cyele linking G — G. For if not, by virtue of Theorem C, 
x} + y! would bound in R* — K. 

Similarly, Lj + L; + L} links G — G. Hence, by 1, Li + L} ~ Li + 
L} + L} in Rn — (@G—G) orl) + Li} +L} ~ 0. This has been shown to 
lead to a contradiction. 

TueoreM 7. Let K be a closed subset of R", D a domain of R" — K such that 
the following conditions are satisfied, where r is any fixed integer, 1 S r S n — 2. 

1. If ais any point of D — D, then 8"-(a, K) = 0. 

2. If a is any point of D — D, there exists a relative neighborhood G’ of a 
and a real number ¢ > 0 such that any r-cycle in S(a, ¢)D bounds in the comple- 
ment of G’. 

Then D is uniformly locally r-connected. 

Proof. Suppose D is not u.l.r-c. There exists a point a of D — D and an 
e > 0 such that, for every ¢ > 0, S(a, ¢)D contains an r-cycle which fails to 
bound in S(a, «)D. Let G be a relative neighborhood of a contained in S(a, e) 
and in the G’ of 2, and such that p»-"--(G — G) = 0. Let o > 0 be chosen so 
that S(a, o)K is contained in G, and less than the o of 2. Let y’ be an r-cycle 
of S(a,c)D. There exists a chain L{*' in S(a, o) such that L{*! 3 y* in 
R" — (K + F(a, «) — G) and a chain L3*' in R" — G’ such that L3*' 4 y" 
in Rn — G. Then L{*! + L3*" is an (r + 1)-cycle of Rn — (G— G) and, by 
hypothesis, bounds there. Consequently y’ bounds in Rn — (K + F(a, e)), 
a contradiction. 


8. Addition theorems. In this section we interpolate some general addition 
theorems which are necessary to the further development of our investigation. 

DeriniTion. A compact metric space K is said to be an irreducible membrane 
with respect to a complete (n — 1)-cycle y""! ify"! ~ 0 on K but on no proper 
closed subset of K.* 

Derinition. An n-dimensional compact metric space M is said to be an 
n-dimensional closed cantorian manifold if p"(M) > 0 while, if M’ is any proper 
closed subset of M, p"(M’) = 0. It is said to be regularly closed if p"(M) = 1.8 

TuHeoreM 8. Let J be an (n — 1)-dimensional regularly closed cantorian mani- 
fold, K, and K, two n-dimensional irreducible membranes with respect to the essen- 
tial complete (n — 1)-cycle carried by J such that K\K, = J and p"(K;) = 0, 
7=1,2. Then K, + Ke is an n-dimensional regularly closed cantorian manifold. 

Proof. We may suppose K,; + Kz imbedded in the compact euclidean space 
R?+1_ It is then sufficient to show that 

1. K, + Ke is linked by an n-cycle in R*"*' — (K, + Ko), 

2. No proper subset of K; + Kz has this property, 

3. The n-cycle in condition 1 is unique. 

Proof of 1. We apply Theorem D, setting K, = F’, K, = F’’,2n + 1 =m, 
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n—1z=r. ThenJ = F’F”, and carries an (n — 1)-cycle which bounds on 
K, and on Ky but not on J, and hence p"(R®"*! — (Ki + Kz2)) > 0. 

Proof of 2. We apply Theorem E. Let S be any proper closed subset of 
K, + Ke and set SK, = F’, SK. = FF”, 2n+1=m,n—12= pr. Then 
m—r— 2 =n, and every n-cycle of R®*"*' — S bounds in R***+' — SK, and in 

n+l _ SKo, since p"(SK;) = p"(K;) = 0,7 = 1, 2, and, consequently, every 
such cycle bounds in R?"*' — S unless SK,K, = SJ carries an (n — 1)-cycle 
which fails to bound on SJ but which bounds on SK;, i = 1, 2. In order that 
SJ carry a non-bounding (n — 1)-cycle, it is necessary that SJ = J. In order 
that such a cycle bound on SK;,, it is necessary that SK; = K,. Since Sisa 
proper subset of AK, + Ke, these conditions cannot both be satisfied, and the 
proof that K, + Ke is a closed cantorian manifold is complete. That 
K, + Ke is regularly closed follows from an addition theorem due to Mayer." 

In some cases in which the conditions p"(K;) = 0 are not known to be satis- 
fied, the following corollary is useful. 

Corotiary. Let J, K, and Kz satisfy the hypotheses of the preceding theorem 
except that p"(K,) is not required to be zero. Then K, + Kez is the irreducible 
carrier of an essential complete n-cycle. 

Proof. This proof is essentially the same as that of the theorem. It is only 
necessary to make use of the fact that the linking cycle given by Theorem D 
bounds in the complement of K; and hence, part 1 being as before, in part 2 this 
cycle bounds in the complement of SK;. This leads as before to a contradiction 
unless S = K, + Ke. 

The preceding theorem may be generalized as follows. 

TuHeoreM 9. Let J be the carrier of a complete (n — 1)-cycle which fails to 
bound on J, and let K, and Kz be two n-dimensional irreducible membranes with 
respect to this cycle such that KiK, = J, p"(K,;) = 0,7 = 1, 2. Furthermore, 
suppose that K, and Kz are irreducible membranes with respect to any complete 
(n — 1)-cycle carried by J which fails to bound on J but which bounds on K, and 
on Ks. Then K, + Ke is an n-dimensional closed cantorian manifold. Also, 
p"(K, + Ke) is the number of (n — 1)-cycles of the type described. 

Proof. The proof follows the same lines as that of the preceding theorem. 
That of part 1 may be used as it stands. In the proof of part 2, there is the 
alternative that SJ may contain an (n — 1)-cycle which fails to bound on SJ 
but which bounds on J, on SK; and on SK». In this case the argument of 
part 1 shows that J + SK,, which is contained in K,, carries a non-bounding 
n-cycle. This contradicts the assumption that K, is n-dimensional and that 
p"(K,) = 0. The last part of the theorem follows as before.” 

The corollary of Theorem 8 can be extended in the case of the preceding 
theorem. 


19 Monatshefte fiir Mathematik und Physik, vol. 36 (1929), p. 40. See also Whyburn, 
Cyclic elements of higher orders, American Journal of Mathematics, vol. 36 (1934), p. 136, 
footnote. 
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Combining the above theorem with one due to Alexandroff,” we get 

TueoreM 10. The necessary and sufficient condition that the compact metric 
space M be an n-dimensional closed cantorian manifold is as follows. 

1. M = Ki + Ke with Kj, i = 1, 2, n-dimensional compact metric spaces such 
that p"(K;) = 0. 

2. Every complete (n — 1)-cycle carried by K,Ke which fails to bound on K,Kz 
but which bounds on K, and on Kz has these sets as irreducible membranes. 

3. At least one such cycle as described in 2 exists. 

The lower dimensional connectivities of a closed cantorian manifold con- 
sidered as the sum of two irreducible membranes may be found by applying the 
Mayer addition theorem.'* The following special case may also be proved 
by the use of Theorem C: 

THEOREM 11. Jf, in addition to the hypotheses of Theorem 9, p"-"-'(KiK:) = 0 
and p”"*(K;) = 0,7 = 1, 2, then p" "(Ki + Ke) = 0. 

Proof. To apply Theorem C, let K; = A, Ke = B,2n+1l=m,n+r=k. 
Then p*™*"*(R™*1 oa K,K:2) on p” "(Ki Ke) wit 0, p"**(Res* es K;) an p”*(K,) = 0, 
and the statement follows. 


9. Application of addition theorems. Using Theorem F it is possible to 
obtain the following extremely useful result. 

THEOREM 12. Let M be an n-dimensional compact metric space which is the 
irreducible carrier of a complete n-cycle which fails to bound on M. If ais a point 
of M such that B"-'(a, M) is finite or w, then M is locally 0-connected at a. 

Proof. This follows from the fact that arbitrarily small neighborhoods of a 
may be chosen whose boundaries have finite (n — 1)-dimensional Betti num- 
bers, and consequently, by Theorem F, separate M into a finite number of 
components. That component of such a separation which contains the point 
a has a diameter at most equal to that of the neighborhood whose boundary 
determines the separation and is itself a connected neighborhood of a. Conse- 
quently a has arbitrarily small connected neighborhoods and is a point of local 
0-connectedness of M. 

The question now arises as to whether or not the local condition in the hy- 
pothesis of the preceding theorem is sufficient to insure the same conclusion for 
other classes of compact metric spaces. The following theorem answers this in 
the affirmative, making use of the addition theorems already developed. 

THEOREM 13. Let M be an n-dimensional compact metric space with p"(M) = 0, 
J a locally 0-connected closed subset of M which carries a complete (n — 1)-cycle 
which fails to bound on J but with respect to which M is an irreducible membrane, 
and such that every complete (n — 1)-cycle carried by J which fails to bound on J 
but bounds on M has M for an irreducible membrane. Suppose further that if a 
is any point of M, B"-*(a, M) is finite or w. Then M is locally 0-connected at each 
of its points. 

Proof. M is locally 0-connected at all points of M — J. Let M’ be a set 


20 See footnote 9, p. 186. 
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homeomorphic to M and so situated that MM’ = J. Then by Theorem 9 
M + M’ is a closed cantorian manifold, at every point a of which, except 
possibly those of J, 8"""(a, M + M’) is finite orw. From the previous theorem 
it follows that M + M’ is locally 0-connected at each such point and the same 
is true of M itself. 

The space M is locally 0-connected at each point of the set J. Suppose that 
a is a point of J at which M is not locally 0-connected. There exists an e > 0 
such that any neighborhood of a of diameter less than ¢ has an infinite number 
of components. Let 6 > 0 be chosen corresponding to ¢« with respect to the 
local 0-connectedness of J at the point a. Let G be a neighborhood of a con- 
tained in S(a, 6) such that dim(G — G) = n — land p™"(G — G) = m, finite. 
There exists an infinite sequence, (g;), of components of G. At least m + 1 
of these have no limit points (and hence no points) on GJ. For, if all but a 
finite number had such points, they might be added to the component of S(a, ¢)J 
determined by a, and it would follow that any two points of M sufficiently near 
to a would belong to a connected subset of S(a, «) and M would be locally 
0-connected at a. Now let g be one of these m + l components. AI of its limit 
points in G belong to it, and none of its limit points in G@ — G belong toJ. More- 
over, no point of g is a limit point of M — g, since such a point would lie in 
G — J and hence in M — J and be a point of non-local 0-connectedness of M 
in M — J. Consequently (G — G@)g separates g from M. If the set M’ of 
the preceding paragraph is again added to M, it follows that g is separated by 
the same set from M + M’ since, having no limit points on J = MM’, g can 
have none on M’. By Theorem F it follows that® p»"(@ — G) = m + 1. 

The condition that J be itself locally 0-connected is necessary, as is shown 
by the following example. Let M be the compact plane set whose boundary, 
taken as J, consists of the following three parts: (1) the curve y = sin 1/2, 
0 <x S 1/z, (2) the segment x = 0, —1 S y S 1, (3) the are (x — 1/27)? + 
(y — 3)* = 1/49, y S — 3. 


10. Characterization of the 2-manifold. 

TuHeoreM 14. Let M be an n-dimensional compact metric space such that 
p"(M) = m > 0, while if M’ is any proper closed subset of M, p"(M') < m. Let 
a be a point of M. There exists a positive integer k < m such that, if G is any 
sufficiently small neighborhood of a, p"(M — G) = m — k, and B""(a, M) 2 k. 

Proof. The existence of the number k follows from the fact that, as the 
diameter of G decreases, p"(M — G) increases, or remains constant, but never 
exceeds the value m — 1. That 8""'(a, M) = k follows from the Mayer addi- 
tion theorem," since G — G must carry at least k complete (n — 1)-cycles which 
fail to bound on G — G. 

Corotiary. Let M be an n-dimensional closed cantorian manifold such that, 
for every point a of M, B""(a, M) <= 1. Then M is regularly closed and locally 
0-connected. 


21 See footnote 9, p. 153. 
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Proof. If p"(M) = m, for every point a of M,k = m. But this implies, by 
the preceding theorem, that 8""'(a, M) = m, and consequently, m = 1. The 
local 0-connectedness of M follows from Theorem 12. 

Coro.tuary. Let M be an n-dimensional closed cantorian manifold imbedded 
in R"*! and such that, for every point a of M, B""(a, M) S 1. Then M separates 
R"* into exactly two uniformly locally 0-connected complementary domains, of 
which it is the common boundary. 

Proof. It follows from the preceding corollary and Theorem 6. 

Corotuary. Let M be a 2-dimensional locally 1-connected closed cantorian 
manifold imbedded in R* and such that, for each point a of M, B\(a,M) <= 1. Then 
M is a 2-dimensional combinatorial manifold. 

Proof. This follows from the preceding corollary and a theorem due to 
Wilder.” 

The following theorem shows, as might be expected, that the restriction in 
the preceding corollary that M be imbedded in R* is unnecessary. 

PrincipaL THEeoreM A. Let M be a 2-dimensional closed cantorian manifold, 
such that, if a is any point of M, B'(a, M) S 1. Then M is a 2-dimensional com- 
binatorial manifold. 

Proof. As immediate consequences of the hypotheses and of Theorem 12, 
it follows that M is locally 0-connected and that, for every point a of M, 
B\(a, M) = 1. By Theorem G it is sufficient to show the existence of a real 
number 6 > 0 such that every simple closed curve of M of diameter < 6 cuts M. 
Assuming that this is false, there exists a point a of M such that, for every real 
number 6 > 0, S(a, 6) contains a simple closed curve which fails to cut M. 

Let «€ > 0 be arbitrarily chosen. Then, since M is locally 1-connected, 
6 > 0 may be chosen in such a manner that every complete 1-cycle carried by 
S(a, 6) bounds on S(a, €). By hypothesis, S(a, 6) contains a simple closed curve 
J which fails to cut M. The essential complete 1-cycle carried by J bounds in 
S(a, «) and there exists a subset K of S(a, €) which is an irreducible membrane 
with respect to this cycle. 

Let p’ be a point of K — J, ra point of M — K. Since M — J isa connected 
open subset of the Peano continuum M, there is an are a’, with end points p’ 
andr,in M — J. Let p be the first point of K, consequently a point of K — J, 
on a’ in the direction from r to p’. Let a denote the subare pr of a’. 

Let » > 0 be so chosen that » < 3p(p, J + r). Let G be a neighborhood of 
p such that (1) G C S(p, 7»), (2) dim(G — G) = 1, (3) p(@ — G) = 1. Let 
y? = (yi, ---,¥2,--- ) be an essential complete 2-cycle carried by M, y? being 
an ¢,-cycle with lim «, = 0. By an e,-transformation of those vertices of y? 


which are within a distance «, of G — G, we may insure that each cell of y? 
either has all of its vertices on G, or none of its vertices on G. Let 1? denote the 
subcomplex of y? composed of all cells of the former class. The boundary of 
I? is then an ¢,-cycle i} on G—G. By making a proper choice of a subsequence 
of the cycles 7}, it is possible to obtain a complete l-cycle i! = (ij, --- , i}, --+ ). 


22 See footnote 10, p. 306. 
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Since M is an irreducible carrier of the complete 2-cycle y2, it follows that G is 
an irreducible membrane with respect to the complete cycle 7’. 

Let j' = (ji, «++ Jn) +++ ) be an essential complete 1-cycle on J, where j} 
is an ¢,-cycle, and let k? be an e¢,-chain realizing the homology j! ~ 0 irre- 


ducibly on K. As before, by an ¢,-transformation of the vertices of k?, it is 
possible to insure that each cell of k? either has all of its vertices on G or none of 
its vertices on G. Let k?2 denote the subcomplex of k? consisting of all cells of 
the former class. The boundary of k? is then an ¢,-cycle 7} on G— G. We 
again suppose a proper subsequence of the cycles 7} to be chosen in such a way 
as to form a complete l-cycle 7! = (7, --- , 73, --- ). 

From (2) it follows that there exists an ¢,-complex, m?, on G — G such that 
m? +i! + i} onG—G. Moreover, y? + k? i! on M, I? + m? >i! on G. 
Since a — pis on the carrier of y? but not on the carrier of k?, it is a part of the 
carrier of y? + k?. Then y? + k? + 1? + m? 0, and this cycle is carried by 
M — (a — p)G, a proper closed subset of M. Since this cycle differs from the 
non-bounding complete 2-cycle y? only in a small neighborhood of a, it is also 
a non-bounding complete 2-cycle. This, however, contradicts the fact that VM is 
a 2-dimensional closed cantorian manifold. This proves the theorem. 

Coro.uary. In the hypothesis of the preceding theorem the condition that M be 
locally 1-connected may be replaced by any one of the conditions 1, 2 and 3 of Theo- 
rem 5 (fori = 1). 

Proof. Since 6'(a, M) is required to be not greater than 1 for every point 
a of M, the hypothesis of Theorem 5 is satisfied and M is locally 1-connected at 
each of its points. 

Principal Theorem A can be stated in several ways. The following statement 
brings out some points of interest. 

THEOREM 15. Let M be a compact metric space satisfying the following con- 
ditions: 

1. dim M = 2, 

2. p?(M) > 0, but, if M’ is any proper closed subset of M, p?(M') = 0, 

3. p'(M) is finite, 

4. if ais any point of M, B(a, M) S 1. 

Then M is a 2-dimensional combinatorial manifold. 

Thus we begin with the point set notion of a 2-dimensional compact metric 
space, and by subjecting it to certain combinatorial conditions obtain the class 
of 2-dimensional combinatorial manifolds. Moreover, the only local restric- 
tion, except for the dimension, is that supplied by the number 6'(a, M). It is 
to be noted that if we wish to characterize any particular type of manifold, such 
as the sphere, we need only require p'(M) to have some particular value, in this 
case zero, and, in some cases, also require orientability or non-orientability.” 


11. Characterization of the closed 2-cell. 
PrincipaAL THEOREM B. Let M be a 2-dimensional compact metric space 
with p?(M) = p'(M) = 0, J a simple closed curve contained in M and such that 


23 Veblen, Analysis Situs, 2nd ed., p. 50. 
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M is an irreducible membrane with respect to some essential complete 1-cycle carried 
by J. Suppose also that, if a is a point of M, B'(a, M) S 1, while, in particular, 
if aisa point of J, B\(a, M) = 0. Then M isa closed 2-cell. 

Proof. The condition that M be locally 1-connected, which was necessary 
in the hypothesis of Principal Theorem A, is here replaced by the stronger and 
certainly necessary condition p'(M) = 0. 

Since M is an irreducible membrane with respect to an essential complete 
l-cycle carried by J, it is an irreducible membrane with respect to any such 
cycle, since all of them are homologous on J. 

Let C be a 2-cell bounded by J and such that MC = J. From Theorem 8 it 
follows that M + C is a 2-dimensional closed cantorian manifold, while from 
Theorem 11 it follows that p'(M + C) = 0. Moreover, M + C is evidently 
locally 1-connected, and, if a is any point of M + C — J, B(a, M + C) = 1. 
If this equality can be shown to hold for each point of J, it will follow from 
Principal Theorem A that M + C is a 2-sphere and, consequently, that M is 
a closed 2-cell. 

By Theorem 13, M is locally 0-connected at all points so that, if a is a point 
of J and e > 0, there exists a connected neighborhood G of a such that 6(G) < «, 
dim(G — G) = 1 and p(G — G) = 0. (All neighborhoods are with respect 
to M.) 

It is first necessary to show that no point of J is a local cut point of M. Todo 
this, suppose that a is a point of J which is a local cut point of M. Let G, bea 
connected neighborhood of a such that dim(G, — G,) = 1, p\(G; — G,) = 0 and 
G, — ais not connected. Let G, be a neighborhood of a contained in G, having 
the same properties and also being sufficiently small so that JG; is contained in the 
component of JG; determined by a. It follows that at most two components of 
G, — a have points in common with the set (G. — a)J. Also, if a component of 
G, — ahas, as its only limit point on GJ, the point a, the portion of this compo- 
nent in G, is separated from the closed cantorian manifold M + C by a set con- 
sisting of the point a and a subset of the boundary of G2 which has no point on J. 
This subset must then have a positive first Betti number.** This contradicts the 
hypothesis that dim(G. — Gz) = 1 and p\(Gz — Gz) = 0. Since G, is connected, 
every component of G; — a must either contain points of (G. — a)J or have a 
as its only limit point. The preceding analysis therefore shows that G; — a 
has exactly two components determined by the two arcs of Gz.J — a. The 
closures of these components will be denoted by A: and A. 

As the essential complete 1-cycle on J, we may take a cycle y = (yi, ---, 
Yn) +++ ) Where 7, is an ¢,-cycle, €, — 0, whose vertices are arranged in a definite 
cyclic order on J and include the point a. Let C, be an e,-chain on M bounded 
by yn. Make an e,-deformation of the vertices of C, so that any cell of C,, 
either has all its vertices on A; or none of its vertices on A,G;. This deformation 
may be carried out so as not to affect vertices of y, near a. Let C, be the sub- 
chain of C, consisting of all cells of the latter whose vertices are on A;. The 


24 See footnote 21. 
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boundary of C,, consists of a cycle which is the sum of two chains, one on G, — Gi, 
the otheronJ. The carrier of the latter must contain a subarc of J ending at a, 
and a carries a boundary vertex of this chain. However, a cannot carry a 
boundary vertex of the other chain since this chain is at a positive distance 
from a. Consequently the sum of the two chains cannot be a cycle and we 
reach a contradiction. Therefore a is not a local cut point of M. 

Since this is true, it follows that, if ¢ > 0, S(a, c) contains an are a in 
S(a, ¢) — a joining two points p and q of the component of JS(a, e) determined 
by a which are separated in this component by the point a. Let « > 0 be 
chosen arbitrarily and let the above o be so chosen that, if 8 is the are paq, 
any complete 1-cycle carried by a + 8 bounds in S(a, €). That this is possible 
follows from the local 1-connectedness of M. Now suppose a + £8 does not 
separate M. It is possible to repeat, word for word, the steps in the proof of 
Theorem 14, replacing J in that theorem by a + 8. 

So far we have (M + C) — (a + 8) separated, (C being a 2-cell bounded by 
J, MC = J), one of the components, say M,, being such that its closure is a 
subset of M of diameter less than e, which is an irreducible membrane with re- 
spect to an (any) essential complete l-cycle on a + 8. Since a + 8 separates 
M;, from the remainder of M + C, which contains the are J — 8, the only limit 
points of M; on J are points of 8. By Theorem F, (M + C) — (J + a) has, 
from the connectivity of J, at most three components. One of these is C, 
another M,, and the third, M2, is the remainder of M. 

Let (yn), (v3), (v2) be essential complete 1-cycles on J, a + 8, and (J — 8) + a@ 
= J, respectively, and such that y, = y; + 72. 

yn bounds irreducibly on M, y} bounds irreducibly on M;. Consequently, 
their difference, y?, bounds on M. Let K be an irreducible membrane with 
respect to y2 contained in M. Since M is an irreducible membrane with respect 
to 7», it follows that K must contain M2, since K is closed and since the carrier 
of the homology y! ~ 0 is M,. 

If a does not separate M, then M, contains interior points of the are 8. Let 
r be one such point. Let C’ be a 2-cell bounded by J; and such that MC’ = J. 
By Theorem 8, K + C’ is a closed cantorian manifold, the hypothesis p*(K) = 0 
of the theorem being satisfied, since M, which contains K, is 2-dimensional 
and p?(M) = 0. 

Let G be any neighborhood of r in M + C’ so small that GC’ = 0, and such 
that dim(G — G) = 1. Then Gis a neighborhood of r in M and KG is a neigh- 
borhood of rin K + C’. Consequently, p\(KG — KG) > Oand p\(G — G) > 0. 
But this contradicts the hypothesis that 6'\(r, M) = 0. Therefore M — a = 
M, + (8 — p — q) + M2. 

It follows that if a is a point of J, there exist arbitrarily small neighborhoods 
of a in M whose boundaries are ares having both end points on J. Each of 
these may be extended to a neighborhood of a in M + C whose boundary is a 
simple closed curve. Consequently Bi(a, M + C) Ss 1, and the theorem is 
proved. 
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It is interesting to note that Alexandroff® raised the question as to whether 
or not a locally 0- and 1-connected set M which is an irreducible membrane 
with respect to a simple closed curve and satisfies the conditions p'\(M) = 
p?(M) = 0 is necessarily a closed 2-cell. This is answered in the negative by 
the example shown below. 

The same method may be used to give characterizations of sets obtained by omitting 
a finite number of open 2-cells from 2-manifolds. In general, J will be replaced 
by a finite number of simple closed curves and p'(M) will be required to have 
some non-zero finite value. The local conditions remaining as in Principal 


“ELEMENT A-A- 





Fig. 1. This figure represents a 2-cell, an interior portion of which has been stretched out 
into a wedge-shaped surface and then bent down to make contact with the rest of the 2-cell 
along a line, the sharp edge of the wedge coinciding with a portion of the boundary of the 
2-cell. The configuration evidently satisfies the conditions suggested by Alexandroff and 
fails to satisfy those of Principal Theorem B only along the T-shaped ‘“‘locus of singulari- 
ties’. At these points the 1-dimensional local Betti number has one of the values 2 and 3. 


Theorem B will insure that when 2-cells or other simple elements bounded by 
the simple closed curves are added the conditions of Principal Theorem A will 
be satisfied. In some cases it may be necessary to make some hypothesis con- 
cerning orientability. As an example, we give the following 

THEOREM 16. Let M be a 2-dimensional compact metric space with p?(M) = 0, 
p'(M) = 1,/J a simple closed curve contained in M and such that M is an irreduci- 
ble membrane with respect to an essential complete 1-cycle carried by J. Suppose 
also that, if ais a point of M, B'(a, M) € 1, and, if aisa point of J, B'(a, M) = 0. 
Then M is homeomorphic to a Moebius strip. 


25 See footnote 9, p. 181. 
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Proof. Let C be a 2-cell bounded by J such that MC = J. Just as in the 
previous theorem it can be shown that M + C is a combinatorial manifold, while, 
from the Mayer addition theorem, p?(M + C) = 1. Consequently, M + C 
is a projective plane, and M is a Moebius strip. 


12. Characterization of the open 2-cell. The following theorem is frequently 
useful. 

THeoreM 17. Let M be an n-dimensional locally compact (or compact) metric 
continuum such that, if a is any point of M, 8"-(a, M) = 1, and, infact, if «> 0,4 
neighborhood G of a exists such that 6(G) < «, dim(G — G) = n— 1, p"""(G—G)=1 
and G is an irreducible membrane with respect to a non-bounding complete (n — 1)- 
cycle of G — G. Then if J is any set which carries a complete (n — 1)-cycle which 
bounds on a set K such that K — J and M — (K + J) are non-vacuous, M — J 
is not connected. 

Remark. It is sufficient to assume that the local condition applies only to points 
of K — J. 

Proof. Suppose M — J is connected. Let r be a point of M — K. Since 
J does not cut M, the component of M — K containing r has a limit point p 
on K — J. Take e€ < p(p, J) and let G be a neighborhood of p satisfying the 
hypothesis of the theorem. Then G is an irreducible membrane with respect 
to the cycle on G — G. However, GK is also an irreducible membrane with 
respect to this cycle, since K is an irreducible membrane with respect to the 
cycle on J. But GK is a proper subset of G, since it contains no points of the 
component of M — K containing r, while G does. This contradiction proves 
the theorem. 

Making use of this theorem we obtain a characterization of sets all points of 
which have 2-cell neighborhoods, as follows. 

PrincipaL THEeorem C. Let M be a 2-dimensional locally compact metric 
continuum such that, if a is any point of M, B'(a, M) = 1 and, in fact, such that 
for every « > 0 there exists a (compact) neighborhood G of a of diameter < «€ such 
that dim(G — G) = 1, p\(G — G) = 1, p(G) is finite and G is an irreducible mem- 
brane with respect to a complete cycleonG — G. Then every point of M has a 2-cell 
neighborhood. 

Proof. Let a be a point of M, G; a neighborhood of a of the type described. 
By Theorem 5, G;, is locally 1-connected. 

Let G be a neighborhood of a of the type described and so small that every 
complete 1-cycle on G bounds on G;. Then Theorem 17 applies for every J 
contained in G, i.e., G; — J is not connected. We will show shortly that M is 
locally 0-connected. From this it follows that G — J is not connected, and, in 
particular, that any simple closed curve of G separates G. 

From the hypotheses on G — G it follows that this set contains a regularly 
closed 1-dimensional cantorian manifold C.2* Let G@’ be an auxiliary set homeo- 
morphic with G and such that GG’ = C. By the corollary to Theorem 8, 


26 Wilder, Point sets in three and higher dimensions, Bulletin of the American Mathe- 
matical Society, vol. 38 (1932), pp. 649-692; see bottom p. 681. 
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G + G’ is the irreducible carrier of an essential complete 2-cycle. By Theorems 
12 and F it follows that G is locally 0-connected and is cut by no arc of G. 

Since G is locally 0-connected, locally compact, and cut by every simple closed 
curve but by no are, it follows from Theorem H that G is a cylinder-tree, and, 
consequently, that every point of M has a 2-cell neighborhood. 

It is evident that, if every point of M has a 2-cell neighborhood, then M satis- 
fies all the hypotheses of the theorem. These conditions are then necessary and 
sufficient. In place of the condition ‘“p'(G) is finite” the equivalent hypothesis 
“(M is locally 1-connected”’ might be used. 

PrincipAL THEOREM D. Let M be a 2-dimensional locally compact, non- 
compact, metric continuum such that every complete 1-cycle carried by M bounds 
on a compact set on M and such that, if a is any point of M and « > 0 any real 
number, there exists a (compact) neighborhood G of a of diameter less than € such 
that dim(G — G) = 1, p(G@ — G) = 1, and G is an irreducible membrane with 
respect to a complete 1-cycle on G — G. Then M is an open 2-cell. 

Proof. By Theorems 5 and 18 any point of M has a 2-cell neighborhood and, 
by Theorem 17, every simple closed curve on M cuts M. By Theorem I it 
remains to show that, if J is a simple closed curve in M, M — J has just two 
components. To do this, we note that, since M is locally 0-connected, every 
component of M — J has every point of J as a limit point. For, if not, there 
is a component C of M — J and a point p of J which is a limit point of C and an 
end point of an are of J, no interior point of which is a limit point of C. Let 
U be a 2-cell neighborhood of p. Since U is a 2-cell, exactly two components 
of U — UJ have pas a limit point and each of these has as limit points all points 
of an are of J to which p is interior. This contradiction proves the statement in 
question. Now suppose that M — J has three components. Each has the 
point p of J as limit point, but we have just seen that in any 2-cell neighborhood 
of p there are only two such components. 

Principal Theorem C can also be used to give a characterization of cylinder- 
trees, i.e., subsets of the 2-sphere which are complementary to closed, totally 
disconnected sets. All that is necessary is to add the condition that M is im- 
beddable in R?. This is conveniently done by means of Theorem J. 

PrincipaAL THEOREM E. Let M be as described in Theorem 18 and, in addition, 
contain no primitive skew curve. Then M is a cylinder-tree. 


13. Some properties of Alexandroff’s local Betti numbers.' 

THEOREM 18. Let K be a closed subset of R", D a domain of R" — K such that, 
if a is any point of D — D, p™*“(a, K) = 0. Then D is uniformly locally 
i-connected. 

Proof. If D is not u.l.i-c., there exists a number e > 0 and a point a of D — D 
such that, if ¢ < «, S(a, ¢) — K contains an i-cycle which does not bound in 
S(a, «) — K. However, since p**"(a, K) = qgi(a, R" — K) = 0, an ¢’ < «€ 
and ao’ < e’ and < go exist such that every 7-cycle in S(a, ¢’) — K bounds in 
S(a, e’) — K, and, consequently, in S(a, «) — K. 








~ =» i ie hs 
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As a converse theorem we have 

THEOREM 19. Let K be a closed subset of R", a a point of K such that only a 
finite number of domains of R" — K have a as a boundary point and each such 
domain is uniformly locally i-connected, i # 0. Then p"*-(a, K) = 0. 

Proof. Suppose p”~*'(a, K) # 0. Then gi(a, R" — K) # 0 and for every 
e > 0 there exists a ¢ > 0 such that, if ¢’ < ¢, S(a, ¢’) — K contains an i-cycle 
which fails to bound in S(a, «) — K. This cycle may be taken as irreducible. 
Since i + 0 we may suppose it is a connected set and therefore contained in 
some complementary domain of Rk" — K. Since there is, by hypothesis, 
only a finite number of domains having a as a boundary point one of these con- 
tains arbitrarily small 7-cycles of the above type and hence is not locally 7-con- 
nected. 

For uniform local 0-connectedness we have the stronger result, supplementing 
Theorem 18, 

THEOREM 20. Let K be a closed subset of R", D a domain of R" — K such that 
if a is any point of D — D, it is a boundary point of a finite number, exactly ka, 
of domains of Rn — K and p*"(a, K) = ka — 1. Then D is uniformly locally 
0-connected. 

Proof. If D is not u.l.0-c., there exists a point a of D — D and a number 
e > 0 such that, if ¢ < «, D[S(a, ¢) — K] contains a 0-cycle which does not 
bound in D[S(a, «) — K]. However, there exists an «’ < e andao’ < e’ and 
< o such that there are exactly k, — 1 0-cycles in S(a, ¢’) — K which are inde- 
pendent in S(a, e’) — K. But these must consist of pairs of points in different 
domains and, consequently, any O-cycle of D[S(a, o’) — K] bounds in 
S(a, e’) — K and consequently in D[S(a, «) — Ky). 

As a converse to this theorem we have 

THEOREM 21. Let K be a closed subset of R", a a point of K which is a boundary 
point of a finite number, k., of domains of R" — K, each domain being uniformly 
locally 0-connected. Then p™(a, K) = ka — 1. © 

Proof. For every « > 0 there exists a o > O such that every 0-cycle in 
S(a, o) which is contained in a single domain of R" — K bounds in S(a, «). 
Hence p*"'(a, K) S k, — 1, and the equality is an obvious conclusion. 

From these theorems we obtain a characterization of those of R. L. Wilder’s 
generalized closed (n — 1)-manifolds®’ which can be imbedded in R”. 

THEOREM 22. The necessary and sufficient condition that a closed set M in R* 
be a generalized closed (n — 1)-manifold is 

1. p™""(M) = 1, while, if M’' is any proper closed subset of M, p""\(M') = 0. 

2. If ais any point of M, p"*"(a, M) = 0,1 S i S n — 2, p™ (a, M) = 1. 

DertinitTion. Let M be a closed subset of R*, N; a neighborhood in R* of 
the point a of M. A cycle y’ of N; — M is said to be irreducibly linked with M 
at a if y’ does not bound in N, — M but bounds in N, — (M — G), where G 
is an arbitrarily small relative neighborhood of a. 


27 Wilder, Generalized closed manifolds in n-space, Annals of Mathematics, vol. 35 (1934), 
pp. 876-903. 
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With respect to the 6’s we have the following 

THEOREM 23. Let M be a closed subset of R" and aa point of M at which M is 
irreducibly linked by m independent (in N, — M) r-cycles y{,¥3, +--+ ,¥m- Then 
pr-r*(a, M) = m. 

Proof. Replace M by the boundary of N,; together with the points of M 
interior to N;. From now on this set will be denoted by M. Let G be any 
relative (to M) neighborhood of a interior to N; and such that the distance of 
any point of G from a is less than the distance of the y; from a. Let M’ be the 
relative boundary of G,G’ = M — (G+ M’). Each of the y; bounds in Rk" — 
(G’ + M’) by hypothesis and also in R" — (G + M’), since G + M’ is contained 
in a sphere which excludes all the y!. Therefore each bounds a chain K** in 
R" — (G4 + M’) and a chain K**' in R* — (G’ + M’). Then Ki*' 4+ Kt? 
is a cycle in R" — M’. If any linear combination of these cycles bounds in 
Rk" — M’, the corresponding linear combination of the y;’s bounds in R" — M, 
by Theorem C, a contradiction which proves the theorem. 

The following theorem gives a relation between the 8’s and Alexandroff’s 
local Betti numbers.” 

Turorem 24. If M has no (n — r — 1)-dimensional condensation at a and 
p.*(M) is finite, then B** (a, M) = p2-""*(M). If p2-"*(M) is infinite, 
B""*(a, M) = w or No, depending on whether or not the base determining p.~"'(M) 
can be so chosen that there exists a ¢ > 0 such that all cycles of the base have points in 
M — S(a, oc). 

Proof. Suppose that M has no (n — r — 1)-dimensional condensation at a 
and p.-"'(M) = m, finite. Then pi(R" — M) = m. Let y; = (ya,---, 
Vir, +++ ), @ = 1, 2,---,m), be a base at a (in R" — M). Let « > O be any 
real number. The sequences y; may be assumed to be such that every pair of 
cycles of the sequence y; are homologous in S(a, «) — M. Take o so small that 
some cycle of each sequence lies in S(a, €) — S(a,c). For simplicity of notation 
this may be assumed to be ya. Let G be a neighborhood of a in S(a, a). 
Let o’ > 0 be chosen so that S(a, o’)M is contained in G. Let yix,, 
(¢ = 1, 2,---,m), be a set of cycles contained in S(a, ¢’). Then ya ~ 0 in 
R* — G;ya ~ yi; ~ Oin R" — (M + F(a, &) — G). These homologies deter- 
mine an (r + 1)-cyele y;*'. If y:* bounds in R" — (@ — G), then ya bounds 
in R" — (M + F(a, «)), or in S(a, e) — M, a contradiction. Consequently 
v**' links G — G, and similar reasoning shows that the m cycles y;*' are in- 
dependent in R" — (G — G). Consequently p*"-*(G — G) = m and p= 
(a, M) = m. 

If p2—"'(M) is infinite, 8"-"-*(a, M) = w or No, the first case occurring if it 
is impossible to choose a ¢ > 0 so that some yix,; lies outside S(a, ¢), for all 7. 

Since p”*—'(a, M) finite or w is a sufficient condition that M have no (n — r — 
1)-dimensional condensation at a, we have the 

Corotiary. If p’(a, M) is finite or w, then B’"(a, M) = p’(a, M). 


28 See footnote 1, pp. 16 and 25. 
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14. Unsolved problems. 1. It seems reasonable to suppose that many of 
the theorems in the theory of order of points of a 1-dimensional set could be 
extended to the n-dimensional case in terms of the 8’s. However, in most cases, 
this seems to be very difficult. 

2. Another problem is to give sufficient conditions, in terms of the 8’s and, 
probably, local connectedness properties, that a point of an n-dimensional com- 
pact metric space have a neighborhood which can be imbedded in R". 

3. A problem closely related to the preceding is that of extending the char- 
acterizations of sections 10, 11, and 12 to the corresponding n-dimensional sets. 

4. Finally, under what conditions does the equality B*"'(a, M) = p‘(a, M) 
hold? It seems probable that a partial answer is that it does whenever the latter 
is finite, but this has yet to be proved. 


UNIVERSITY OF MICHIGAN. 











ON THE POISSON SUMMABILITY OF FOURIER SERIES 
By Norman LEvINSsON 
1. Let f(z) be a Lebesgue integrable function of period 27, and let 
d(x) = f(y +z) +fly — x) — 2s. 


It is well known that if 


1 € z m—1 
(1.0) lim t | (1 - ) (x) dx = 0, 
e>0 € Jo € 
then 
Mae ay i. : rz 
(1.1) lim — o(x) dx (1 — z)" cos — dz = 0 
e-0 € Jo 0 € 


for n > m, where (1.1) is the n-th Riesz mean of the Fourier series for f(x) at 
r= y. 

In his conversation class, Hardy carried this relation over to Poisson sum- 
mability of Fourier series by proving in a very simple manner that 


lim « 2) 6-5 “dz =0 
e—0 0 


implies the Poisson summability of the Fourier series of f(z) at the point x = y, 


and conjectured that 
1 e\!+6 
lim I $(zx) “3 dz _ 0 
«0 0 x 


also implies the P summability for b > 0. We shall show this to be the case. 
We shall also show that there is another exponential kernel exp [— (z/e)'*}, 
similarly related to P summability. 

Our theorems are 

THEOREM 1. Let E(m, a) represent 


1 1 x a -(=)"™" 
(1.2) lim 1 (2) e \« (x) dx = 0, m>—1, a 20, 
«0 0 
and P(m) represent 
1 

(1.3) tim + f — m>-—-, 

0 € Jo fx \2+™ 2 
a + 


where (x) is defined as above. Then E(n, a) forn > mand a = 0, or E(m, a) for 
a > O implies P(m), while P(m) implies E(n, a) form > n. 
Received October 7, 1935. 
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THEOREM 2. If E-'(m, a) represents 


1 a e\itm 
(1.4) lim f (‘) c@) o(x) dz = 0, m>-—l1, a >0, 
o \z 2 


e—0 
and P(m) is defined as in Theorem 1, the conclusion of Theorem 1 holds with E 
replaced by E-. 
P(m) becomes ordinary Poisson summability for m = 0. 
Theorem 1 shows that E(m, a) implies E(n, 8) if m > n. That E(m, a) im- 
plies E(m, 8) if a > B = 0 follows immediately from 


1 a—s z\i+m e 
et" 7 = 1 +m [ (1 = yt) L+m . ye () () dy 
a-—pBp 0 y y 
r 
1+ m 


Similar results hold for E-'(m, a). 








2. Since Wiener’s fundamental work' on tauberian and related theorems, it is 
quite natural to use Fourier transform methods on these theorems.2. We require 
the following lemmas. 

Lemma 1. Let 


(2.0) lim : I (2) (x) dz = 0, 
where 
(2.1) J | o(z) |dxr << 
and 
(2.2) |Ni(z)| << A<o, 
If R(x) is a function such that 
(2.3) I | R(x) | dz < @, 
and 
: dx 

(2.4 [ir@i%®<e, 
and if 

} N.(z) = ¥ , *) dy 
(2.5) @ = | rw (2) 4 


1 N. Wiener, Tauberian theorems, Annals of Mathematics, vol. 33 (1932), pp. 1-100. 
? Wiener, loc. cit., has successfully applied these methods to Riesz summability of 
Fourier series. 
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then 


(2.6) lim i [ v,(2) o(x) dx = 0. 


e-0 € € 


Since the following repeated integral is absolutely convergent, the order of 
integration can be interchanged, giving 


oo 1 
IP t)s [me 
€ Jo € Y Jo y 
1 +00 1 
= . I (x) ac | r(¥) N; (:) dy = I o(x) Neo (2) dx. 
€ Jo 0 € y y € Jo € 


] 1 
| o(2) (2) dx} <M <>», 


by (2.0), (2.1), and (2.2), it follows that 


€ Jo € y 0 y 
s/t" (2) ay 4! [ wi(2) ote ar} + arf | ee | ay, 
0 € 0 l | b/e 


Since for sufficiently small 6 the first term on the right is arbitrarily small inde- 
pendently of ¢, and since for sufficiently small e, and any fixed 6, the second term 
is arbitrarily small, we have 


x 1 
lim a | r(¥) dy i v2) o(x) dx = 0, 
«0 € Jo €/ Y Jo y 


which, combined with (2.7), proves the lemma. 
Lemma 2. Lemma 1 remains valid if (2.4) is replaced by | R(x) | < A, x < 3, 
and 


(2.7) 


Since 





[ inmif<«. 
0 x 
For (2.7) remains true, since 
-) | €/2 wv | ad 
LieinGites [Gis +4 fa) 
0 e/|| \w/ly o | Wiy «2! \E 


af im@|%+4 Ry) | %. 
0 J] 1/2 y 


y 


dy 
y 





IIA 


The remainder of the proof follows as in Lemma 1. 
From these lemmas, it is evident that the solution of the integral equation (2.5) 
for R(x) when N,(x) and N2(x) are given is very important. It is to solve this 





a a it 
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equation that the Fourier transform, or rather in this case the Mellin transform, 
is so useful. Of course, the Mellin transform is only a Fourier transform upon 
which the transformation y = e* has been performed. We require the follow- 
ing two well-known theorems. 

TuHeorEM A. If k(w) is analytic in the stripa < u S b, where w = u + iv, 
and if 


/ |k(u + tv) |?}dv << A < w, asusb 


IIA 


there exists a function 


F(z) = lim. ss |. : k(w)x"— dw, asusb. 


Ss 
Ave “Tl J—jAin 
Moreover, 
k(w) = / F(x) 2" dz, a<u<b, 
0 
and 


[x |Paitar = 2 [ | k(u + iv) |? dv, asucsb. 


TuHeoreM B. If F(x) is a function such that 


[ \F@ia < 2 


and if 


k(iv) = [ F(x) 2-* dz, 


then 


a 1 ; im—1 (5 ie) 
F(x) = lim on | x1 k(iv) (1 ~ dv 


Ao = a 


almost everywhere. 
We can now solve the integral equation (2.5). We have* 
Lemma 3. Let Ni(x) and N2(x) belongtoL (0, ~). Let 


k(w) = i Ni(x) x” dx 


* The condition that Ni(z) and N.(z) belong to L(0,~) in this lemma can be replaced 
by a variety of other conditions. For example, if for some fixed b, 2-*N,(z) and x2*N,(z) 
belong to L(0,«) and if r(w) is analytic and belongs to L? in the strip b — 6 S$ u < b +4, 
then z2*R(z) belongs to L(0,<) and it can readily be shown that R(z) is a solution of 
the equation (2.5). 
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and let k2(w) be similarly defined. Let r(w) = ke(w)/ki(w). If r(w) is analytic, 
and 

| lr(u+iv)Pdv< M<a« 


in the strip, —6 S u S 4, for some 6 > 0, then 
iA 
(2.8) R(x) = Lim. = r(w) 2’—! dw 


Ax 2r —iA 


is a solution of the integral equation (2.5), and 


(2.9) [ |R(z)|\ dx < a. 


That R(x) defined as in (2.8) exists follows from Theorem A. It also follows 
from this theorem that 


[ | R(x) |? et¥+1 dr < x. 


Using Schwarz’s inequality, we get (2.9). To show that R(z) is a solution of the 
integral equation, we set 
H(z) = | R(y) w(2) S, 
0 7] 
pe 


[jm iae s [Re | dy [ \NuG)| ar. 


ef 


Thus A(z) belongs to L(0, «). Moreover, 


[ H(x) x-** dx [ x? dx [ rwn(2) 4 - 


r(iv) ki(iv) = ke(iv) . 


It follows immediately from Theorem B that H(x) = N2(z). This proves the 
lemma. 


Then 


3. In proving Theorems 1 and 2 we shall need the following transforms, which 
can readily be computed. 











a. ed T 1 
| =" (1 + m) 2) _ 
+ m) cos = i+m 
ws iat _z'+m _— 1 l—w+ea > 
[= € dz = +1 iia ), m> -—1l, a2Q0, 





[ eerer nae = - r(itete), m>-—1l, a20. 
e 1+m 1+m 
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We shall also make frequent use of 


rly! 
2 


|T(a + iy)| ~ (2n)'e 2 |yle 


in what follows. 
Proof of Theorem 2. First we prove that P(m) implies E-'(n, a) if m > n. 
Let 


1 —Ii—n 


Ni(z) = Pera | ’ N(x) = 2-7 e-* 


Then clearly N;(x) and N2(z) are absolutely integrable. Using the terminology 


of Lemma 3, 
kw) _ 2(1 + m) (142 +*) 0005 (F+*) 
ki(w)-r(1 + 2n) l+n ~2\Ll+m 


is analytic and belongs to L? uniformly along every ordinate for which —é6 Ss 
u <= 1+ 6 for some 6> 0. Thus the conditions of Lemma 3 are satisfied, 
and there exists an absolutely integrable R(x) such that (2.5) is satisfied. By 
Theorem A, setting u = 1 + 4, 








r(w) = 


| | R(x) |? a dz < w. 
0 
Thus by Schwarz’s inequality 
1 
[ | R(x) | < @. 
0 zx 


The conditions of Lemma 1 are now fulfilled, and therefore 


1 
tim > f (x) v(?) dz = 0. 
e-0 € Jo € 


This proves that P(m) implies E-\(n, a) if m > n. 
We shall now prove that part of the theorem which states that E-'(n, a) im- 
plies P(m) forn > m. Let 








_— 1 
N,(zx) = gee ’ N.(z) = res | ° 
Here 
tie ke(w) " a(1 + n) 1 
ki(w) 2(1+4+m) r(? +w+ *) T ( + ™) 
a SER AED. Te Fe 
l+n 2\1+m 


is analytic and belongs to L? uniformly along every ordinate for —é < u S 4 for 
some 6>0. Thus as before, Lemma 3 is satisfied and an absolutely integrable 


0 Ri) - TLE” =f. xe! dw 
"Sree. (Hest)... (#+8) 
r| —_———— ] cos —= 
i+n 2\i+ m 
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exists. If now we displace the path of integration to the right and observe that 
w = lisa pole, we have 














R(x) = l+tn 1 
1+m (++ m + *) 
l+n 
+ m(1 +n) Pa Rs Le x’ dw 
2(1 + m) 2m —t20+2+2m (? > a a *) 2 (¢ a *) ; 
r —— } cos— | ———_ 
l+n 2\1l+m™ 
But this yields 
ae a — —_* -_.. +. a, 
_- l+m C+2+*) 
Oe te 
l+n 


where A is some positive number. Since m > —}4, R(x) is bounded for finite 
x and the conditions of Lemma 2 are fulfilled. This proves that E~'(n, a) implies 
P(m) if n > m. 

Finally, we want to show that E~'(m, a), a > 0, implies P(m). The proof is 
precisely like the preceding one in every detail, except that R(x) in (3.0) is defined 
asalimitinthe mean. This completes the proof of Theorem 2. 

Proof of Theorem 1. First we show that P(m) implies E(n, a) ifm >n. This 
proceeds just as in the corresponding proof in Theorem 2 if we observe that 


) _ 21+ m) ,, ets) 5(P+2) 
aac asp ees 8 2\i +m 


is analytic and belongs to L* along every ordinate —é < u S< 1 + 6 for some 
56> 0. 
Now let us prove that E(n, a) implies P(m) forn > m. If a = 0, then 
m(1 +n) 1 


re) = 30m) r(1=) om? (@ EM) 
ry —— cos ~ {| ——— 
l+n 2\l + m 


and the proof goes just as in the first part of Theorem 2 by the use of Lemmas 1 
and 3. If a> 0, then the proof proceeds in the same way as in the second part 
of Theorem 2. Here 


l1+n) 1 wae xr”! dw 
R(z) = m( + | . — -. 
@) 2(1 + m) 2mt Jin r(? +a— “) Tt (¢ oa =) 


“Tn )2\itm 














We displace the path of integration to the right to the ordinate u = 1 +4 4, 
where 6 > 0 is sufficiently small so that the path crosses only the pole at w = 1, 
and in this way, we can show that R(x) is bounded for finite z. In every other 
detail, the proofs are identical. 
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Finally, we prove that E(m, «) implies P(m) fora > 0. As usual, Lemma 3 
leads immediately to an absolutely integrable R(x) defined by 


tA (w—1) (m+1) . 
R(x) = Lim. — eu : 
Bae L r(1 ,) ve 
om Ss , 
r+ 2 


= I'(w) ['(1 — w), we have 





iA 


7: T 
Using — 
sin rw 


tA 2.(m+1) (w—1) = >} 
i) « 0 eatin J. / rintDe) T(L — w) P(w) sin grw 4, 
avo 2ml Jia 
n(i- w+ 55) 


iw} 1 a 
——_-1 
= : = —, xDD T(w) sin }rwdw z(1 — z)i+™ * dz 
a 271 —ia-} 0 
:, 
l1+m 
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If a 2 1 + m, it follows from the second law of the mean that R(x) is 
bounded. Thus for a = 1 + m, an application of Lemma 2 proves this part of 
Theorem 1. 

If a < 1 + m, we must proceed somewhat differently. We have, setting 

z=] — yr'**, 


(1 + m)x--™« 


2-5 a —. = a7) 
R(z) = ‘ea [sn tars) 8 5(1 + m) re m) - | cosy y** ay} 


1+ m 
vs a a Ta  e men | 
(3.1) — cos zit ir(-2,.) sin 2(1 + m) i siny y! ay} 
; 1 Ta 
_ —Il—m+a oe jane nine: 
= (l+ m)z sin (4 ad + =) + O(1). 


Clearly we cannot use either Lemma 1 or Lemma 2 because of the behavior of 
R(x) at x = 0. However, we can show that the operations performed in these 
lemmas are valid here. First we show that the inversion of the order of integra- 
tion of (2.7) is justified in this case. We have 


(a) [ 2) ae am? [ante [H0) ne) 
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since N,(x) = z%e-** is uniformly bounded and R(z) is absolutely integrable. 
If we can show that 


(3.2) vn i o(x) dx I R(¥) w(?) _ 
so Jo 0 € Y/Y 


we have (2.7) of Lemma 1, and can proceed with the remaining argument of 
Lemma 1 to complete the proof of this final part of Theorem 1. 
Using (3.1), we can prove (3.2), if we can show that 


(3.3) I, =lim | $(z) dz [ yt ain a(S - atra)G) «“ " 


and 


(3.4) lim [ "| oa) | dz [ (2) eG) a = 0. 


But (3.4) can readily be shown by breaking the z interval of integration (0, 1) 
into (0, 5) and (6, 1). 
For (3.3), we get, on setting w = y~'-", 





Ds "aie * ewertm cin ( + =< ) 
I= — 7 lim r6(2) de |” sin (« ras dw, 


and by the second law of the mean 
2 : 1 , ait a r+m 
lh|s aa eyar im [ x* | o(z) |e (3) dz = 0, 
since $(x) is absolutely integrable. This completes the proof of Theorem 1. 


CAMBRIDGE, ENGLAND. 

















CONCERNING THE TRANSITIVE PROPERTIES OF GEODESICS 
ON A RATIONAL POLYHEDRON 


By Rautpx H. Fox anp RicHarp B. KERSHNER 


This paper considers geodesics on ordinary polyhedrons' in an abstract space. 
A geodesic on an ordinary polyhedron becomes an ordinary straight line if the 
sequence of faces belonging to that geodesic is thought of as spread out on a 
plane. We shall be concerned in what follows only with rational polyhedrons, 
that is, ordinary polyhedrons in which the sum of all angles at any corner is a 
rational multiple of x. The problem may be considered as an elementary 
illustration of the ‘billard ball’ problem considered by Birkhoff in Chapter VI 
of his Colloquium Publication Dynamical Systems and was suggested to us by 
Wintner. The geometrical condition of rationality defined above is, in the 
main, the condition on integrability in the sense of Birkhoff or, in the case of 
a periodic solution, the rationality of the rotation number. 

If a direction is moved parallel to itself along any closed curve, which meets 
no corners, it can only come back to a finite number of positions, for the closed 
curve can be deformed continuously, without passing over any corners, into 
another which admits a decomposition into simple loops, each loop consisting of 
a closed circuit about a corner. Each circuit changes the direction by the sum 
of the angles about the corner and there is but a finite number of corners. 

Now? an “Ueberlagerungsfliche” P for the rational polyhedron II may be de- 
fined as follows. Consider an arbitrary but fixed direction*® on one of the faces 
of IT and all possible simple curves on II starting from a fixed point in the interior 
of that face and not meeting any corners. Let the initial direction be moved 
parallel to itself along each of these curves. If two such curves have a common 
end point but different directions there, we consider the two end points to be on 
different faces. The totality of faces, distinct in this sense, constitutes the 
Ueberlagerungsfliche P. The most important properties of P are the following. 

(1) Pisa finite polyhedron. For to each face of II there corresponds a finite 

number of faces of P. 


Received by the Editors of the Annals of Mathematics November 9, 1934, accepted by 
them, and later transferred to this journal. 

1 Ordinary polyhedrons are meant in the sense of E. Steinitz, Polyeder und Raumein- 
teilungen, Encyklopiidie der Mathematischen Wissenschaften, vol. III, Part I,, p. 15. 
Geodesics on polyhedrons have been considered, for instance, by P. Stickel, Geoddtische 
Linien auf Polyederflichen, Rend. Circ. Mat. Palermo, vol. 22 (1906), pp. 141-151 and by C. 
Rodenberg, Geodatische Linien auf Polyederflachen, ibid., vol. 23 (1907), pp. 107-125. 

? For the ideas involved here cf. H. Weyl, Die Idee der Riemannschen Flache, Berlin, 1923. 

3 The particular choice of this direction has, of course, no influence on the construction 
we are going to make. 
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(2) Except at the corners of P, there is one and only one direction parallel to 

any given direction in any point of P. 

(3) As a consequence, a geodesic on P, which is not closed, does not meet itself 

on P. 

(4) Except at corners, the structure of P is locally that of the euclidean plane. 

At corners its local construction is that of a branch point of finite order of a 

Riemann surface over a euclidean plane. 

If a segment of a geodesic is moved continuously parallel to itself in a direction 
perpendicular to itself, the resulting set of segments of geodesics is called a 
(geodesical) strip. Of course a strip cannot contain a corner of P in its interior; 
as long as no corners are brought into the interior a strip may be continued, 
either by lengthening the original segment or by widening the parallel movement. 

The continuation of a geodesic over a corner has not been defined and, indeed, 
it is not in general‘ possible to define a unique continuation. However, it is 
always possible to define a right-hand and a left-hand continuation according as 
the (oriented) geodesic is considered as a limiting position of parallel geodesic 
segments to the right or to the left of it. If the geodesic meets several corners, 
we shall always consider as a finite segment of a singular® geodesic only such a 
segment that is the limit of a variable non-singular segment. 

As far as geodesics are concerned it is P that is fundamental and not II. Hence 
we shall really be interested in the behavior of geodesics on P although the 
theorem which we shall prove will describe indirectly the behavior of geodesics 
on II. The following is the theorem. 

If P is the Ueberlagerungsflache of a rational polyhedron and g is a geodesic on P 
which is not closed, then g is dense on a subset T of P which is the closure of a sub- 
region and may be the whole of P. In addition 

i. IT consists of a finite number of strips; 

ii. T' ts bounded by a finite number of segments of singular geodesics with corners 
at both ends; 

iii. any geodesic of T (not excluding the singular geodesics in T) is dense in YT. 
In particular, if g is dense in P, any parallel geodesic is dense in P. 

If the geodesic is dense on P, then i and ii of the theorem need no proof. 
Suppose then that g is a non-closed geodesic which is not dense in P. Then there 
exists at least one strip free of gon P. Let S, be such a strip, bounded on at least 
one side by a segment h which is either a segment of g or a limit of segments of g, 
and which cannot be widened at the side opposite to h. As long as not both 
end points of h are corners, we can lengthen S, with the understanding that if a 
corner appears in its path we must at the same time make S, narrow enough to 
get by. This narrowing can only happen a finite number of times, for there is 


4 A unique continuation will exist if the sum of the angles about the corner is an integral 
multiple of x. On the tetrahedron, for example, continuation is unique at all corners, since 
the sum of the angles about any corner is z. 

5 A singular geodesic is one that meets (at least one) corner. Cf. Stickel, loc. cit., §12, 
and Rodenberg, loc. cit., §5. 
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only a finite number of corners, by (4), and each corner can only be met in a finite 
number of sheets, by (4), and in each sheet only once, by (2). So the minimum 
width of S, is positive and as the area of P is finite, by (1), it follows that the 
maximum length of h is finite. This cannot be caused by h being closed, for the 
closure of h implies the closure of g. So the reason must be that both end points 
of h are corners. 

The total number of such strips S;, with maximal h is finite for the total number 
of segments in the direction of h with corners at both end points is obviously 
finite. So the width of all these strips has a positive minimum and their length 
a positive maximum. It is now obvious that the point set T covered by g and 
its closure is the sum of a finite number of strips, and that its boundary consists of 
a finite number of segments each with corners at both end points. 

Let 9; be a segment of a geodesic in T. Then a geodesic g generated by 9g, (if 
g: meets a corner in T’ it may actually happen that there are two geodesics gener- 
ated by 9;) is a geodesic lying completely in T. (It is not excluded that 9, be a 
boundary of I, but in this ease g; consists of the inner continuations of g:.) gi 
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is not a closed geodesic, since a closed geodesic has a minimum distance from 
corners and g is by hypothesis dense in T. Hence g; is dense on a subregion plus 
its limit points, [', having properties i andii. But since every segment of q; is a 
limit of segments of g, we have Tl; = [. This proves property iii of I. 

The meaning of the theorem just proved is that, given a direction on a face of 
II, P is split up by a finite number of segments of singular geodesics with corners 
as end points into a (necessarily finite) number of “closures of subregions”’ 


a. Bi! se Re r,, T,, --- , Bs; a20,8 20,a+8 21, 


each “subregion” having properties i and ii and no two “subregions” having 
any region in common; a geodesic in the given direction in I’; lies completely in 
and is dense on I’ ;; a geodesic in the given direction in I; lies completely in TF; and 
is closed (i.e., the I’s are strips of closed geodesics). 

In general, a direction on a face of II will not induce an actual “splitting up’’ 
of P; what we mean by this is that the number of directions on a face of II, for 
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which a geodesic in P in this direction is not everywhere dense on P, is countable. 
Since P. Stickel has proved*® that the number of strips of closed geodesics is 
countable, it is sufficient to show that the number of segments of singular 
geodesics with corners at both ends is countable. But this is evident, for the 
number of such segments, of length less than a given number, is finite. 

On the other hand, there do actually exist polyhedrons which “split” into 
l’-regions in certain directions. For example, consider a P-surface consisting of 
the fronts and backs of three rectangles, the width of the first being the sum of 
the widths of the other two, and joined together as indicated in the diagram on 
page 149. If the widths of II and III are in an irrational ratio, a geodesic parallel 
to a side of one of these rectangles will be everywhere dense on the upper sides 
of these three rectangles.’ (If the ratio is rational, a geodesic parallel to a side 
is closed.) 

For those rational polygons (two-sided) with which the plane can be paved 
and those regular polyhedrons whose faces are such polygons (such as the tetra- 
hedron,® cube, octahedron, icosahedron, etc., but not the dodecahedron) it is 
certain that non-closed geodesics are dense on the whole Ueberlagerungsfliche. 
For on them a singular geodesic which meets two corners is necessarily closed, 
there being but a finite number of sheets and a finite number of corners; hence 
there cannot exist proper subregions of the type T. Whether there exist any 
proper subregions of the type T' on the dodecahedron or on an arbitrary rational! 
triangle is an open question. 


Tue Jouns Hopkins UNIVERSITY. 


® Loc. cit., §10. 

7 The example may be so chosen that it admits a realization in the ordinary three-space. 
One starts with a rectangle ABA’B’ of height 1 and width 1/\/3 and marks points O and O’ 
on AB and A’B’, respectively, so that OB = O’B’ = (2 + \/2)(\/3 — 1)/2\/3. Sub- 
sequently, one marks points P, Q’ on AB’ and P’,Qon A’Bsothat Z AOP = Z A'O'P’ =x/6 
and ZBOQ = Z B'0’Q’ = «/8. A billiard ball bouncing about the hexagon POQP’O’Q’, 
with initial direction parallel to PQ’, will follow the path of a geodesic and will be dense 
on a certain I’-region of the Ueberlagerungsfliche of POQP’O’Q’. If we fasten a rectangle 
RSTU to the edge PQ’ so that R and S lie between P and Q’, a geodesic g, parallel to PQ’, 
will be dense on POQP’O'Q'STUR; if parallel to PQ’ but below it, g will be closed on STUR. 
This is a concrete example showing that regions of both types, I and T, may exist on the 
same polyhedron in the same direction. 

’ The tetrahedron may be treated in a much simpler manner by Kronecker’s approxima- 
tion theorem. For the tetrahedron a non-closed geodesic is not only everywhere dense but 
uniformly everywhere dense on its Ueberlagerungsfliche. Cf. H. Bohr and R. Courant, 
Diophantische Approximationen und Riemannsche {-Funktion, Journal fiir die reine und 
angewandte Math., vol. 144 (1914), pp. 258-263. 














A CLASS OF BOUNDARY PROBLEMS OF HIGHLY IRREGULAR TYPE 
By Joun I. Vass 


1. Introduction. In dealing with expansion problems involving ordinary 
linear differential equations with boundary conditions, Birkhoff,' in 1908, dis- 
tinguished between so-called regular and irregular systems. The distinguishing 
features of regularity, as defined by him, are intrinsic? and not peculiar to the 
method of treatment. The expansions for arbitrary functions in terms of the 
solutions of regular systems have been discussed under hypotheses of con- 
siderable generality.’ 

The irregularities of differential systems fall into two classifications, mild 
irregularities and those which are more severe. A system may have one or 
both types. Expansions involving mildly irregular systems have been treated 
in rather general cases by Langer‘ and Stone.’ On the other hand, the expansion 
problem for highly irregular systems has been successfully studied only in very 
elementary cases. In so far as is known to the author, this theory has been 
developed only for differential systems which are highly irregular without being 
mildly irregular and which have an equation of the type 


d™u — 
(1) ae t MU =9, (n = 3), 


in the work of Hopkins® and L. E. Ward.’ 

If the conditions of regularity are not fulfilled, the series expansions are, in 
general, non-convergent, but may be summable by suitable means. It was 
found that a function f(z) satisfies certain very restrictive conditions in order 
to be expansible as a uniformly convergent series in the solutions of highly 


Received August 5, 1935; presented to the American Mathematical Society, April 6, 
1934. The author wishes to express his appreciation to Professor R. E. Langer for his 
many helpful suggestions in the writing of this paper. 


1G. D. Birkhoff, Trans. Amer. Math. Soc., vol. 9 (1908), pp. 373-395. 

2D. Jackson, Proceedings Amer. Acad., vol. 51 (1915-16), p. 383, et seq. 

3J. D. Tamarkin, Mathematische Zeitschrift, vol. 27 (1927), pp. 1-54. 

4R. E. Langer, Trans. Amer. Math. Soc., vol. 31 (1929), pp. 868-906. 

5M. H. Stone, Trans. Amer. Math. Soc., vol. 29 (1927), pp. 25-53. 

6 J. W. Hopkins, Trans. Amer. Math. Soc., vol. 20 (1919), pp. 245-259. 

7L. E. Ward, (a) Trans. Amer. Math. Soc., vol. 29 (1927), pp. 716-731; (b) Annals of 
Math., vol. 26 (1925), p. 21, et seq.; (c) Trans. Amer. Math. Soc., vol. 34 (1933), pp. 417-434, 
with the equation d*u/dz* + [p? + r(z)]u = 0, r(x) a convergent power series in z*. 
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irregular systems, in which appropriate boundary conditions are associated 
with equation (1). For instance, in the case of a system involving equation (1) 
for n = 3, Ward® showed that, when zx is considered complex, the function f(x) 
is analytic in a specified region of the complex plane; and that, besides satis- 
fying certain other auxiliary conditions, f(z) is of such a nature that when 
written in the form 


F(x) = gila*) + xgo(x*) + 2° ¢3(2%), 


the functions ¢)(x*), xge(zx*), 2°g3(2*) satisfy certain differential relations dictated 
by the boundary conditions. 
The n linearly independent solutions of equation (1) are 


ri 
ee Daz k = 1,2,.--,n, w=e", 
The complex numbers w?*-?, k = 1, 2, ---,n, present in these solutions, are 


symmetrically located on the unit circle. This symmetry played an essential 
part in the methods used by both Hopkins and Ward. While for certain pur- 
poses such symmetry could be highly desirable, it seems to the author that 
in these expansion problems it obscures the course of reasoning which must be 
applied to other problems in which the differential equations have solutions 
not involving such symmetric numbers. 

In the literature no mention is made of the expansion problem for highly 
irregular systems of the second order. The present paper takes up the ex- 
pansion theory for such systems of the second order. The reason for this 
omission in the literature is that with an equation of type (1), n = 2, the sys- 
tems corresponding to those studied by Hopkins and Ward are regular. The 
highly irregular systems of the second order considered here have an equation 
with solutions lacking such symmetric numbers as those mentioned above. In 
the hope that some light may be shed on the general situation for highly irregular 
boundary problems, the author is here concerned with finding the determining 
factors for the restrictions on f(x) for systems with an equation differing in type 
from those previously considered. 

The methods used are essentially those of Hopkins and Ward, and the con- 
clusions reached show an appropriate similarity to the results found in their 
papers. The present paper deduces certain sufficient conditions (see Theorems I 
and II at the end of this article) for convergence at interior points of the funda- 
mental interval for expansions involving systems which are highly irregular. 
To avoid complications not pertinent to the argument, the systems considered 
here are free from mild irregularities.® 


8 Reference (7) (a). 

® A treatment of the expansion problems for analogous systems which are at the same 
time both mildly and highly irregular may be found in the author’s doctor’s dissertation 
on file at the University Library, Madison, Wisconsin. 
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2. The differential system. Any differential equation of the type 
u'’(x) + mdu’(x) + med u(x) = O, 


where m and me: are such constants, real or complex, that the equation 
y + my + me = 0 has roots which are distinct, non-zero, of equal absolute 
value, and with arguments differing by a rational fraction of 7, can be reduced 
by a simple change of parameter to the normalized form 


(2) u''(x) — 2p coscu’(x) + pru(x) = 0, 


with c = pr/q,0 < 2p <q. The functions 


u;(x) = ” ia j = ;. 2, Qa, = e*, a= ee 
form a complete set of solutions for this equation. Two distinct differential 
systems will be formed by adjoining boundary conditions to equation (2). 
As a first condition, System I will have u’(0) = 0 and System II will have 
u(0) = 0, and each system will involve the ordinary linear boundary condition 


W2(u) = ayu’(O) + aeou(O) + ba w’(1) + dboou(l) = 0. 


The fundamental interval can be taken as (0, 1) without loss of generality. 

It is at once evident that, with » = 0 and the constant coefficients a;;, bj; 
properly chosen, these two systems are included in the following System III, 
which consists of equation (2) with the boundary conditions 


W,(u) 
W2( u) 


Wi(u) + vp 'Wulu) = 0, 
Wa(u) + Wau) = 0, 


where 


W io(u) 
Wa(u) 


ay u'(0) + apu(0), 
bau’(1) oe biou(1), (i = l, 2). 


This system is regular for vy ¥ 0. 
The two-rowed determinants of the matrix 


Wro(u), vp Wi(us), W o(ue), vp Wis(ue) 
| Weo(ur), Wau), Wo(u2), War(ue) I 
are functions of p. If [A;] is a polynomial in p~' of the type 


Aa Aw Ais 
p + pe p* 








[Ai] = Ay + (é = 0,1, 2,3, 4, 5; A; ¥ 0), 
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the six independent two-rowed determinants of this matrix may be written as 
follows: 

Dy = | Wiolur), Wio(ue) | = pl[Aol, 

D, = | (vp~)?-* Warl(ur), Wio(ue) | = p*[Aije™?’, 


(3) Dz = | Wio(ur), (vp)? * War(ue) | = p*[Aale**’, 

Ds = | (vp)? Wau), (vp~")?-* Wa (ue) | = vl As] efartar)e 

D, = | W ious), (vp~!)?-* Wir(ur) | = plAg] fo, 

Ds = | Wio(ue), (vyp~')?-* Wa(ue) | = p*[Asle**’, (¢ = 1,2). 
The System III is compatible if p is a root of the characteristic equation 

’ 3 
(4) ‘ite — a eae = > D; = 0. 
2 us), W2(ue) i=0 


If any one of the terms in this sum is identically zero, the system is highly 
irregular. This occurs when »v = 0, for then D; = 0, and it is this system which 
is to be studied here. Thus for the two values of the parameter vy = 0 and 
v = 1 it is possible to make use of the similarity, such as it is, between the 
highly irregular and the regular systems. It is for this purpose that the pa- 
rameter v is inserted in the system. 


3. The characteristic values and the contours 7,. Transcendental equations, 
such as equation (4), and their solutions have been discussed by Langer” and 
others. From their work it is readily inferred that there are two distributions 
of characteristic values, one for »y = 0 and one fory = 1. If » = 1, these values 
are spaced at asymptotically regular intervals along the rays I, I, IIa, and 
IIIb (see Fig. 1). For the irregular system with vy = 0 the values are spaced 
along rays I, II, and III, where III is the positive axis of reals. 

The characteristic vaiues thus located may be ordered for either set according 
to numerical magnitude by assigning subscripts so that | p,| S | pa4i|. If char- 
acteristic values have the same numerical magnitude, either actually or asymp- 
totically, they are considered in counter-clockwise order. 

It is possible to construct"! in the p-plane an infinite sequence of contours, 
Yn, N = Ny, Ne, M3, --- , Which are concentric circles with center at the origin, 
the same sequence serving for both v’s. These contours have the following 
properties. 


(i) Every point of any contour y, lies at a distance greater than a definite 
positive constant 6 from any characteristic value; 

(ii) The contour 7, has a radius R,, where R, — ~ asn— ~ over the sequence 
nN, Ne, eee > 


10R. E. Langer, Trans. Amer. Math. Soc., vol. 31 (1929), pp. 837-844. 
11 See reference (4), p. 878. 
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(iii) The contour 7, contains in its interior just n characteristic values; 
(iv) The number r of characteristic values between any two consecutive con- 
tours satisfies the inequality 1 < r S 4. 


A further important fact to be noted is that A(p, v) or the quotient of A(p, v) 
divided by any of its terms is uniformly bounded from zero for p on any con- 
tour yn, ” sufficiently large. In particular, if »y = 0, for such values of p, 
A(p, 0) is expressible in the form 


(5) A(p, 0) = p’e™"Ar(p)[Ai] , 


with A; uniformly bounded from zero. 
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The solution of the differential equation (2) which satisfies the first boundary 
condition is, except possibly for a constant factor, 


u(x) = Wi(us)e™"* — Wi(w)e™*. 
This also satisfies the second boundary condition for characteristic values of p, 
thus forming the set of characteristic functions u(x), i = 1, 2, ---. 
4. The formal expansion of an arbitrary function. Let f(x) be any arbi- 
trary function and let the function fi(x) be defined by the relation 
ji(z) = f(z) — af(1) — (1 — x)fO), 
so that fi(z) has the property that it vanishes at both z = Oandz=1. A dis- 
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cussion will now be made of the formal expansion of f,(7) in a series of charac- 
teristic functions, 
xe 


(6) fiz) ~ - a,u™(x) . 


n=1 


Langer™ has shown that the sum of the first n terms of such expansions as (6) 
are associated with the contour integral 


1 
kid aa i i M(@fi(s)dods , 
2mt Jo Jyn 
where G is the Green’s function and the operator M(G) is defined by the relation 


M(G) = (a: + a2)G.(x, 8, p) + pG(z, 8, p), 


the subscript s indicating differentiation with respect to that variable. 

The convergence problem will now be taken to be that of showing that the 
value fi(x) is the limit of the contour integral [,,(2) as n — & over the sequence 
Ni, Ne, Ns, ---. If more than one characteristic value necessarily lies between 
consecutive contours, the process corresponds to a summation of the series 
with the terms properly grouped. 


5. The integral 7,(x). The Green’s function for the System III is given 
by the well-known formula 


| u(x), wuelx), g(a,s, p)| 


| 
| 
| 
| 


(7) G(z, 8, p,v) = — Wilur), Wilue), Wilg) 
(p, v 


| 
| 


| We(u), W2(ue2), W2(g) 
where 


og ptnleduale) = rule(s) 
Prieto ui (s)ue(s) — us(s)u(s)’ 


the positive or negative sign being taken according as x> or <s. The Green’s 
function (7) can be expanded into the form 


| u(x), u2(x), 0 | 
G(x, 8, p) = g(x, 8, p) + se Wio(u), Wro(ue), Ww(9) 
Walu:), Welus), We(g) | 


12 See reference (4), p. 887. The operator M(G) can be derived easily from the integrand 
of the last integral on the page by the choice of f(z) indicated in the footnote there, and 
integration of the last term by parts. 
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| u(x),  ua(z), 0 | 
Wulw), Wulue), Wul(g) 
Wem), Welus), Walg) | 
= g(x, s, p) + h,(z, s, p) . 


a 


vit 





Using this latter expression for G(z, s, p) in M(G), the integral J,,(2, v) becomes 


1 1 
(8) I,(z, v) = aI [ M (g)fi(s)dpds to 5 / [ M (h,)fi(s)dpds . 


From the literature it can be drawn that, if f,(x) is integrable and of bounded 
variation on the open interval (0, 1), the first integral on the right of (8) con- 
verges to 3 {fi(a~) + fi(zt)} asn — ~. Likewise from the same source, it is 
concluded that if vy # 0, the second integral on the right of (8) converges uni- 
formly to zero. The proof that this integral converges to zero does not require 
that » ~ 0, except for the integration with respect to p over y,1, the are of y, 
which lies in the sector S:, where R(ap) 2 0 and R(ap) > 0. (See Fig. 1.) 
Hence for v = 0 it is the integral 


T(x) os +f [ M (ho) fi(s)dpds 9 


which requires special treatment and any conditions under which this integral 
converges uniformly to zero give the results sought. The convergence of this 
integral is the prime contribution of this paper. 


6. The form of J,:(x). Since z is present only as a parameter in 
u(x), u2(z), 0 

Wio(u), Wl), Wl) | 
W.(u:), Welue), We(g) 


H(z, s, p) = 


~~ 


1) 
A(p) | 
| 


it follows that 
| u(x), u(x), 0 
M (ho) = x | Wo(um), Wo(ue), Ww(M(qg)) | 


(ep) | 
| Welw), Welus), We(M(Q)) | 





Let the function g(z, s, p) be written in the form 
2 
g(x, 8, p) = + 4 ys u;(x) v;(s) ’ 
i=1 


13 Cf. reference (4). 
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the positive sign being used if x > s and the negative sign if r < s. From 
differentiation with respect to s of the functions 


e7aps — e-a2P8 


Vv) (s) = v2(s) = 


p(a, — az) ’ p(a; — az) ’ 


the formula for M(g) is found to be 
2 
M(g) = ¥ 4p D aj uz) vis), 
i=1 


where the sign is negative or positive according as x > or < s. In similar 
fashion, 


WiwlM(g)] = 3p > az vi(s) Wio(us) , 


WAM (9)] = 40 - a? vi(8) [Waolu:) — Wau] . 


Thus 
u(x) ’ ue(zx) ’ 0 


M(ho) = _. Wo), Wio(ua), ted a’ vi(s) Wio(us) 








> Walw), > Wai(us), D> a? vi(s) [Waolus) — Wau] 


i=0 
and by linear combination of columns 


u(x), u2(zx), R(z, 8, p) 


M(ho) = x Wrolus), Waolua), Do part vi(s) Wow) 








Wa(u:), Wawa), pat? v4(s) Wao(u) 


where 
2 
R(z, 8, p) = tod. a? us(x) v;(s) . 


This evaluation of M(ho) may be used in the integral J,:(2). The determinant 
displayed in M(ho) is now expanded by means of the minors of the elements 
in the last column. The integrals due to the term R(z, s, p) will not be dis- 
cussed further, since they may be merged with the first term on the right of (8). 
If the symbols y;; are defined by 


Yai(2, 8) _ a; u;(x)v,(s) ’ (i,j _ 1, 2) ’ 
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the contribution J’, due to M(ho) becomes 


, oi 3 
I,,(z) = Ont / [. i {(Do + Ds)yu + Dsyxr — Diyn 


+ (Do + D,)y2} fils)dpds , 
where the D; are those of (3) (with » = 0). 


7. The convergence of the integral I/,(x). The integral J/,(x) will now 
be expanded by substituting the definitions of u,(x), v;(s) in the functions 


yi(z, 8). If l; = —% 





, t = 1, 2, this integral can be expressed as the sum 


Qa a,” 
of six integrals, 


~ 2 ah [. A'(p)e%* =” pLAdl fils) dods , 
-— uh f A-"(p)e™1? 2-9 +44” 92[ As] f(s) dpds , 


(c) waif a A“M(p)e%2 42°C 92[ As] fi(s) dods , 
(9) 1 
@) oF =38 I. AH(phersrzteir— 52 Ad] fi(s) dds , 


(e) a [ i A-"(p)e**” = p[ Ao] fils) dpds , 


(f) a / A-"(p)e%? 2-9 F% 62[ As] fi(s) dpds : 


The sector S; is now to be divided into the sub-sectors S,;, 7 = 1, 2, equal 
in size and with the positive axis of reals as separating boundary. For the 
sector Sy, which is situated in the fourth quadrant, 0 < R(azp) S R(arp), 
while for the sector Sj. in the first quadrant, 0 < R(ap) < R(azp). The 
convergence proof will be carried out for the sector Sy only, since the argument 
is the same for S,2 with the réles played by a; and az interchanged. Hereafter, 
the symbol y,: will be understood to refer to the are of yn in Su. 

The integrals (a) and (e) in (9) will be considered first. If the form (5) is 
used for A(p) in these integrals, they become 


—ml [' -1 @, p(z—s—1) [4 dp 
(a) = i [a (pe = fils) : ds, 


1 
(e) sh [ aitipreneero-ieumeu [At] 566) ® ae, 
mt 0 Yn 1 p 


respectively. In this form these two integrals are readily seen to converge 
uniformly to zero by reason of the following lemma. 
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Lemma I." Let & be a real variable on the interval (0, &), p a complex variable 
ranging over the circular contours y,, and y, any arc of Yn lying entirely in the 
half-plane R(c\p) S 0 (ce; aconstant ~ 0). Then, if E(p) is uniformly bounded for 
|p| sufficiently large and (£) ts any function which is integrable on the interval 
(0, &:) and O S ¢’ S &” S &,, the integral 


a 
I | E(p)e"?* y(&) ~ dé + 0 
g’ Yn p 
uniformly asn—> @. 

The irregularity of the problem is such that the convergence for the remaining 
four integrals of (9) cannot be shown by methods like those employed for the 
integrals (a) and (e). The individual integrals apparently do not converge, 
and hence combinations of them are to be considered. Let it be assumed that 
fi(x) is differentiable. Then, since fi(s) vanishes at both s = 0 and s = 1, 
integrations by parts with respect to s leave the integrals in the forms 


@ —s)+a , 
(a) a i i A ‘(pe ,P (2—-s)+a,p plAdlfy (s) dpds ’ 
0 Yn 


—l ~ a a —s) , 
(b) =| / A “(pe saibtialat nati plAslfy (s) dpds , 
0 Yn 
(10) 


ra 
0) 58 [Pf srorerroe ptAdgivo dods, 
Tt Jo Yn 


1 
(d) =a i AW "(p)e%2? 9) +1" DLAs] fi (8) dods . 
a7 Jo rm 
For these the parts arising from integrating from x to 1 can be shown to 
converge to zero by the methods used for the integrals (a) and (e). 


8. The integrals (10) over the interval (0, x). It remains to be shown 
that the portions of the integrals (10) taken over the interval (0, x) are uni- 
formly convergent to zero as n — © when f,(s) is suitably restricted. To show 
this convergence, a change in the path of integration with respect to s is made. 
As a first step, in the integrals involving e~*'’* the change of variable s = agt/a; 
is made, while in the integrals involving e~“**, s is merely replaced by t. This 
has the effect of making the exponents in these two exponentials the same, so 
that the integrals become 


# [ | atiphereren-” Ag) j, (#4) dpdt , 
2Qri 0 Yn ay 


—l, |* ’ 
5 : | / AN(p)e™?*F%22"" 5[ As] f(t) dpdt, 
7 0 Yn 


1h | aHipherrinen Aa fi(22t) dade, 
me Jo Ym a 


4 / I A~'(p)et2? P+" p[ All f; (t) dpdt.. 
ue Yn 


(11) 


4 Tamarkin, loc. cit., p. 43. 
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Consider the circle of radius x with center at x = 0, which will be referred to 
as the “z-circle”. The path of integration is now changed from the straight 
lines 0 to x and 0 to ar/az to the straight line 0 to a,x, together with the ares 
of the “z-circle” from ax to x and ax to ax/az. (See Fig. 2.) To render 
this and subsequent steps valid, the function fi(z), with z a complex variable, 
is to be assumed analytic within and on the “z-circle”’. Furthermore, and 
this is an essential restriction, fi(¢) is to be such a function that the integrals 
(11) over the path 0 to ax exactly cancel. 








complex x plane 
Fig. 2 
9. Convergence along the arcs az to x and a,x to a r/a2, The two expo- 
nentials in the integrals (11) are related by the equality 


(12) ete ttaze(1—¢) = eft e—a2p) (2-1) a,p(z—t)+a,p ; 


e 


Since R(aip) 2 R(a2p) on the are 7, and z is an interior point of the interval 
(0, 1), the first exponential factor on the right is bounded for large values 
of |p|. By means of (12) and the relation 


e%20(z—t tarp ewe (zt) 


Ale) ——s LAs JAi(o) 
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the portion of the integrals (11) over the ares az to a,x/a2 and ar to x can be 
written in the following form: 


ls a,z/a, a,e(s—t) Az {ae dp 
(a) ane [ E(z, p) e ( Ea hi (2 ) * dt ’ 
ye(z—t A; , dp 
ow > sf [ tener [4lio%a, 
om l a,z/a, a,p(z—t) A, +{ Qe dp 
© fl” [ewenne [Bay ea, 


(d) ff E(p) "= fi Bat, 
Te Jaz Jym p 


where E(x, p) = Aj'(p) e'*?~*”) “*-») and E(p) = Aj'(p). For these integrals 
to converge uniformly to zero, it is necessary that along the arc of the “z-circle” 
x to mxr/a2, R[—azp(t — z)] S 0. To show that this is true, let w be the argu- 
ment of t so that (¢ — x) = z(e* — 1). If R[—asp(t — z)] is to be less than or 
equal to zero, the argument » = arg (e“ — 1), considered as the angle of rota- 
tion applied to the vector p in the complex plane, must be such that the vec- 
tor p remains in the half-plane where R(a29) = 0. Since the vector p is in the 
sector S;, bounded by the axis of reals and the ray R(a2p) = 0, it must not be 
rotated positively through an angle greater than } + + arga,. (See Figs. 1 
and 2.) Hence 7 is restricted by the inequality, 


(14) OsSnS}3ur+argam. 


The vector e“ — 1 is a chord of the unit circle subtending the angle w at the 
origin. From the triangle thus formed, it is evident that 7 = 37 + }w. Asw 
varies from 0 to arg (a:/a2) = 2 arg a, » varies from 3x to 37 + arg a. 
Therefore the conclusion is drawn that the condition (14) is satisfied when ¢ 
ranges over the are x to a2/ae. 

The fact that the integrals (13) converge uniformly to zero depends on the 
following lemma. Here E will denote a bounded function of p or of z and p, 
while ¢’ and t’’ will represent two points on the “z-circle” such that 0 < argt’ < 
arg t’’ < 2 arg a. 

LemmalIl. If y(t) is integrable on the arc of the “‘x-circle’’ for whichO S argt S 
2 arg am, the integral 


“” 
(15) I= J i Been yy) Lato 
t’ Yn 


(13) 


uniformly asn — «. 

Proof. With p on the are yn, —azp lies on the are of the circle y, in the 
second quadrant bounded by the positive axis of imaginaries and the ray 
through —a:. If ¢ is defined ef the relation \ = —aep = iR,e'*, then ¢ must 
satisfy the inequality 0 < ¢ < 3 — arg q in the )-plane. 














BOUNDARY PROBLEMS OF HIGHLY IRREGULAR TYPE 163 


This change of variable, together with the fact that ¢ = ze”, 0 S w S 2 arg am, 
makes it possible to write the integral (15) in the form 


” 
arg t’’ 2 are % 
I= — i E ei&nz(cos ¢+i sin ¢) [(cos w—1)+i sin w) Vilw)z ew dedw , 
0 


arg t’ 


The real part of the product 


i (cos ¢ + isin ¢) [(cos w — 1) + Zsin o 


— {sin ¢g (cos w — 1) + sin w cos ¢} . 
By means of the trigonometric relations 
cosw — 1 = — 2sin? dw, sin w = 2 sin 4w cos 4w 
this expression reduces to 
(16) —2sin 4 cos (}w + ¢). 


The relations 


(i) snjw2>jw, OS fw S}r7, 
(ii) cos (}w + ¢) = sin [34 — (40 + ¢)] 
2 sin [37 — (arg a + ¢)] 
=3(3r-—arga—y), $n 2}r —-arga—¢20 


are then used to show that the quantity (16) is less than or equal to the value 
— hw (4e — arga — ¢). 
It follows that 
arg" 


}x—arga, 
| Z| < M, [ et Rnzo(hr— arg a,—¢) | vw) | dedw 


argi’ 





argt”’ 
< M, [ 4 {1 me thes eQe—arge,)} | vi(w) | dw . 
aret’ R,zrw 
Therefore 


1 2arga, bn 
IT) s Made fo Lvale) | do + ["lvale) | de} 0, 


if 5, tends to zero, so that R,5, ~ © asn— o., 


10. Analysis of the restrictions on f,(t). Finally, the integrals of (11) exactly 
cancel along the straight line path 0 to az if the function fi(¢) is properly re- 
stricted. This condition on f,(¢) is of importance, since it determines a class 
of functions for which the expansion (6) is possible. 
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It will be observed that the integrals under consideration will cancel over 
the path 0 to a,x when the equality 


{etieetese p”[ Ag] ae erPttaie 621 44} si( ) 
i 
_ {eMsPzteeP 521 45] + ett tare 521 Ai] f(t) 


is satisfied. In other notation, this equation reads 
fu(e)Dy — wala) DISi(2t) = u(e)De + ler Hi(O, 
or further, 
W our) {ur(x) Worle) — we(x) Wor(ur) } fi(e ) 


= Wio(ue) {ui(z)Walus) — ue(r)Wa(ur)} $1. 
This equation will be an identity only when 


(17) Wel) fi ) = Wr(ue) fi (t) . 


Since f(t) is analytic within the unit circle, and fi(0) = 0, this function can 
be expressed as a power series, 


fi) = at + ce + cs + ---. 


For the System I, the condition (17) after integration reduces to 


(18) h (« ) = (2) 500, 


and upon comparison of the power series for the two members of this equality 
it is seen that equation (18) is an identity if fi(é) is a function of the type 


(19) fi) = OD cresat", 
r=0 
where the integer k is determined by the relation (a2/a;)* = 1. That such an 
integer k exists follows from the fact that a; and a2 are both commensurable 
with 7. 
On the other hand, for the System II, the condition (17) becomes 


= 
a“ 
£18 
“ene” 
I| 


= "40, 
a) 


which is satisfied if 


(20) fi =t > Creqil”™ . 


r=0 
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From the above discussion it is apparent that the actual nature of the func- 
tions in (19) and (20) depends upon the value of a, that is, upon the argument c 
in the differential equation. If we return to the definition of f,(x), it is also 
clear that a class of functions f(z) which are expansible in a uniformly con- 
vergent series of characteristic functions has been determined for each of the 
Systems I and II, namely: for System I, 


f(z) = Ci + aCz + 2° ,(2"), 
and for System II, 
f(x) = Ci + x2(x"). 


11. Sufficient conditions for uniform convergence. The conditions just 
established are sufficient to insure uniform convergence of an expansion of 
a function f(x) in a series of characteristic values for the systems discussed. 
These conditions are summarized by the following theorems. 

THEOREM I. For the System I, sufficient conditions for the expansibility of a 
function f(x) in a uniformly convergent series of characteristic functions on the 
open interval (0, 1) are 

(i) that the function f(x) be integrable and of bounded variation on 0 < x < 1, 
(ii) that the function f(x) be analytic within the unit circle in the complex plane, 
(iii) that the function f(x) be of the structure 


Ci ~ xrC2 4 x? h(x"), 


where k is determined as the smallest integer for which (a2/a,)* = 1. 

THEOREM II. For the System II, sufficient conditions for the expansibility of a 
function f(x) in a uniformly convergent series of characteristic functions on the 
open interval (0, 1) are 

(i) that the function f(x) be integrable and of bounded variation on 0 < x < 1, 
(ii) that the function f(x) be analytic within the unit circle in the complex plane, 
(iii) that the function f(x) be of the structure 


Ci + 2%,(2"), 
where k is determined as the smallest integer for which (a2/a;)* = 1. 


UNIVERSITY OF WISCONSIN AT MILWAUKEE. 











A PARTICULAR SEQUENCE OF STEP FUNCTIONS 
By Netson DunrorpD 


1. Introduction. If a sequence of real functions f,(¢) summable on (0, 1) 
converges in measure to f(t) and if 


(1) lim / f,(t) dt exists 
5 


n=o 


for every measurable subset 6 of (0, 1), then f(é) is summable and 


lim / f.(t) dt = i f(t) dt 
n=o Jé é 


uniformly with respect! to 6. But (1) may hold for every interval 6 in (0, 1) 
and the conclusion in the weaker form 


lim f(t) dt = i f(t) dt almost everywhere 
0 0 


may not be true even if it is assumed that f(t) is summable and that 


(2) fn(t) = f(t), (except on a set whose measure approaches zero with 1/n). 


The present note is concerned with the behavior of [ f,(t) dt under the 
0 


assumption (2) and we assume without loss of generality that f(t) = 0. The 
result is that there exists a sequence of positive step functions f,(t) satisfying (2) 
with f(t) = 0 such that for every summable function g(t) except for those in a cer- 


tain set of the first category the sequence i f.(t)g(t) dt is everywhere dense in the 
0 


space of measurable functions. This is embodied in Theorem 2. The principle 
(which is Theorem 1) underlying the construction of the sequence f,(¢) is a gen- 
eralization of an abstraction of an argument used by J. Marcinkiewicz* to show 
the existence of a continuous function g(t) which depends only upon a given 
sequence of positive numbers h, — 0 such that an arbitrary measurable func- 


Received January 19, 1936. 


1 This can be proved by combining results found in the following references. Saks, 
Addition to the note on some functionals, Trans. Amer. Math. Soc., vol. 35 (1933), p. 969; 
Jeffery, The integrability of a sequence of functions, Trans. Amer. Math. Soc., vol. 33 (1931), 
p. 435, B; and Dunford, Integration in general analysis, Trans. Amer. Math. Soc., vol. 37 
(1935), p. 447, Theorem 9. 

2 Sur les nombres dérivés, Fund. Math., vol. 24 (1935), pp. 305-308. 
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tion may be approached almost everywhere by an appropriately chosen sub- 
sequence of [g(t + h.) — o(t)]/hn. 


2. The fundamental principle underlying the construction. 

THEOREM 1. Let X and Y be metric spaces and {y,} dense in Y. Let T,x 
be a sequence of continuous functions with domain X and range in Y satisfying 
the following condition. For each m there is an unbounded sequence of integers n; 
such that the set of x which satisfy the equation lim T,;2 = Ym is dense in X. Then 
for every x in X, except for a set of the first category, the sequence T,,x is dense in Y. 

Let X, be the set for which T,,z is dense in Y. Then zx ¢ X;, if and only if 
for every pair of integers m, n there is a gq = n such that (7,2, yn) < 1/n. 
Thus X — Xi = }} Xun, where X,,, is the set of x for which (7,2, ym) = 1/n, 


q2n. Since T,z is continuous, the set X,,,, is closed and the hypothesis on the 
sequence Tx shows that X,,,, is non-dense. Hence X — X; is of the first cat- 
egory in X. 

Marcinkiewicz* constructed his example by taking X as the space of con- 
tinuous functions, Y as the space of measurable functions, {y,,} the set of 
polynomials with rational coefficients and T,x2 = [x(t + h,) — x(t)]/h,. It 
is obvious that for each m there is an xz such that T,2—> y». Also in the e-neigh- 
borhood of an arbitrary z there is an x» which is constructed like the Cantor 
function on each of a finite number of intervals (intervals upon which the 
oscillation of x is < €) such that T,,2.—+0. Thus the set of x for which T,, 7 — y,, 
is everywhere dense in X. Since X is not of the first category, the existence 
of the continuous function g(t) described in the introduction follows from 
Theorem 1. 


3. The construction of the sequence f,(/). A sequence of partitions of (0, 1) 
is defined by 
5 mi = (0 = t Ss 1/2"), 8 mi = [(i = 1)/2" <i Ss 1/2"), (i = 2, 3, ae a"*). 
A function f(t) will be said to belong to the class X,, in case there is a positive 
number » < 1/2”~' such that 
f(0) = 0, f(t) = 0 for [i — 1)/2™"' <t Ss (i — 1)/2™" + Ol, 
(i = a. 2, re 27), 


and f(t) is constant on the remaining part of 6n¢ (¢ = 1, 2,---, 2"). 


Lemma 1. For every integer m the set >> X, is dense in L [the space of real 


ra—m™ 


Junctions summable on (0, 1)]. 


3 This paragraph of exposition is inserted since the reader (without a critical examina- 
tion of Marcinkiewiez’ paper) may see no connection between Theorem 1 and the function 
g(t) described in the introduction. 
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Let f(t) be an arbitrary summable function and « > 0. Then there is an 
integer m’ = m and a function g(t) which is constant on each of the intervals 
bmi, 2 = 1,2, --- , 2”, such that 


S-g\|| = [ | f(t) — g(t) | dt < «/2. 


There is a positive number 6 < 1 such that / | g(t) | dt < «/2 if m(e) < 46. 
Thus if » = 6/2”’-' and 
h(0) = 0, A(t) = Ofor (¢ — 1)/2"""' < t S (i — 1)/2"" 479, 
(7 - 1, alti a, 
h(t) = g(t) elsewhere on (0, 1), 
then A(t) belongs to X,,, and ||f—h|!| S$ || f—g|! + |g —Ah| < «, which 
completes the proof. 
Define the step functions s,,,(¢) = n2”~—' for t in the intervals 
5(i, m,n) = [i — 1)/2"-' <t S ( — 1)/2™* 4+ 1/(n2™")], (= 1,---, 2"), 
Smn(t) = 0 elsewhere on (0, 1). 


Lemma 2. For each m the set of functions f(t) in L for which 
tim [ Salt) f(t) dt = 0 O<x<1) 
n 0 


is everywhere dense in L. 


Fix f() in >> X,,; then for some m’ = m and some n > 0, f(t) = 0 on the in- 


n=m 


tervals y; (j = 1, --- , 2”) where 
n=(Osten), w= (GG —1)/2"1 <t s (Gj — 1/2" + Ol. 


Now Smn,(t) = 0 except on the intervals 5(z, m, n). Since m’ = m, the integer 
j = 1+ (@ — 1)2”-™ is no greater than 2”’—', and for this 7 the interval y; 
contains 6(7, m, n), provided n is so large that 1/(n2"™-') < ». Thus for n 
sufficiently large, s».(t)f(t)} = 0 for0 < ¢ S$ 1. The conclusion follows from 
Lemma 1. 

The set of functions each of which vanishes at the origin and is a rational 
constant on the rest of 5,:,7 = 1, --- , 2", for some m = 1, 2, --- form a de- 
numerable set everywhere dense in S, the space* of measurable functions. Let 
this set be ordered in any manner into the sequence F,(z). 


4 See Banach, Théorie des Opérations Linéaires, Warsaw, 1932, p. 9. 
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Lemma 3. For every integer p there is an m, and a function f(t) in L such that 
lim [ Smpn(t) f(Odt = F(x) (0Os221). 
n 0 


Suppose F(x) = a; for rin 6,:,7 = 1,---,2"™-. Define m, = m, f(t) = 
Q; ON bmi, f(t) = ai — aj_1 On bn:, 7 = 2,3, ---, 2". Then for z in 5,, 


i Smn(t) f(t)dt = a, + ag — a, + +--+ + Gin — Ai 
0 


+ (a; — aj-s) | Smn(t)dt , 
8 mil 0 2) 


n 


and since lim I Smn(t)dt = 1, we have 
5 mi! 0,z 


—_ I ” san(t) {dt = F(x) O<2r<1). 


n 


Lemma 4. For every integer p the set of functions f(t) in L for which 
tim f Smpn(t) f()dt = F(x) (0s 221) 
n 0 


is everywhere dense in L. (my, is the integer of Lemma 3.) 
This is an immediate corollary of Lemmas 2 and 3. 
Now arrange the functions s,,,(¢) in a triangular array 


S11, S12, $13, ++ - 
See, S23, eee 
S33) -** 5 


and define the sequence fi = su, fo = S12, fs = S22, fs = Sis, etc. The sequence 
f(t) then has the property that f,() = 0 except on a set whose measure 
approaches zero with 1/n. Placing 


T.9 = [ * fall) gat , 


it is seen that T,g is a continuous function on L to S. In Theorem 1 take 
X = L,Y =S,y,=F,. From Lemma 4 it is seen that for each p there is an 
unbounded sequence of integers n; such that lim T,,g = F, for every g in a set 
everywhere dense in L. .Thus Theorem 1 gives 

THEOREM 2. There is a sequence of positive step functions f,(t) such that 
f.(t) = 0 except for a set whose measure approaches zero with 1/n and such that 
for every g(t) in L except for those in a set of the first category the sequence 
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[ Sn(t)g(t)dt has the following property. For every measurable function F(x) 
0 


there is a subsequence such that 
im f Sat) g(dt = F(x) almost uniformly . 
i 0 


Since, for p > 1, L, is of the first category in L, one might ask if the excep- 
tional set contains L,. This is not the case, for a reference to the argument 
shows that the same theorem holds if L is replaced by L, (the sequence f,(t) 
remaining fixed). The following corollaries are considerably weaker than 
Theorem 2 itself. 

Corotiary 1. If f is in L and F is in S, there is a sequence f, in L such that 


lim f,(t) = f() , lim [ f.(t)dt = F(x) almost uniformly , 


and another sequence g, in L such that 


n n 0 n 0 


almost everywhere. 
CoroLuary 2. For f in L there is a set S; dense in S such that for each F 
in S, there is a sequence f, in L such that 


lim f,(t) = f( , lim [ : fr(t)dt = F(x) everywhere . 


Coro.uary 3. For f in L and F in S there is a sequence of absolutely con- 
tinuous functions F,, such that 


F(x) — F(z), F!(t) +f) almost uniformly . 
Brown UNIVERSITY. 


5 Corollary 2 is a corollary of Lemma 4. 





THE PROBABILITY LIMIT THEOREM 
By Artuur H. Core_anp 


The probability limit theorem is concerned with the asymptotic behavior of 
certain sequences of integrals, but this should not be all. It is intended to 
throw light upon the nature of physical measurements. Whether or not 
measurements do behave in the manner described, is, of course, not a mathe- 
matical question; but whether or not the assumption of such behavior implies 
inconsistency is a mathematical question. As I have pointed out in a previous 
paper, we can answer such questions of consistency by studying the behavior 
of certain infinite matrices.'_ These matrices are called variates.? It is possible 
to analyse any proof of the probability limit theorem in terms of the matrix 
theory of probability, but it is preferable to start with an entirely new proof 
which is based directly upon the properties of variates.* 

We shall give a brief description of those properties of variates which will 
be used in this paper. A variate z is an infinite sequence of numbers, thus 


= ) ) ) k) 
z= 2, 7®, 7®, ..., xf a 


where x“ is an arbitrary number. A variate is called a constant, or parame- 
ter, when all of its terms are the same. Thus a = a, a, a, --- is a constant or 


parameter. The average of the first n terms of a variate x is denoted by p,(z), 


that is, p,.(z) = >> x /n. We shall let p(x) = lim p,(xz). Then p(z) is called 


k=1 noe 


the expected value of z, or the first moment of z. 


Let 2, Z2, ---, 2, be n variates and f(s:, 82, --- , 8n) be afunction of n vari- 
ables. Then f(x:, 22, ---, 2) is a variate defined as follows: 


f(x, Toy 5 wins » Zp) = fiz”, 2’, sd a » et), fe, =". Atarn , a), re 


For example, z* is a variate such that p(z*) is the second moment of x. Further, 
we shall mention two properties of the operator p( ). First, this operator is 
additive, i.e., 


p(t, + t2 + --- + 2m) = p(ar) + p(t2) + --- + pan). 


Received by the Editors of the Annals of Mathematics, February 11, 1935; accepted 
by them, and later transferred to this journal; presented to the American Mathematical 
Society, September 11, 1931. 


1 See the author’s article, reference (d), at the end of this paper. 

2? A variate is essentially the same as a Kollektiv. See reference (a). 

3 For further literature on the subject of the probability limit theorem, see Khintchine, 
reference (b). That paper includes an extensive bibliography. 
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Second, if a is any constant, p(a) = a and p(ax) = a-p(x). Thus 2’ = 
x — p(x) is a variate such that p(x’) = 0. 

Let ¢,(s) denote the fundamental function of the interval J: a < y < b. 
Then ¢,(s) is equal to 1, if s lies in J, and is equal to 0 otherwise. The variate 
¢,(x) can be interpreted as an event which succeeds on its k-th trial if and only if 


the k-th term of x lies in J, suecess being denoted by 1 and failure by 0. Then 
> ¢(z) is equal to the number of 1’s in the first n terms of ¢g,(x). Thus 
k=1 


Pnlg,(x)| is the success ratio and pl[y,(x)] is the probability of the event ¢,(z). 
Let I, and I! denote respectively the intervals — « <y S$ sand—« <y<s, 
and let ply:,(x)] = F(s + 0), plex (x)] = F(s — 0), and F(s) = [F(s + 0) + 
F(s — 0)]/2. Then F(s) is called the distribution function of the variate x. It 
will be observed that F(s + 0), F(s — 0), F(s + 0) — F(s — 0) are, respectively, 
the probabilities that a term of x will be S s, < s, = s. 

We shall define the dependence and independence of variates in terms of the 


fundamental function. A set 2, 22, ---, 2n,--- (finite or infinite) is said to 
be a set of dependent variates if there exists a finite subset 2,,, Zn.) --- , Zn, and 
a corresponding set of intervals I), Is, --- , 7, such that 


Pl¢r,(an,) -91,(In,) ee ¢1,(2n,)] # Ple¢r,(rn,)] - pler,(2n,)] ‘es Pler,(arn,)] . 


Variates which are not dependent are said to be independent. It will be ob- 
served that the independence of the variates of a given set implies the existence 
of their distribution functions. 

We shall consider an infinite set of independent variates 2;, 22, --- , Xx, --- 
such that p(z,.) = 0. Let X, = (m1 + 22 +--- + 2,)/B,, where B, = 
b? + b3 + --. +b? and b? = p(x). The probability limit theorem, which we 
shall now state, is concerned with the distribution function of the variate X,. 

TuHeoreM. If the variates x, 2, --- , Le, --- are independent and if, given 
any positive number «, there exist two numbers a and N such that p[ys\>a(rx) -27] 
< bi-« for every k, and such that b7/B? < « for every k less than or equal to n 
whenever n = N, then 


lim plen(Xo)] = Fee | “e-Pat 

There is no essential restriction involved in the condition p(x.) = 0, since 
if this condition does not hold, we can set r, = 2, — p(zx) and apply the the- 
orem to the variates x;, «3, --- . 

We shall give an outline of the method of proof of the limit theorem. Equa- 
tions (1) and (2), which are derived in this outline, will be used in the formal 
proof. The proof is accomplished by the aid of the characteristic function. If 
x is a variate and ¢ is a parameter, the characteristic function of xz is the ex- 
pression p(e). The characteristic function of a variate always exists when- 
ever the distribution function exists.‘ The variate X, consists of a sum of 


*See (d), pp. 545-547. 
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variates and e'*»' consists of a product of variates. Since these variates are 
independent, the expected value of their product is equal to the product of their 
expected values. Hence 


(1) p(e*n') = II plgeeles). 


By means of this equation, it will be proved (Lemma 1) that lim p(e*»') = 


e“2, Next we must obtain a method of computing the distribution function 
in terms of the characteristic function (Lemma 2). We have the equation 


1 ifzg<s 

+2 ut _ pitt, p—ist ’ 
| — €  dt=41/2ifz=s, 
me ime 0 ifz>s. 


If z is a variate, the integral is a variate such that there is a probability f(s — 0) 
that one of its terms will be equal to 1 and a probability F(s + 0) — F(s — 0) 
that one of its terms will be equal to 1/2. Hence 


(2) »| at (SS al = re — 0 4 EEO FEO) ry, 


2ri J_. it 





We shall prove that we can interchange the order of the operation of p(_ ) 
with that of integration. When the operator p(_) is applied to the integrand, 
it affects only the expression e‘*', since the remaining portions of the integrand 
depend on parameters and not on xz. Thus® 


1 + pit 





a p(e™") en ist _ 
ori il i dt = F(s). 


Finally, (Lemma 3) 
1 +2 5it __ o—t*/2, p—iat 1 ee 
= —— Se ef oe, 
Oni J. i te, 5 Oy 
We shall now turn to the formal] proof of the three lemmas and the theorem. 
Lemma 1. Under the hypotheses of the limit theorem 
| p(e*™!) — e#?| < e-#2 6, 


where lim e, = 0. 


Let e# = 1+ 2 — [1 + RW)/2. Then R@) = 2l + tt — e*)/f — 1. 
The function R(é) is analytic and bounded for all real values of ¢. Furthermore, 
R(O) = Oand R(+~) = —1. Thus 
bit? 
2.B? 





p(ei#t/Bn) — 1 — (1+ rat), 


5 Paul Lévy uses a slightly different integral for this purpose. See (c). 
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where 
bit? zit? ret 
2B =P Ee AZ 
Moreover 
bit? E (=) | 
-|Tak| S my fennel 
2.B, Iran SP 2.B? 2B, 
242 242 
azjt (2) | |Z (=) ] 
= . R —_— *Pigi-a —, | R\{ — “fi sl<a P 
| J OB, P\s\= (ax) +p 2.B? OB, Pi si< (zx) 
By hypothesis, given a positive number 4, there exists a number a such that 
242 242 
zt =) | 6b;.t 
- | Rl —]] -@eiz0 < ‘ 
of (= Pleize (@) | < Ppa 








There exists a number m such that | R(st/B,) | < 6/4 if | st| /B, < m, that is, if 
|s| S m-B,/|t|. Leth be an arbitrary positive number. There exists a num- 
ber N such that m-B,/|t| 2aif|t|<handn=WN. Therefore 


2,42 2,2 
xyt (=) ] 6bit 
- | Rl — }| -@isiea < 
»| 2 B | op,) | Pi'< (x) 


4B?’ 
and hence | r,,, | < 6 whenever |t| < handn = N. We have the equation 
bit? 
2.B? 











log p(eizkt/Bn) => {1 + Pnkl , 


where 





242 242 242 2 
HE a [gra Jae 


Hence by choosing n sufficiently large, all of the numbers | p,,,| such that k < n 
can be made arbitrarily small. From equation (1) it follows that log p(e*") = 
— @(1 + p,)/2 or ple'Xnt) = e-#+en!2, where | p, | is less than or equal to the 
largest of the numbers | p,,,|. Therefore | p(e*»") — e-“/?| < e-*?. ¢,-@, where 
lim «, = 0. 


Lemma 2. If x is a variate whose distribution function F(s) exists, and if 


+h ott _ izt) , p—ist = 
F(s, h) = sf. [ pe oe Rend = 1/+/i, fn 





| F(s) — F(s,h) | < 2e + [F(s + ©&) — F(s — )]/2. 


The function (e* — e‘.e-**)/t is bounded in the entire u, t-plane and is con- 
tinuous in both variables. Thus as ¢ becomes infinite, p,|(e“ — e'-e-**)/t] con- 
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+h eit _ eizt . e—ist 


verges uniformly for | ¢ | < h, and* hence (5 i 
2ri —h t 
From equation (2) it follows that F(s) — F(s, h) = p(v), where 


it _ pizt g—ist 
~_—s / eacnianpeemeneil 
2mt J\ejzr t 


1 [sat ag ees sin t ay 
h 


t us Jas lh t 


it) = F(s, h). 


and where sgn (x — s) = 1, 0, —1 according as zr >, =, < s. 


” sin t 


We have the inequalities (a) —dt| < l/h < ¢€ if 1 < h, (b) 


1 


a dota ceils sl 260 [ mt a| 1/2 for all 
W Ji z—s\h |a—el-h 


values of z,s and h. Thus» isa variate such that | v® | < 2eif|2® —s| Ze 
and | v® | < 1/2 otherwise. The probability that |r — s| < « is less than 
or equal to F(s + «) — F(s — «). Therefore 


| F(s) — F(s, h)| < 2e + [F(s + ¢) — F(s — 8)]/2. 


1 +h pit _ ¢—t*/2 gist 
etl Rl. AIRE 
dt, &(s, h) = =~ Af dt 








Lemma 3. If 6(s) = se [ 


and ¢ = 1/~/h, then | ®(s) — ®(s, h) | < 3¢. 
Let u be a variate which has the distribution function #(s). Then’ 


+e e- 272 +00 
ple) = : ei d&(s) = —— im et eR dg = — [ e~ @-i"2 dg . 
ce Te = V2 


Since there are no discontinuities of the function e~*-*” (except at infinity) in 
the region bounded by the real axis and the line s = it, it follows that 





ple) =e“? . (4 0) = et”, 


Hence by Lemma 2, | &(s) — #(s, h) | < 2e + [(s + ©) — ®(s — 6)]/2. Since 
(s) satisfies the Lipschitz condition #(s + «) — #(s) < «, we get 


| (s) — O(s, h) | < 3e. 





1 +e ett — giXnt gist 
Let (8) = plen(X.)] and 4,(s, b) = 5% [ ste at. From 


Lemma 1 we conclude that there exists a number N such that 
(3) | @,(s, h) — &(s, h) | < e whenevern = N. 


From Lemmas 2 and 3 we obtain the inequalities 


® See (d), Theorem 3. 
7 See (d), Theorem 2. 
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| &,(s) — @,(s, h) | < 2e + [#,(s + €) — #,(s — 6)]/2, 
| b(s, h) — @(s) | < Be. 
Combining (3) and (4) we get 
(5) | @,(s) — &(s) | < 6e + [4,(s + ©) — O(s — &)]/2. 


(4) 


This inequality can be replaced by 
&,(s — e) — 6e — [®,(s + €) — ,(s)]/2 < ®(s) 


(6) < ,(s + ©) + Ge + [®,(s) — &,(s — 6)]/2. 


Replacing s by s + 2¢ we see that 
®,(s) < @(s) + 8e whenever ®,(s + €) — ®,(s) 
= [#,(s + 3e) — ®,(s + 2e)]/2. 


We shall consider separately the cases ®,(s + ©€) — ®,(s) > 2€ and 
,(s + «€-) — ©,(s) S 2e. Let us assume that ®,(s) 2 O(s) + 8e and 
#,(s + «-) — ®,(s) > 2e. Then ,(s + 2c) = ®,(s + €) > P(s) + 10e > 
&(s + 2e) + 8, and by condition (7), &,(s + 3€) — ®n(s + 2e) > 4e > 2e. 


By induction ®,(s + 2ke) > ®(s) + 8e + 2ke, or lim ®,(s + 2ke) = + @. 
Since this is impossible, we have 

(8) ,(s — ©) S ®,(s) < ®(s) + 8e whenever ®,(s + ¢«) — ®,(s) > 2e. 
But it follows from (6) that 

(9) ©,(s — €) < &(s) + 7e whenever ®,(s + «) — ®,(s) S 2e. 


Replacing s by s + ¢ in (8) and (9) we get ®,(s) < (s) + Qe, and similarly, 
&(s) — 9e < &,(s). Therefore lim ,(s) = (s) . 

This proof of the probability limit theorem is based on assumptions which 
are expressed in terms of the properties of independent variates. Hence there 
is no inconsistency in the assumption that physical measurements behave in 
the manner described by the theorem. 
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APPLICABILITY WITH PRESERVATION OF BOTH CURVATURES 
By W. C. GRAUSTEIN 


1. Introduction. The determination of conditions necessary and sufficient 
that there exist a surface applicable to a given surface with preservation of both 
the total and mean curvatures constitutes a problem of classical differential 
geometry which has received no little attention.'. In this paper, various new 
conditions, all in invariantive form, are found. The map of a surface satisfy- 
ing these conditions on a surface applicable to it in the manner described is 
studied in some detail and is shown to have many interesting geometrical 
properties. 

The treatment is by means of the invariant methods recently exploited by the 
author.2 These methods are particularly advantageous in the present problem, 
in that they naturally disclose facts which otherwise might remain undiscovered 
or prove complicated to establish. 


2. Necessary and sufficient conditions. Let there be given a surface® 
S:2; = z(u,v), i = 1, 2,3, and assume that there exists a surface 8:2; = Z;(u, v), 
i = 1, 2,3, which is applicable to S so that both curvatures are preserved. Then 
any surface §* which is symmetric to S is also applicable to S with preservation 
of both curvatures. Inasmuch as the sign of the mean curvature of a surface 
depends on the orientation of the directed normal, it follows that the normals to 
S and S* must be so directed that corresponding directions of rotation about 
corresponding points have, with reference to these directed normals, opposite 
senses. Hence, for just one of the surfaces 5, 5*, the map of the surface on S 
has the property that corresponding directions of rotation about corresponding 
points are the same. Without loss of generality we may assume that this is the 
surface §; the surface 5* we then exclude completely, since it is readily obtain- 
able from 8. In other words, we assume that the normals of two surfaces which 
are applicable to one another with preservation of both curvatures are so 
directed that (without invalidating the equality of the mean curvatures) corre- 
sponding directions of rotation about corresponding points, referred to these 
directed normals, have the same sense. 


Received January 16, 1936. 

1 For references to the literature see Graustein, Applicability with preservation of both 
curvatures, Bull. Amer. Math. Soc., vol. 30 (1924), pp. 19-23. This paper will be referred 
to later as ‘‘Paper A’’. 

2 Méthodes invariantes dans la géométrie infinitésimale des surfaces, Mémoires de |’ Aca- 
démie Royale de Belgique (Classe des Sciences), (2), vol. 11 (1929); Invariant methods in 
classical differential geometry, Bull. Amer. Math. Soc., vol. 36 (1930), pp. 489-521. These 
papers will be referred to respectively as ‘‘B.M.”’ and “I.M”’’. 

5 It is assumed that all functions are real, single-valued, and analytic in a certain do- 
main of the real variables wu, v. 
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From the fact that the expressions for the total and mean curvatures of a 
surface in terms of the principal normal curvatures are symmetric in the latter 
curvatures we draw the following conclusion. 

Lemma. A necessary and sufficient condition that two surfaces be applicable with 
preservation of both curvatures is that they be applicable (a) so that the principal 
normal curvatures are preserved, or (b) so that in the neighborhoods of corresponding 
points they are congruent through infinitesimals of the second order. 

We return to our given surfaces S and 8. Let 1/n, 1/r2 be the common 
principal normal curvatures and denote the corresponding families of lines of 
curvature, individually and collectively, by Ci, C2, in the case of S, and by Ci, C2, 
in the case of S. Let C, C’ symbolize the two families of curves on S corre- 
sponding respectively to the families of lines of curvature C,, C2 on S and let C, C’ 
represent the two families of curves on S which correspond respectively to the 
families C,, C2 of lines of curvature on S. 

Let P be an arbitrary point of S and direct the curves Ci, C2, C, C’ through 
P (a) so that the direction of rotation from the directed curve C, to the directed 
curve C; and that from the directed curve C to the directed curve C’ are both 
positive, and (b) so that, for convenience, the smallest non-negative directed 
angle a from the curves Ci, C2 to the curves C, C’, respectively, is less than 
ri0Sa<r. 

To the curves C, C’, C,, C2 through the point P of S corresponding to the point 
P of S we give the directions which correspond, by the map of S on S, to the 
directions assigned to C,, C2, C, C’. Then the direction of rotation from the 
directed curve C, to the directed curve Cz and that from the directed curve C to 
the directed curve C’ are both positive, and the rotation about P through the 
angle —a carries the positive directions of the curves C,, C2 into those of the 
curves C, C’, respectively. 

It follows, from this last fact, that, if 1/7, 1/7’ are the normal curvatures, and 


1/7, 1/7’ (= —1/7) are the geodesic torsions of the curves C, C’, respectively, 
we have 

1 cos’a , sin’a 1 sin’a = cos’a 2 + 

SE ae a oo oe — + —, - = {— — —)sin 2a 

r Ty rT, r r; To T T2 1 


In terms of the quantities 


1 1 1 1 
Ra-4-, toe-=-, 

rt ry 1 Ye 
cota. 62,2. 

a FOF 


these equations take the more convenient forms 


(1) K’ = K’, L = L cos 2a, = — Lsin 2a. 


vin 
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The Codazzi equations for S, expressed in terms of the directional derivatives 
8/8s,, 8/82 in the positive directions of the lines of curvature C,, C2, may be 
written* 

PS Adi - r 
0K’ 2 L aL ak 2 L aL 


2 - — 
(2) as; pe as,’ OSe pi as.’ 





where 1/p:, 1/p2 are respectively the geodesic curvatures of the directed curves 
Ci, Co. 

Since geodesic curvature and directional differentiation are preserved by the 
map of S on S, the Codazzi equations of S, referred to the orthogonal system of 
curves’ C, C’, are reducible to the forms 





aK’ aL 27 2(?) 22_9 

ds, 98 pe 382 \F “a 
@) K’ Lb 3 2 22 

é * -215+2(?) 22 _o. 

O82 08 pi 08; \7 p27 


When we substitute for K’, L, 1/7 in (3) their values from (1), and solve the 
resulting equations for da/@s;, da/ds2, we obtain, by virtue of (2), the equations 





0z dz 
(4) a, Mt - 4%, a e+, 
where 
z= cota 
andé 
_ 106K’ @logl , 2 _ _1 0K’ adloghL 2 
(5) M = L Os) ns 08; » po” N “i L O82 ‘a O82 pi 


Equations (4) constitute necessary conditions on the surface S and the angle a 
that there exist a surface S applicable to S with preservation of both curvatures. 
These conditions are also sufficient. For, if equations (4) are fulfilled for S by a 
specific a, the values of K’, L, 2/7 obtained from (1) for this @ satisfy (3), a 
surface S with the desired properties is uniquely determined to within its position 
in space, and the angle a has the proper interpretation’ on both S and S. 

Employing the relation 


4 See B.M., p. 39. 

5 See B.M., p. 53; I.M., p. 508. 

6 The assumption that L = 0 is easily justified; a sphere evidently admits no surface 
applicable to it with preservation of both curvatures. 

7 See B.M., pp. 53, 54. 
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where V/Vs:, V/Vse are the modified directional derivatives* in the positive 
directions of the lines of curvature on S, we obtain, as the condition of com- 
patibility of equations (4), 


(6) Pz — Q =0, 
where 
VM VN VM VN 
a P= — —- —, = — — — M?— N°. 
(7a) V8e Vsi Q Vsi + VS82 


Substituting for M, N in (7a) properly chosen values from (5) and making use 
of the following expressions for the differential parameters® Ai, Ax and the 
total curvature” K, 


ae. (= y V de V ag 
A = _— __ A = eo Cl —_ 1—— 
” as, . = 751 O51 9 Vs2 O82’ 


-£()-£(). 
Vs2 \p1 V8i \pe2 


we find, as alternative values of P, Q, 


(7b) p=2/© (1) + r (4), Q = AslogL — 2K — aK’. 


Vs: \pi Vs2 \pe2 


From the first of these formulas it follows that the vanishing of P is a necessary 
and sufficient condition that S be an isometric surface." Hence we may pass to 
the following conclusions. 

TueoreM 1. There are ~' surfaces S applicable to a given surface S with pres- 
ervation of both curvatures if S is an isometric surface and 


LA: log L = 2KL* + AiK’. 


There is a unique surface S if S is not isometric and z = Q/P satisfies equations (4). 
Otherwise, no surface S exists.” 


3. Surfaces admitting ~' applicable surfaces. If S is an isometric surface, 
P = 0 and, by (7a), VM/Vsz = VN/Vs:. Hence if ds; = Aedu + Bedv and 
dsz = A,du + B,dv are the differentials of arc of the directed lines of curvature 
C,, C2 on S, Mds, + Ndsz is the exact differential of a function.“ Writing 


5 See B.M., pp. 57, 78; I.M., p. 500. 

* See B.M., pp. 60, 61; I.M., pp. 517, 518. 

10 See B.M., p. 58; I.M., p. 511. 

11 See B.M., pp. 55, 59; I.M., p. 513. 

12 For the corresponding theorem in non-invariant form, see Paper A, p. 23. 
13 See B.M., p. 58; I.M., p. 500. 
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this function in the form — log ¢, and using the second values of M and N given 
in (5), we find that 


(8) g= . eS, where S = / (“ _ =) : 
L p2 Pl 
Furthermore, since M = — @ log ¢/ds,, N = — @ log ¢/@82, we have from (7a) 


the fact that 


1 
= —-Asg. 
e 


Thus we have shown that S admits ~! surfaces applicable to it with preserva- 
tion of both curvatures if and only if it is an isometric surface and Aw = 0. 
According to (5), M = 0 and N = 0, that is, g is constant, when and only when S 
is of constant mean curvature. But, by (2), every surface for which K’ is 
constant is isometric. Hence our result may be stated as follows. 

THEOREM 2. A necessary and sufficient condition that there exist ~' surfaces 
applicable to the surface S so that both curvatures are preserved is that S be a surface 
of constant mean curvature or an isometric surface of variable mean curvature on 
which the curves ¢ = const., where ¢ is given by (8), form an isometric family with 
yg as an isometric parameter. 

The angle a. When the values of M and N in terms of ¢ are substituted in (4), 
these equations become 


A(yz) _ ap (yz) dg 
as; as,’ A8e as; 








Hence if y is a function to which ¢ is “conjugate’’,"* the general solution of (4) is 


(9) on CHE 
() 


’ 


where k is an arbitrary constant. 

If there is a constant angle a which satisfies (4), M = 0, N = 0 and S has 
constant mean curvature. Conversely, if K’ is constant, ¢ is constant, y is 
constant, and hence, by (9), a is constant. 

TuHeoreM 3. The angle a from the lines of curvature on S to the curves on S 
corresponding to the lines of curvature on 8 is constant only when the surfaces have 
constant mean curvature. 

It follows that only in this case do the lines of curvature on S correspond to the 
lines of curvature on a surface S. If the surface S for which a = a is denoted by 
S., 0 < a < 7, this surface is 5,/2 and the lines of curvature C,, C2 on it corre- 
spond respectively to the lines of curvature C2, C, on S) = S. More generally, 


4 The functions y and ¢ are actually conjugate functions of isometric parameters for 
the lines of curvature on S, as is readily proved by referring S to these parameters. See 
also Paper A, p. 21. 
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the lines of curvature on every pair of surfaces §,, Sais2,0 S a < 2/2, corre- 
spond.® 

The counterpart of Theorem 3 in the general case is the following. 

TueoreM 4. If S is a surface which admits ~' surfaces applicable to it with 
preservation of both curvatures and four of these surfaces are chosen, a specific cross- 
ratio of the four directions at a point P on S, which correspond, at the image points of 
P on the four surfaces, to the four principal directions associated with a chosen prin- 
cipal normal curvature, is the same at every point P on S. 

The result follows directly from the fact that equation (9) defines a linear 
transformation of k into z. In particular, if we take as the four surfaces S, = 
S, So, S:, S:, where S; is the surface S for which k = k, and denote the four corre- 
sponding directions at P by d,,, do, di, dx, we find that the cross-ratio (d,, do, di dx) 
has the value k and thus obtain a geometric interpretation of k as the projective 
coérdinate of the direction d,; referred to d,,, do, d; as basic directions. 


4. Properties of the map. Primary and secondary orthogonal systems. It 
follows from the lemma of §2 that two families of curves lying respectively on 
two surfaces which are applicable with preservation of both curvatures and 
making on these surfaces the same directed angle with the lines of curvature 
associated with a specific principal normal curvature have the same normal 
curvature and geodesic torsion. For the normal curvature and geodesic torsion 
are uniquely determined by the principal normal curvatures and the directed 
angle in question. 

We recall also from §2 that if a family of curves € on S has the slope angle @ 
with respect to the directed lines of curvature C,, the corresponding family of 
curves € on S has the slope angle @ — a with respect to the directed lines of 
curvature C;. Hence if 1/r, 1/t and 1/f, 1/t are respectively the normal curva- 
ture and geodesic torsion of the curves € and €, we have 


Rs LK? + L cos 26), = 1 T, sin 26, 
r 2 2 


(10) 


| ol 


= 51K’ + Leos 2(0 — a], = 5 L sin 200 — a). 


a 


If 1/ = 1/r and 1/t = 1/t, it follows that a = 0. In other words: 

TuHeoreM 5. If two corresponding families of curves lying respectively on two 
surfaces which are mapped isometrically with preservation of both curvatures have 
the same normal curvature and geodesic torsion, the surfaces are congruent. 

If 1/r = 1/é and a 0, it follows that 6 = a/2 or 06 = a/2 + 2/2 (mod z). 
Furthermore, 1/t for 6 = a/2 is equal to 1/f for 6 = a/2 + x/2, and 1/t for 
6 = a/2 + 2/2 is equal to 1/t for 6 = a/2. Hence we have the proposition. 


18 If K’ = 0, a closed continuum of surfaces Sq consists of a family of associate minimal 
surfaces and the surfaces Sa, Sai2/2 are reflections of one another in the point of symmetry 
of the family. According to (2), this is the only case in which the map of S on S reduces 
to a reflection. 
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TuHeorEM 6. If two surfaces are applicable so that both curvatures are preserved, 
there exist just two families of curves on each surface which are mapped with preser- 
vation of their normal curvatures. The two families form an orthogonal system, 
namely, the system consisting of the curves which bisect the angles between the lines 
of curvature associated with a chosen principal normal curvature and the curves 
corresponding to the lines of curvature on the other surface associated with the same 
principal normal curvature. The geodesic torsion of each family is equal to that of 
the family on the other surface to which it does not correspond; in other words, the 
geodesic torsions of the two families are interchanged by the map. 

If 1/t = 1/t and a ¥ 0, it follows from (10) that 6 = a/2 — +/4 or 6 = a/2 + 
7/4 (mod x). Moreover, 1/r for @ = a/2 — 7/4 is equal to 1/f for @ = a/2 + 
x/4 and 1/r for 6 = a/2 + 7/4 is equal to 1/¢ for @ = a/2 — x/4. 

THEOREM 7. If two surfaces are applicable so that both curvatures are preserved, 
there exist just two families of curves on each surface which are mapped with preser- 
vation of their geodesic torsions. The two families form an orthogonal system, 
namely, the system consisting of the curves which bisect the angles between the two 
families of curves which are mapped so that their normal curvatures are preserved. 
The normal curvature of each family is equal to that of the family on the other 
surface to which it does not correspond; in other words, the normal curvatures of 
the two families are interchanged by the map. 

The orthogonal systems of Theorems 6 and 7 we shall call respectively the 
primary and secondary orthogonal systems of the map of the surfaces on one 
another. On each surface the curves of either system bisect the angles between 
the curves of the other system. The map preserves the normal curvatures and 
interchanges the geodesic torsions of the primary system, and preserves the 
geodesic torsions and interchanges the normal curvatures of the secondary 
system. 

TuHeoreEM 8. Two surfaces are applicable with preservation of both curvatures, 
if and only if they are applicable so that there exist corresponding orthogonal systems 
whose normal curvatures are preserved and whose geodesic torsions are interchanged, 
or if and only if they are applicable so that corresponding orthogonal systems exist 
whose normal curvatures are interchanged and geodesic torsions preserved. 

The conditions have already been proved necessary. That they are sufficient 
follows immediately from the expressions"® 

1 1 


(11) K=i-_ XK 


ll 
ol — 
| — 
~ 


for the total and mean curvatures of a surface in terms of the normal curvatures 
1/r, 1/r’ and geodesic torsions 1/t, 1/t’ = —1/t of the two families of curves of 
an orthogonal system. 

Formulas (11) suggest the possibility of corresponding orthogonal systems on 
S and S whose normal curvatures and geodesic torsions are both interchanged. 
From equations (10) for @ = @ and @ = @ + 7/2 it is found that only when 


16 See B.M., p. 49; I.M., p. 516. 
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a = 1/2 do orthogonal systems with this property exist, and that then every 
pair of corresponding orthogonal systems has the property. 

THEOREM 9. In the case of two surfaces of the same constant mean curvature 
which are applicable with preservation of the lines of curvature, the normal curvatures 
and geodesic torsions of each pair of corresponding orthogonal systems are inter- 
changed. This is the only case in which there exist corresponding orthogonal systems 
on S and S whose normal curvatures and geodesic torsions are both interchanged. 

The first part of the theorem seems to deny the existence of the primary and 
secondary orthogonal systems on the surfaces in question. The paradox is, 
however, readily explained. The secondary orthogonal systems consist of the 
lines of curvatures and their torsions are all zero and so are both interchanged 
and preserved. Similarly, the normal curvatures of the primary orthogonal 
systems, which consist of the bisectors of the angles between the lines of curva- 
ture, are all equal and hence are both preserved and interchanged. 

Oblique systems. We now choose a second pair of corresponding families of 
curves, consisting of the curves €’ on S, with the slope-angle 6’, normal curva- 
ture 1/r’, and geodesic torsion 1/t’, and the curves @’ on S, with slope-angle 
6’ — a, normal curvature 1/f’, and geodesic torsion 1/t’, and consider the corre- 
sponding systems of curves ©, ©’ and G, C’. 

The total and mean curvatures of S and S are expressible in terms of the 
quantities pertaining to either of these two systems of curves. Expressions for 
them in terms of 1/r, 1/r’, 1/t, 1/t’, and w = 6’ — @ are” 





1 1 1 1 2 
ind a: SR. aint os Kk’ a — Soe eae 
(12) sin?wK = — iar sin? w K = + “3 COS w, 
where 1/p has either of the values 
1 cosw sinw 1 cosw . sinw 
(13) “in. ain eh y- Y +r) 


and those in terms of 1/7, 1/t’, 1/p’, and @ = w are entirely analogous. 

If the two systems of curves:are orthogonal (wo = + 7/2), the equal values of 
1/p in (13) reduce to + 1/t and + 1/t’, and (12) becomes (11). Thus 1/p 
assumes the rdéle for an arbitrary system which is played by the geodesic torsion 
for an orthogonal system. 

This fact suggests that we seek a generalization of Theorem 7 by demanding 
that 1/p = 1/p,a #0. From equations (10), (13) and the corresponding equa- 
tions for €’, @’, we find that 


— 


5 


[K’ cos w + L cos (6’ + @)], “te 51K" cos w + Leos (6 + 6 — 2a)). 


2 
Thus if 1/p = 1/p and a # 0, it follows that 6’ + @ = a@ (mod =) and hence that 
6 = a/2 — w/2, 0’ = a/2 + w/2 (mod x). It is readily shown, then, that 
1/r = 1/t’ and 1/r’ = 1/f. 

THeorEM 10. If two surfaces are applicable so that both curvatures are pre- 


ci 


17 See B.M., pp. 74, 75. 
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served, there exist an infinity of systems of curves on each surface, depending on an 
arbitrary function of two variables, every one of which is mapped so that the 
quantity 1/p is preserved. The curves of the primary orthogonal system of the map 
bisect the angles between those of each of the infinity of systems, so that these 
systems may be said to form an involution whose double system is the primary 
orthogonal system and whose orthogonal system is the secondary orthogonal 
system. The normal curvatures of each of these systems are interchanged by the 
map. 

The common value of 1/p and 1/$ for the corresponding systems whose curves 
meet under the angle w is evidently 


(14) 5 =F = 5 (K’ c08w + Leos a). 

5. Conjugate systems and asymptotic lines."* It follows from the opening 
statement of §4 that the angles between the two families of asymptotic lines on 
S are equal to those between the two families of asymptotic lines on S, and 
that the geodesic torsions of the two families on S are equal to those of the two 
families on S. We recall also that the geodesic torsions of the two families on 
either surface are negatives of one another. 

If the asymptotic lines on the two surfaces correspond, they coincide on each 
surface with the curves of the primary orthogonal systems, by Theorem 6, and 
hence both surfaces must be minimal. Moreover, since the asymptotic lines 
correspond, so do also the lines of curvature and therefore a = 7/2. Finally, 
by Theorem 6, the geodesic torsions of the two families of asymptotic lines on 
either surface are interchanged by the map and hence are actually preserved 
except for signs. 

The same results follow directly from (14). For the asymptotic lines on the 
two surfaces correspond if and only if conjugate systems always correspond, and 
a system of curves on a surface is conjugate if and only if the associated quantity 
1/p vanishes. But, by (14), 1/p = 1/6 = 0 for every value of w when and only 
when K’ = O anda = 7/2. 

Suppose now that there is just one family of asymptotic lines on each surface 
which corresponds to a family of asymptotic lines on the other. By Theorem 6, 
the two families belong, respectively, to the primary orthogonal systems on the 
two surfaces, and the geodesic torsion of the one is the negative of that of the 
other. Since the slope angles of the families of the primary orthogonal systems 
on S are a/2 and a/2 + 7/2, it follows that the angles between the asymptotic 
lines on S, and hence on S, are a, r — a. 

It is readily shown that if 1/r and 1/r’ are the common normal curvatures of 
the primary orthogonal system, 


18 We exclude developable surfaces in this paragraph. 
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Hence the case under discussion is characterized by the relation 
K” — L’ cos’ a = 0, K’' # 0. 


If neither family of asymptotic lines on the one surface corresponds to a similar 
family on the other, there is a unique conjugate system on each surface which 
corresponds to a conjugate system on the other. This conjugate system is one 
of the systems described in Theorem 10, namely, that one, according to (14), 
whose angle w is determined by the equation" 


K’' cosw + Leosa = 0. 


THEOREM 11. If two surfaces are applicable with preservation of both curva- 
tures, there exists, in general, at least one conjugate system on each surface which 
corresponds to a conjugate system on the other. Corresponding conjugate systems 
belong respectively to the involutions on the two surfaces described in Theorem 10 
and hence their normal curvatures, if existent, are interchanged by the map. In the 
exceptional case, there exists a single family of asymptotic lines on each surface 
which corresponds to a family of asymptotic lines on the other. These two families 
are double families in the two involutions in question and their torsions are negatives 
of one another. 

Inasmuch as two corresponding systems are conjugate systems if and only if 
they belong respectively to the involutions on the two surfaces described in 
Theorem 10, and also to the involutions whose double systems consist of the 
asymptotic lines, we may also draw the following conclusion. 

Coro.iary. Unique corresponding conjugate systems consist of conjugate- 
imaginary families of curves if and only if the two surfaces are of negative curvature 
and the families of asymptotic lines on either separate the constituent families of the 
primary orthogonal system. 

THEOREM 12. A necessary and sufficient condition that two non-minimal sur- 
faces be applicable with preservation of both curvatures is that they be applicable 
either (a) so that there exist corresponding conjugate systems which are mapped with 
interchange of their normal curvatures or (b) so that there exist unique corresponding 
families of asymptotic lines whose geodesic torsions are negatives of one another, 
provided the smallest positive directed angles from these asymptotic lines to those of 
the second families are supplementary. 

The necessity of the condition has already been established. The sufficiency 
follows from (12) in the case of the corresponding conjugate systems and, in the 
remaining case, from the expressions” 


2 
K= ~ Pp? K = —|_ cot yd 


19The case of minimal surfaces is excluded. The corresponding conjugate systems 
consist in this case of the isotropic curves. 
20 See B.M., p. 85. 
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for the total and mean curvature of a surface in terms of the geodesic torsion of a 
family of asymptotic lines and the directed angle y from these asymptotic lines 
to those of the second family. 


6. Map referred to fundamental orthogonal systems. Let the surface 
S: 2 = x(u, v) be applicable to the surface 8: @ = Z(u, v) with preservation of 
both curvatures. Suppose that S is referred to an orthogonal system of directed 
curves C, C’, where the direction of rotation from the directed curve C through an 
arbitrary point to the directed curve C’ is positive, and let the canonical differen- 
tial equations” of the curves C and C’ be respectively 


(15) Adu + Bdv = 0, A'du + B’dv = 0. 
The three fundamental forms of S are” 
ds? + ds”, 
(16) Lag — 2 aety + Las", 
r rt r 


i a = 
(4 +4) ae —? dsds + (5+ 4) ar, 
where ds = A’du + B’dv, ds’ = Adu + Badv, and 1/r, 1/r’ and 1/r, 1/r’( = —1/r) 
are respectively the normal curvatures and geodesic torsions of the curves C, C’; 
and the Codazzi equations of S are* 


aK’ aL 2 a/2 22 
+oO- 5% + (7) +2 2-0, 





p 
(17) 
dK’ _ ob _ 2 3 (2) 226 
as asp’ as’ \r — 


where @/ds, 8/ds’ denote directional differentiation in the positive directions of 
the directed curves C, C’ respectively, 1/p, 1/p’ are the geodesic curvatures of 
these directed curves, and 


- 1 1 1 
ore ° ees 


Suppose now that the orthogonal system of curves C, C’ is the primary system 
on S with reference to the map of Son S. Then if K’, L, 1/7 have the same 
meanings for the curves C, C’ of the primary orthogonal system on 5 as have 
K’', L, 1/7 for the curves C, C’ on S, we have 


uw Cen, Fax 
T 


21 See B.M., p. 48; I.M., p. 501. 
22 See B.M., p. 51; I.M., p. 507. 
23 See B.M., p. 53; I.M., p. 508. 
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and the Codazzi equations of S, referred to the curves C, C’, become 





0K’ , ab _ 2 2 (?) 22 _9 

as’ ' as’ p  as\r)/ pir” 
(18) 

oR’ ak _2,,_2(2) 422 

ds ds p ds’ \r pT 


From equations (17) and (18) we find that 


0/1 21 0/1 21 
_ (+5529 «we)-22 0 


Conversely, from equations (19), in conjunction with (17), follow equations (18). 

Equations (19) constitute conditions necessary and sufficient that | 1/7 | be a 
common integrating factor™ of the canonical differential equations (15) of the 
families of curves* C, C’. 

THEOREM 13. There exists a surface applicable to a given surface S with preser- 
vation of both curvatures if and only if there are two mutually orthogonal families of 
curves on S whose canonical differential equations have as a common integrating 
factor the square root of the absolute value of the geodesic torsion of either of the 
families. 

Corotuary. There are ~' surfaces applicable to S as prescribed when and only 
when S is an isometric surface containing an orthogonal system of the type described; 
S then contains ~' such orthogonal systems. 

The corollary follows from the fact that, if an isometric surface admits one 
surface applicable to it as required, it admits infinitely many, corresponding to 
1! values of the parameter a@ in §2. 

We have remarked that equations (19) are equivalent to the relations 


|1/r|* (Adu + Bdv) = du, |1/r|*(A’du + B’dv) = dw, 


where wu, »; are functions of u,v. Thus the condition of Theorem 13 is identical 
with the demand that there exist on S an (isometric) orthogonal system with 
reference to which the linear element can be written in the form 


|r| (dui + dvi). 


Suppose now that the curves C, C’ on S and the corresponding curves C, C’ on 
S constitute the secondary orthogonal systems of the map of Son 8S. Then 


K' = K’, L=-L, 


24 The quantity 1/7 is never zero, since the primary orthogonal system never consists 
of the lines of curvature; see §4. 
2% See B.M., p. 54; I.M., p. 513. 
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and the Codazzi equations for 5S, referred to the curves C, C’, become 





=-S *1+ 2(2) 27 =0, 
ds ds p Os \r pT 
- K L.2 2 22 
aK’ 0 7) 
tat, +3(2) - 22-0. 
Equations (17) and (20) are obviously equivalent to equations (17) and 
aL 2 aL 2 


But equations (21) characterize | L |! as a common integrating factor® of the 
canonical differential equations (15). Thus we conclude: 

THEOREM 14. There exists a surface applicable to a given surface S with preser- 
vation of both curvatures if and only if there are two mutually orthogonal families of 
curves on S whose canonical differential equations have as a common integrating 
factor the square root of the absolute value of the difference of the normal curvatures 
of the two families. 

Corotuary. There are ©! surfaces applicable to S as prescribed when and only 
when S is an isometric surface containing an orthogonal system of the type described; 
S then contains ~' such orthogonal systems. 

The condition of Theorem 14 is equivalent to demanding that there exist on S 
an (isometric) orthogonal system with reference to which the linear element can 
be written in the form 

1 
| L| 

THEOREM 15. If a surface S admits a surface applicable to it by preservation of 
both curvatures, the finite equations of the families of curves constituting the primary 
and secondary orthogonal systems of the map on S can be found by quadratures. 

The theorem follows immediately from the foregoing considerations when one 
realizes that in both cases the canonical differential equations (15) can be found 
by algebraic processes.”” 

THEOREM 16. The spherical representations of the primary orthogonal systems 
on two surfaces which are applicable with preservation of both curvatures are mapped 
on one another with preservation of arc lengths. 

This proposition furnishes a characterization of the primary orthogonal 
systems of the map except when the two surfaces are minimal. It follows from 
the third of the fundamental forms (16). 

By means of the second of these fundamental forms and the relation K = 
1/(rr’) — 1/7’, the following fact is readily proved. 


(du? + dv). 


26 J, is never zero, since the secondary orthogonal system never consists of the curves 
bisecting the angles between the lines of curvature. 
27 See I.M., p. 516. 














190 W. C. GRAUSTEIN 


THEOREM 17. The asymptotic lines on each surface correspond to a conjugate 
system on the other if and only if the total curvature of either surface is equal to twice 
the product of the normal curvatures of the primary orthogonal system. 


7. Surfaces of constant mean curvature. 

TuHeoreM 18. [f there exist on a surface two mutually orthogonal families of 
curves whose canonical differential equations have both of the common integrating 
factors |1/r|' and | L|*, the surface has constant mean curvature and the two families 
cut the lines of curvature under constant angles. 

For, if equations (19) and (21) both hold, K’ = const., by (17), and 1/7 and L 
have a constant ratio. The directed angle 8 from the lines of curvature C, 
(see $2) to the curves C is then constant, by virtue of the relation® 


(22) 3 = L tan 28. 


TueEoreM 19. The canonical differential equations of each two mutually orthog- 
onal families of curves on a surface of constant mean curvature which cut the lines 
of curvature under constant angles have both | 1/r |* and | L |* as common integrating 
factors, provided merely that neither quantity vanishes.” 

Since K’ = const., the Codazzi equations (17) become 


& 3, 2 (?) 5 


as’ op as \r pr P 
aL 2 a (2 22 
at} at-* 


From these equations and equation (22), in which 8 is now constant, equations 
(19) and (21) are readily deduced and thus the theorem is proved. 

In view of the results of §6, Theorems 18, 19 tell us that a surface S admits two 
surfaces applicable to it with preservation of both curvatures so that a prescribed 
orthogonal system on it is the primary orthogonal system of the one map and the 
secondary orthogonal system of the other if and only if S is of constant mean 
curvature and the given orthogonal system makes constant angles with the lines 
of curvature. 

With reference to an orthogonal system O of the type described we may define 
four transformations of a surface S of constant mean curvature which carry S 
into surfaces isometrically mapped on S with preservation of both curvatures, 
namely, the transformations: 


T;, for which O is the primary orthogonal system on S; 
T2, for which O is the secondary orthogonal system on S; 
T;, which preserves the lines of curvature; 

T», the identity. 


28 See B.M., p. 54; I.M., p. 516. 
29 For the lines of curvature, 1/r = 0, and for the curves bisecting the angles between the 
lines of curvature, L = 0. 
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These four transformations, applied to S and O, give rise to four surfaces, S;, Se, 
S;, So = S, and four orthogonal systems, 0,, O02, 03, Oo = O, lying respectively on 
these surfaces. 

According to Theorems 6, 7, 9, the four transformations have the following 
effects on the normal curvatures, 1/r, 1/r’, and geodesic torsions, 1/7, 1/r’, of 
the orthogonal system O: 


T, preserves 1/r, 1/r’ and interchanges 1/7, 1/r’; 

T: interchanges 1/r, 1/r’ and preserves 1/7, 1/7’; 
T; interchanges 1/r, 1/r’ and interchanges 1/r, 1/r’; 
T» preserves 1/r, 1/r’ and preserves 1/r, 1/7’. 


It is evident that, with respect to these properties, the four transformations 
form a group. In particular, if i, 7, k are the numbers 1, 2, 3 in any order, 
T;T; = T;, and hence the surface S; and the orthogonal system O; are carried 
into the surface S; and the orthogonal system O; by the transformation T',. 
We may, then, draw the following conclusions. 

TuHeEorEM 20. The set of four surfaces and associated orthogonal systems S;, O; 
is closed with respect to the group of transformations T ;,i = 0,1, 2,3. Application 
of one of the transformations, other than To, to the four surfaces and systems inter- 
changes them in pairs, and application of all four transformations to one surface and 
system yields all four. 

The surfaces S,, S2, S; belong to the family of surfaces § which are applicable 
to S = S, with preservation of both curvatures. If, as in §3, we denote the 
surface § for which a = a by §,,0 < a < x, and denote by 0,0 < 6 < 1/2, the 
smallest non-negative directed angle by which the orthogonal system O on S, is 
advanced over the lines of curvature, then So, S,, S2, Ss are respectively the 
surfaces S, Sa, Seei+/2, S,j2. As 0 varies from 0 to 7/2, the two variable surfaces 
of the set each trace the entire family of surfaces S, just once. 

As the comment on Theorem 9 suggests, there are two special positions of O for 
which the surfaces S;, and the transformations T ;, coincide in pairs. When O 
consists of the lines of curvature on So, then @ = 0, so that S; = So, Sz = S; and 
T; = To, Tz = T3; and when O consists of the bisectors of the angles between the 
lines of curvature, then @ = 2/4, Se = So, S; = Ss;and T; = To, T; = 73. 


HarvarpD UNIVERSITY. 











THE IDEAL WARING THEOREM FOR TWELFTH POWERS 
By L. E. Dickson 
1. Ideal. Let g denote the greatest integer < (3/2)". Let 
(1) P = g2" -1, I=9+2*-2. 


Evidently P < 3". Consider all the ways in which P can be a sum of positive 
integral n-th powers, necessarily 1" or 2". There will occur more than J sum- 
mands except when there are exactly g — 1 terms 2” and exactly 2" — 1 terms 1. 
Hence P is a sum of J, but not fewer, n-th powers. 

When n = 2, 3, 4,g = 2, 3, 5, J = 4,9, 19. Lagrange and Euler proved 
that every positive integer is a sum of four squares. In 1770 Waring conjec- 
tured that “every positive integer is a sum of 9 cubes, also of 19 fourth powers, i 
etc.” The proof for 9 cubes was first obtained by Wieferich.! For fourth 
powers, the best result yet proved is that 35 suffice, while all integers < 10” are 
sums of 19. All tables extant confirm the following amplification of Waring’s 
conjecture: Every positive integer is a sum of I = g + 2" — 2 integral n-th powers 
20. This will be called the ideal Waring theorem. Proof has been published 
only for n = 2and n = 3. 


2. Summary for twelfth powers. We here prove the ideal Waring theorem 
for n = 12, viz., 

TuHEorEM 1. Every positive integer is a sum of I = 4223 integral twelfth powers. 

But we go further and prove 

THEOREM 2. Every integer > 2-3" is a sum of 2405 twelfth powers. Every 
one > 3-3" is a sum of 1560 powers. Every one > 2-5" + 7% + 8" is a sum 
of 440. 

There are only two earlier results.2 From Kempner’s identity, we find that 
6 1/4 billion twelfth powers suffice. By a short table, the writer proved also that 
10711 suffice. The present table gives decompositions of 6 908 733 consecutive 
integers into twelfth powers. 


3. Minimum decompositions. We employ 
a = 2" = 4096, b = 3" = 531 441, ec = 4" = 16777 216, 
(1) d= 5" = 244140625, f = 6" = 2176 782 336, 
g = 7" = 13 841 287 201, h = 8" = 68 719 476 736, 


Received February 7, 1936. 

1 Math. Annalen, vol. 66 (1909), pp. 99-101. The proof was corrected and greatly sim- 
plified by Dickson, Trans. Amer. Math. Soc., vol. 30 (1927), pp. 1-7. 

2 Dickson, Bull. Amer. Math. Soc., vol. 39 (1933), p. 713. 
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(2) b = 129a + 3057 , ce = 31b + 73a + 3537, 
(3) d = 14c + 17b + 55a — 176, 
(4) f = 8d + 13c + 10b + 58a + 1550, 
(5) g = 6f + 3d + 2c + 27b + 65a + 1731, 
(6) h = 49 + 6f + d + 2c + 30b — a — 275. 
By a resolution of n we mean one of the ways of expressing n as a linear fune- 
tion of a, b, c, --- , whose coefficients are integers. In case all the coefficients 


are 2 0, the resolution is a decomposition, and their sum is called the weight w. 
Thus, (2), (4), (5) are decompositions, while (3) and (6) are not, but are resolu- 
tions. A decomposition of n is minimal if there is no other decomposition of n 
of smaller weight. 

For s = 2d + g + h, we seek a minimum decomposition of every integer 
between s and s + d. Such an integer can be expressed in the form r + N, 
where 0 S r < a, and 


(7) N=Aa+Bb+Cco+s, OS AS129, OF BS31, 05CS14. 


Assign any integral values to A, B, C within their limits. All homogeneous 
resolutions, involving’ g + A but not d, of r+ N (r = 0, --- , 4095) are evi- 
dently obtained by adding N to all resolutions of 0, --- , 4095 of the form 
L — 2d, where L is linear and homogeneous in a, b, c. One of the latter resolu- 
tions is obtained from the double of (3), 


(8) 110a + 34b + 28¢ — 2d = 352. 


Evidently all such resolutions of 0, --- , 4095 are found by adding to (8) all 
homogeneous resolutions into a, b, c of — 352, --- , 3743. The latter are obvi- 
ously found by repeated additions of 


130a — b = 1039, 74a + 31b — c = 559, 


and (3) with terms similarly transposed, together with the negatives of these 
four. 

Actually we desire homogeneous decompositions and not resolutions. Hence 
in what precedes we retain only those resolutions of 0, --- , 4095 which involve 
no one of — 130a, — 32b, — lL5e. 


4. Leaders. A decomposition is not minimal if it is the sum of the left 
member of one of the following equations and any function whose coefficients 
are all = 0. 


3 For those involving d + g + h, but not s, use Z — d. For those involving 2d + g, 
but not h, use L — h; ete. 











194 L. E. DICKSON 
63b+ 18a = 79+ 2c, 172b+ d = 614 a+ 20c, 
3ld+ 68c+ lla = 11+ 4b+ 4f, 58d+ 3la = 25+ 32b+4 18c+ g, 
2f+ 17c+ 104a = 12+ b+ 19d, 3f+ 28d+ 28c+ 10b = 37+ 18a+4 g, 
5f+ 71ld+ 3b = 8+ lla+ 32c+ 29, 7f+ 19c+ 99a = 924 12b4 7d+ g, 
29+ 55d+ 15c+ 21a = 3+ 6b+ 19f, 
49+ 79d+ 36¢ = 554 9a+ 12b+ 3f+h, 
h+ 34b+ 90a = 44+ 9c + 37d+ g, 
h + 56d+ 24b4+ 4a = 144 18c+ 25f+ 2g, 
h+ 23c+ 111b = 394 a+ 54d+ 13f+ 2g. 


By convention a like result holds for the following equations whose two 
members are of equal weight: 


130c = 1200 +a+8b+f, 15d + 40c + 38b + 42a = 133 + 2f, 
15d+ 42c-+ 24a = 544 25b4+ 2f,  45d+ 73a = 424 67b4 4c+ 5f, 


2f+ 88b = 254+ 6a+ 44c4+ 15d,  3f+ 46c+ 42b = 124 4904 30d, 


4f+ 117a = 55+ 22b+ 9c+ 35d, 6f+ 3b+ 93a = 1+ 5le+ 50d, 


10f + 85c + a = b + 95d. 


5. List of equations. We employ equations P = r (0 <= r S 4095) in 
which the coefficients of f and c are 2 0, those of g and h are = —1, while the 
coefficient of dis 2 —2 and that of bis 2 —3. In fact, we shall use (7) only 
when C = 0, B S 3, and we then require that all coefficients of N + P be 2 0, 
whence N + P is a decomposition of r. Since we retain only minimal decom- 
positions, we have discarded N + P if it be the sum of a leader and a function 
whose coefficients are all 2 0. Further, N + P were found to be not minimal 
(and discarded) during the construction of the table in §6. For reasons ex- 
plained in §10 we have abridged a longer list of such equations. In the list, 
w is the weight of Pin P = r. When the coefficients of f, d, c are omitted, they 
are the same as in the line above. 
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— h-—gplus | — h + 0g plus (continued) 

No. f dic b a r w No. f dic b a r w 
1 31 61 10 38 -8 754 130 46 30 12 29 0 -—53 3020 17 
2 32 51 27 —-1 26 2577 133 | 47 31 243 56 -58 346 73 
3 32 52 12 13 45 2960 152 48 31 2 44 24 -2 826 98 
4 14 -85 1921 23 | 49 31 3 28 70 -—39 729 92 
5 33 43 13 21 —88 3647 20 | 50 31 3 29 38 17 1209 117 
6 34 34 13 59 -—18 1836 120 | Sl 39 -113 170 —12 
3 34 35 O 10 —I17 2140 60 | 52 31 3 30 6 73 1689 142 
8 35 23 45 6 -—3 3276 104 | 38 7 -—57 650 13 
9 35 25 15 34 35 4042 142 | & 31 414 538 -94 553 7 
10 35 —95 3003 13 55 31 4 15 21 -—38 1033 32 
11 35 25 16 3 —39 3483 38 | 56 31 5 O 35 —19 1416 51 
12 35 26 1 17 —20 3866 57 | 57 31 S & 3 37 1896 76 
13 36 15 31 27 12 1289 119 58 4 -93 857 -—53 
14 36 16 16 41 31 1672 138 

15 36 16 17 10 —43 1113 34 | —h+q plus 
as = 2. Ss Se So) ee ee ee ee 

17 37 6 31 66 —47 2535 91 | 
18 37 6 33 3 -—65 2456 12 59 18 63 18 25 -—6 3225 118 
19 37 7 16 80 —28 2918 110 60 20 45 20 39 —13 2581 i111 
20 37 717 48 28 3398 135 | 61 20 46 6 22 —68 2405 26 
21 49 —102 2359 6 62 21 33 #66 4 -—73 3158 51 
22 37 7 18 17 —46 2839 31 63 21 36 21 46 -17 211 107 
23 37 8 2 63 -—83 2742 25 | 64 21 37 6 60 2 594 126 
24 37 8 3 31 —27 3222 SO | 65 22 2 7 67 —1 2320 123 
25 37 8 4 0 —101 2663 —54 66 22 28 8 36 —75 1761 19 
26 38 -—2 19 24 -—50 469 27 | 67 23 1448 4 —99 2131 25 
27 38 —1 4 38 -—31 852 46 | 68 23 16 5 O -5 3377 88 
28 38 -1 5 6 25 1332 71 69 23 #417 37 «+78 —98 2800 57 
29 7-105 293 -—58 | 70 23 #17 39'~«44 14 3760 107 

71 2 #18 2 60 —23 3663 101 

— h + 0g plus 72 2% 19 10 11 ~—22 3967 41 

Mm f @€¢ ba ff wi® 2 SSR 8-2 2 
74 24 9 24 67 -—27 1298 97 

30 2 58 6 74 —55 2564 107 75 24 9 2 35 29 1773 122 
31 264648 2406€«38 «684 7711384 | 76 4 9 26 4 -45 124 #18 
32 27 39 24 «442 —25 2017 106 | 77 2 #10 9 8l -—8 1676 116 
33 27 39 2 10 31 2497 131 78 24 #10 10 5O —82 1117 12 
34 27 40 10 2 -80 1841 21 | 79 2 10 11 «#18 —26 1597 37 
35 28 30 2 49 —28 3743 103 | 90 2 -1 41 2 7 3116 100 
36 28 31 11 32 -—83 3567 18 81 25 0 2 74 —30 3019 94 
37 28 31 12 0 -—27 4047 43 82 25 0 27 #%411 —48 2940 15 
38 29 21 27 «+24 24 1853 124 83 25 1 12 25 -—29 3323 34 
39 25 —106 814 —5 
40 29 22 11 70 —13 1756 118 — h + 2g plus 
41 30 11 41 81 —110 2236 52 ™ 
eo npné 6am oi FF Ft 
43 30 11 43 #17 2 3196 102 84 10 77 28 1 —10 1680 107 
44 30 12 28 31 21 3579 = 121 85 12 58 45 1 -—36 653 81 
45 32 —109 2540 -8 86 12 60 15 29 2 1419 119 
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— h+ 2g plus (continued) 





— h + 3g plus (continued) 














No. f dec b a Tr Ww No f die b a r Ww 
87 12 61 1 12 —53 1243 34 | 132 9 —92 3849 —12 
88 13 49 46 8 —39 2379 78 | 133 9 30 15 54 0 695 110 
89 13 51 16 36 —1 3145 116 | 134 9 30 16 23 -74 136 6 
90 13 51 17 5 —7% 2586 12 | 135 9 31 1 387 -55 519 2 
91 13 52 1 5O 18 3528 135 | 136 9 31 2 5 1 999 50 
92 13 52 2 19 —56 2969 31 | 137 10 17 7% 36 —5 1448 135 
93 14 43 2 57 14 1158 181 | 138 10 17 7% 5 —79 889 31 
94 144 43 3 2 —60 599 27 | 139 10 18 61 19 —60 1272 5O 
95 15 32 34 4 £29 2598 115 | 140 10 19 45 65 —97 1175 44 
96 15 33 19 18 48 2981 134 | 141 10 19 47 1 15 2135 94 
97 19 —82 1942 5 | 142 10 20 31 47 —22 2038 88 
98 15 34 4 33 -—63 2325 24 | 143 10 20 32 15 34 2518 113 
99 1 34 5 1 —7 2805 49 | 144 16 —96 1479 —16 
100 16 20 79 1 —87 2695 30 | 145 10 21 17 29 53 2901 132 
101 16 23 34 43 -—30 3844 87 | 146 30 —77 1862 3 
102 16 23 35 11 2 228 111 | 147 10 22 2 44 —58 2245 22 
103 16 24 19 57 —12 131 105 | 148 10 22 3 12 -2 272% 47 
104 146 2 4 71 7 514 124 | 149 11 8 77 12 —82 2615 28 
105 146 2 6 8 —11 435 45 | 150 11 9 62 26 -—63 2998 47 
106 17 13 51 4 3 1571 89 | 151 11 11 33 23 —99 3205 —19 
107 17 15 22 1 —33 1778 28 | 152 11 12 18 37 —80 3588 0 
108 17 16 5 78 4 2240 121 | 153 11 12 19 5 —24 4068 25 
109 17 16 7 15 —14 2161 42 | 154 11 13 3 51 —61 3971 19 
110 18 1 99% 1-113 1668 4 | 155 ll 13 4 19 -6 355 43 
111 18 3 67 -—3 —19 2914 67 | 156 12 -1 78 19 —86 245 24 
112 8 4 52 Il 0 3297 86 | 157 12 O 64 1 —11 1108 68 
113 18 5 37 2 19 3680 105 | 158 122 #149 15 8 1491 87 
114 26 —111 2641 —24 | 159 16 —122 452 —42 
115 18 6 21 71 —18 3583 99 | 160 12 3 19 44 —84 1218 —-4 
116 18 6 23 8 —36 3504 20 | 161 12 3 2 12 —28 1698 21 
117 8 7 6 & 1 3966 118 | 162 12 4 4 58 -—65 1601 15 
118 18 7 7 54 —73 3407 14 | 163 12 4 5 2% -9 2081 40 
119 19 -2 8 61 —77 1037 10 
120 19 -2 9 29 -—21 1517 35 — h + 4g plus 
No. f die b a r w 
intial aaa 164 1 4 8 5 5 2273 128 
™ ft €¢ @ + © + © ins 1 42 53 65 —23 2559 141 
121 2 9 8 5 24 901 134 | 166 1 42 55 2 —41 2480 62 
122 4 74 2 5 —1 3970 109 | 167 1 43 38 79 —4 2942 160 
123 5 65 26 12 —5 1600 105 | 168 1 43 39 48 —78 2383 56 
124 6 55 42 5 —27 2943 83 | 169 1 43 40 16 —22 2863 81 
125 7 48 14 8 63 1819 142 | 170 1 45 10 44 16 3629 119 
126 9 -—67 780 13 | 171 1 45 11 13 —58 3070 15 
127 8 36 59 5 —53 1916 57 | 172 2 31 8 12 —-8 3999 125 
128 8 39 15 15 60 3545 139 | 173 2 33 56 9 -45 110 58 
129 16 —70 2506 10 | 174 2 34 41 23 -26 493 77 
130 8 40 0 30 —51 2889 29 | 175 2 35 2 37 -7 87% % 
131 9 29 31 8 387 792 116 | 176 2 35 27 6 -8l 317 -8 
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— h + 4g plus (continued) — h + 5g plus 
No. f die b a r w No f d c b a r w 
177 36 11 = 5il 12 1259 115 | 223 0 -—-2 0 2 64 3045 68 
178 2 36 12 19 68 1739 140 | 224 3 -—66 2006 —6l 
179 20 -—62 700 ll 
180 3 21101 5 —31 1246 102 ~ opie 
181 3 22 8 19 —12 1629 121 
-— ss 2&8 348488 61? *.* fe 
183 3 2 57 16 —48 1836 55 | 225 0 7 -—82 2188 17 
184 3 25 41 62 —85 1739 49 | 226 0 55 21 —63 2571 36 
185 3 26 27 44 -—10 2602 93 | 227 0 56 35 —44 2954 55 
185’ 3 27 12 58 9 2985 112 | 228 0 56 3 12 3434 80 
186 3 27 13 27 -—65 2426 8 | 229 1 43 17 —49 3707 80 
187 4 14 72 40 4 3738 137 | 230 1 

4 1 

4 

4 

4 


8 

od 

I 

Py 

& 
woe ee 

#8 





233 

234 

235 

236 
194 4 17 29 19 42 712 114 | 237 35 6 22 2200 120 
195 20 —87 3769 —14 | 238 7-108 1161 —-9 
196 4 18 4 #4 —-69 56 4 | 239 2 36 41 20 41 2583 139 
197 4 18 1 2 —13 536 29 | 240 21 —89 1544 10 
198 4 19 0 16 6 919 48 | 241 2 37 7 3 -4 407:~= 4 
199 5 4 8 2 —-93 426 10 | 242 2 38 11 49 —51 2310 48 
200 5 5 74 16 —74 809 29 | 243 2 38 12 #17 5 2790 73 
201 5 6 59 30 —55 1192 48 | 244 18 —125 1751 —56 
202 5 7 44 44 —36 1575 67 | 245 3 23 101 3 —94 2297 35 
203 5 7 45 12 20 2055 92 | 246 3 24 8 17 —75 2680 54 
204 13 —110 1016 —37 | 247 3 29 13 24 1 420 69 
205 5 8 29 58 —I17 1958 86 | 248 4 14102 10 —97 4023 32 
206 5 8 30 2% 39 2438 111 | 249 4 14673 6 -4 1173 
207 27 —91 1399 —18 | 250 4 18 44 3 -—40 1380 28 
208 5 9 15 41 —72 1782 1 | 251 4 19 28 49 —77 1283 22 
209 5 9 16 9 —16 2262 26 | 252 4 19 29 17 —21 1763 47 
210 5 10 O 55 —53 2165 20 | 253 4 20 1 O —76 1587 —38 
211 5 10 1 2 3 2645 45 | 254 4 21 0 138 7 3009 110 
212 6 -—2 46 20 —113 2742 —40 | 255 14 —57 1970 —19 
213 6 —1 30 65 —20 3684 83 | 256 5 4118 2 10 2309 138 
214 6 —-1 31 34 —% 3125 —21 | 257 5 6 89 —1 —26 2516 72 
215 6 -—1 32 1 91 548 132 | 258 5 8 59 27 12 3282 110 
216 2 —38 3605 4 | 259 5 9 48 73 -—25 3185 104 
217 6 0O 1 79 -—1 4067 102 | 260 5 9 45 10 —43 3106 2% 
218 6 O 16 48 —75 3508 —2 | 261 5 10 30 24 —24 3489 44 
219 6 O 17 16 —19 3988 23 | 262 5 11 14 70 —61 3392 38 
220 6 1 1 62 —56 3891 17 | 263 5 11 16 6 5 26 87 
221 6 1 2 30 -1 275 41 | 264 7 —79 3313 —4l 
222 6 1 3 -2 55 755 66 | 265 5 12 0 52 13 159 ~—s 8 
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— g plus (continued) | 0 plus 

No. f de b a r w No. d c b a r w 
266 5 12 1 2 69 639 106 | 276 —2 26 98 —1 3488 121 
267 21 —60 3696  —22 | 277 —2 27 67 —-75 2929 17 
268 6 -1 61 3 -—66 353 2 | 278 —2 28 35 —-19 3409 42 
269 6 0 46 17 -—47 = 736 21 | 279 —2 29 3 37 ©3889 67 
270 6 1 32 0 -—102 560 —64 | 280 4 -93 2850 —62 
271 6 cf? BB 47 1982 84 | 281 -1 13 49 0 3792 61 
272 14 -—83 943 —45 | 282 -1 14 17 55 176 85 
273 6 3 1 59 10 1885 78 | 283 18 -74 3233 —43 
274 6 3 27 66 2365 103 

275 28 —64 1326 —26 





6. The table. Except for the final three tablettes marked B = 3, all the 
tablettes have B = 0 and lead to decompositions of r + Aa + s, where s = 
QW@+g+h. 

Tablette A = 0 is merely an arrangement according to the constant term r 
(second column) of those equations (number in first column) in which the co- 
efficients of a and b are = 0. The third column gives w + D — 1, where w 
is the weight of the left member of the equation and D is the difference between 
its r and the following r. For example, 


No. r D w wt+D-1 





102 228 28 111 138 
263 256 73 87 159 
191 329 95 


Hence 138 is the weight for the largest (unprinted) r between 228 and 256. 
For tablette A = 0 the largest entry in the third column is the final entry m = 
195. It was computed from the (unprinted) final r = a = 4096 and the weight 
142 of No.9. In (7), N is here s, whose weight is 4. Hence each of the integers 
r+s(r=0,--- , 4095) isa sum of 195 + 4 twelfth powers. 

Tablette A = 47 employs only equations in which the coefficient of b is = 0 
and that of ais = —47. 


An earlier table A = tis denoted by (t). A row of dots means insert from the 
next earlier tablette. 

For A = 8 we cite only the numbers of the new equations to be inserted in 
tablettes A = 0,1. The effect of the insertions is usually shown in detail under 
A = 19. Similarly for A = 10, 43, 53, 60, 63. 
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A= 254 3009 145 93 1158 145 
0 156 COS 223 «=. 3045 138 249 1173 129 

265 159 49 80 3116 179 50 1209 166 
102 228 138 43 3196 187 177 1259 144 
263 256 159 258 3282 124 13 1289 161 
191 329 185 112 3297 186 28 1332 154 
247 420 162 20 «= 3398 170 56 1416 125 
104 514 157 228 49-3434 173 158 1491 166 
215 548 177 91 3528 151 106 1571 117 
64 594 170 128 3545 172 123 1600 133 
266 639 161 44 3579 170 181 1629 163 
133 695 126 170 3629 169 14 1672 141 
194 712 172 113 ~—- 3680 162 | 77 1676 119 
31 771 154 187 3738 bs 84 1680 165 
131 792 180 70 ©3760 138 178 1739 156 
231 857 185 281 3792 157 40 1756 134 
121 901 151 279 =. 3889 143 75 1773 167 
198 919 127 117 —- 3966 193 125 1819 158 
136 999 114 9 4042 195 6 1836 136 
235 1064 169 0) 1853-2200 
93 1158 181 A=1 108 2240 153 
50 1209 166 256 2309 48 164 2273 163 
177 1259 144 65 2320 167 256 2309 148 
13 1289 161 274 2365 175 65 2320 167 
. ee C8 te ae 274 «= «2865 144 
86 1419 190 117 3966 21 | 241 2407 143 
158 1491 166 122 3970 1890 | 33 2497 151 
106 1571 189 9 4042 166 | 1443 ~—-:2518 175 
14 1672 4 217 4067 130 60 2581 127 
52 1689 191 95 2598 118 
178 1739 1733 A=8 185 2602 135 
75 1773 167. 155, 48, 175, 211 2645 124 
125 1819 175 CO 249, 137, 123, 48 2725 111 
38 1853 15 77, 164, 148, 243 2790 87 
273 1885 ss 99, 167, 89, 59, 99 2805 144 
57 18% )Ss—idN 8, 68, 172 145 2901 172 
271 1982 156 167 2942 177 
203 =. 2055 im A = 10:84 @) 2960-3045 
41 #42135 «©6158 80 3116 128 
237 ©2200 159 A=19 89 3145 166 
108 2240 1839 (0) 0-256 43 3196 130 
256 462309 « 198i |i 329 «120s 59 = 3225——s«*i168 
274 «=. 23865 175 | 155 355 107 8 3276 124 
206 ~—«-:2438 169 247 420 162 | 112 = 3297 165 
33 2497 151 | 104 514 145 68 § 3377 144 
1443 ~—-:2518 177 197 536 131 228 ©9344 133 
239 «= «-2583 63 ) 639-771 276 «= 3488 160 
95 2598 161 131 792 149 (0) 3528-3889 
211 2645, 189 | 48 826 147 117 3966 121 
243 ©2790 183 175 876 139 122 3970 137 
145 2901 190 | 198 919 127 172 3999 167 
3 2960 172 | 136 999 114 9 4042 166 
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91 
73 
114 
119 
124 
85 


1763-3116 


155 
114 
127 
124 
111 
119 


106 


176 119 
211 123 
228 114 
232 112 
256 105 
275 120 
355 107 
420 83 
435 102 
493 119 
536-695 

712 130 
729 98 
736 110 
826 123 
852 112 
919 127 
999 114 
1064 119 
1108 72 
1113 129 
1209 121 
1214 135 
1332 118 
1380 63 
1416-1698 
1763 61 
1778 129 
1885 88 
1896 137 
1958 109 
1982 118 
2017 126 
2038 104 
2055-2262 
2365 116 
2379 105 
2407 126 
2480 116 
2535 136 
2581-2805 
2805 82 
2839 134 
2943 93 
2954 119 
3019 119 
3045 128 
3106 114 
3196-3409 
3489 58 
3504 120 
3605 141 
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70 =. 3760 138 | 216 ©3605 4 «| 210 2165 99 
281 3792 12 267 = 3696 73 147-2245 38 
101 = 3844 108 281 3792 112 209 «= 2262 73 
12 3866 79 CO 242 =. 2310 62 
279-3889 122 A=72 9% 2325 103 
199 3945 92 198 919 82. 61 2405 46 
72 3967 120 236 954 101 186 2426 37 
37 4047 91 136 999 83 18 2456 61 
55 1033 106 129 =. 2506 74 
A = 53:210, 229 157 1108 72 226 23-2571 50 
Wdcwensniiehnansnsioas 90 2586 70 
A = 60:127, 30, 267 56 1416 87 211 2645 79 
182 1453 a 246 ©2680 89 
A = 63 16 1496 73 (63) 2716-2839 
197 536 91 120 1517 114 130 —-.2889 68 
94 599 7 si 79 1597 40 277-2929 106 
53 650 8s 162 1601 111 81 3019 4 
269 736 110 161 1698 85 46 3020 102 
pchictvaeientxewaes 0) atksunaaaticasinae 260 «=. 3106 97 
15 1113 112 271 1982 107 188 3179 75 
201 1192 69 224 2006 93 24 3222 60 
76 1214 46 109 ~—s 2161 45 | 283 3233 46 
87 1243 a a ae Se ae 83 3323 102 
28 1332 118 88 2379 103 262 3392 52 
bistetetNabett ipa bade 61 2405 100 118 3407 95 
107 1778 I Ge Bees ee a eae 261 3489 62 
183 =s-«18386 108 | i 26—s 3106 97 218 3508 4 
273 1885 ss 188 3179 75 (63) 3605-3866 
57 1896 S | 24 3222 124 279 «= 3889 68 
127 1916 Ss | 220 = 3891 92 
205 1958 109 A=75 72 3967 44 
Ce ere 173 110 83 | 154 =. 3971 94 
109 ~—_ 21161 4 OC 134 136 70 37 4047 91 
210 =. 2165 116 233 201 105 
209 2262 73 221 275 Ml | 4 = 87 
242 —-2310 116 47 346 81 196 56 83 
88 2379 105 155 355 107 134 136 70 
ghycdiunebabacesecis 247 420 8 | 233 201 75 
17 2535 119 105 435 78 156 245 53 
30 ©2564 113 26 469 76 | 221 275 82 
226 =. 2571 109 135 519 41 | 176 317 27 
211 2645 a Se 268 353 83 
42 2716 8 | 269 736 9 | (75) 435-599 
148 ©2725 111 200 809 ) 53 650 62 
243 ©. 2790 87 27 852 112 179 700 46 
99 ©2805 a eee oeeren eee oe 269 736 64 
22 2839 80 87 1243 62 126 780 41 
130 2889 93 139 1272 103 200 809 71 
227-2954 119 275 1326 63 27 852 82 
81 3019 4 (72) 1416-1496 138 889 60 
46 3020 102 120 1517 92 198 919 71 
260 3106 114 202 1575 88 272 943 44 
LAER eS 2 Teer (72) 1597-2161 55 1033 35 




















1037 80 
1108 72 
1113 37 
1117 86 
1192 69 
1214 21 
1218 60 
1283 64 
1326 63 

1416-1517 
1575 78 
1587 72 
1698 83 
1761 35 
1778 26 
1782 80 
1862 82 
1942 32 
1970 16 
2006 72 
2140 80 
2161 45 
2165 42 
2188 73 

2245-2310 
2325 81 
2383 77 

2405-2645 
2680 68 
2695 59 
2725 63 
2742 87 
2805 82 
2839 80 
2889 68 
2929 27 
2940 43 
2969 59 
2998 68 
3020 66 
3070 50 
3106 76 
3158 71 
3179 75 
3222 60 
3233 36 
3313 52 

3407-3605 
3696 50 
3769 82 
3866 79 

3889-4047 





DICKSON 

= 94 
536 45 
553 52 
599 77 
1517 61 
1544 52 
1587 72 
3407 71 
3465 69 
3489 62 
3508 77 
3588 16 
3605 45 
3647 68 
3696-3889 
3891 39 
3914 66 
3967 44 
3971 35 
3988 81 
4047 63 
4068 52 

= 98 
1117 69 
1175 60 
1192 69 
1326 46 
1399 61 
1479 48 
1544-2725 
2742 82 
2800 61 
2805-3313 

= 113 
56 51 
104 57 
136 39 
170 62 
245 53 
275 58 
293 74 
426 52 
469 76 
519 41 
536 45 
553 13 
560 25 
650-780 





809 33 
814 37 
857 32 
943 27 
1016 59 
1113 37 
1117 55 
1161 43 
1214-1283 
1326-1544 
1587 42 
1668 33 
1698 61 
1739 70 
1761 35 
1778 26 
1782 59 
1841 41 
1862 61 
1921 43 
1942 32 
1970 16 
2006 63 
2131 54 
2161-2310 
2325 57 
2359 51 
2405-2571 
2586 40 
2615 53 
2641 —3 
2663 24 
2742 56 
2839 41 
2850 27 
2940-3233 
3313 31 
3386 63 
3483 62 
3508 20 
3531 30 
3567 38 
3588-3696 
3769 65 
3849 29 
3891-3971 
3988 57 
4023 55 
4047 63 
4068 52 
A = 122 

293 —1 
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73 351 48 214-3125 58 145 2901 144 
199 426 35 151 3205 8 | iW 2914 137 
159 452 41 3233-4096 185’ 2985 = s:187 

536-2165 
225 —«-2188 64 A = 125 B=3, A=36 
41 2236 60 184 1739 60 33 2497 149 
147-2245 38 244 1751 33 257 «2516 —Ss«<136 
209 2262 60 34 ~—s«1841 41 60 2581 127 
245 «2297 62 

2325-2456 B=3, A=19 B=3, A=43 
129 2506 43 43 4602518 ~Ss«<171 194 712 ~—-:130 
45 2540 37 2 2577 136 49 7292117 

2586-3070 60 2581 127 222 755 136 
260 3106 | Seager Rips ae 48 826 147 








7. Conclusion from the table. In the tablette with a fixed A, let m denote 
the maximum entry of its third column. For B = 0 we havet 





A 0 1-7 8-9 10-26 27-31 3246 47-52 53-59 60-62 
m 195 191 178* 177* 166 162* 142 138 135 
A+m 195 198 187 203 197 208 194 197 197 


A 63-71 72-4 75-86 87-93 94-7 98-112 113-21 122-4 125-9 
m 127 124 112 95 87* 83 76* 70* 68 
A+m 198 198 198 188 184 195 197 194 197 





Here A + mis found from the largest A in its sequence. Thus A + m S 208 
forevery A. We have also the further needed facts. For A = 19, max = 172 
except for gaps 175 at 2580 and 177 at 2959. For A = 36, max = 156 except 
for gap 162 at 2580. For A = 43, max = 152 except for preceding gap and for 
r = 712-771. 

Employing also tablettes with B = 3, we see that, when B = 3, A + m S 198 
for every A. Hence all our results imply that, if B < 12, B + A +m Ss 210 
for every A < 129. Applying (2;) we obtain, as in §6, 

THEOREM 3. Every integer between s = 2d + g + hand s + 13b is a sum of 
214 twelfth powers. 


8. Ascent. We take n = 12 in the writer’s® 
TueoremM 4. If every integer > sand = s + Disa sum of k — 1 integral n-th 
powers = 0, and if m is the maximum integer satisfying 


(9) (m+ 1)" — m < D, 


every integer > sand S s+ D+ (m+ 1)*is a sum of k integral n-th powers 2 0. 
Increasing the interval in Theorem 3 by b 18 times, we infer that 232 powers 
suffice from s to s + 31b. Since (9) holds if m = 3, D = 31b, Theorem 4 with 


4 The values of m marked * are true maxima for minimum decompositions (§10). 
5 Bull. Amer. Math. Soc., vol. 39 (1933), p. 710, Theorem 10. 
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k = 233 implies that 233 powers suffice from s to s + 31b + c. Add c 12 more 
times. Thus 245 powers suffice to s plus 31b + 13c = 234 578 479. 

Take the latter as a new D and note that d — ec < D. Since (9) now holds 
if m = 4, Theorem 4 shows that 246 powers suffice from s to s + D + d (briefly 
we may addd). Adding 6d, we see that 252 powers suffice from s to 


(10) s + 1 943 562 854. 

We may add f. We add 5f, 4g, 3h, 2-9", 2-10", 2? (2 = 11,---,15). We 
find that 273 powers suffice from s to beyond 

(11) [Ip = 224 715 123 X 10°. 


We have reached the stage where single ascents are unnecessary, but may 
make ¢t ascents at once by use of the writer’s result (l.c., p. 711, n = 12): 

TuHeoreM 5. [f all integers between s and Ly inclusive are sums of k integral 
twelfth powers, then all between s and L, inclusive are sums of k + t integral powers 
if log L, = (12/11)*(log Lo + 12 log V) — 12 log V, 12V = 1 —s/Lp. 

We take k = 273 and Jy as in (11), s/Ly = .00036957, log V = —1.0793417, 


log Lo = 14.3516323, log Lo + 12 log V = 1.3995319, 
(12) log log L, = .0377885t + .1459828. 


9. Proof of Theorems 1, 2. In the current number of Annals of Mathematics 
the writer gives amplifications and generalizations of the remarkable paper 
by Vinogradow (ibid.) on the asymptotic Waring problem, and in particular 
obtains the following fact. If log log N = 7.5068, all integers 2 N are sums 
of 586 twelfth powers. By (12), L, > N if t 2 195. Hence all integers = s 
are sums of 586 twelfth powers. This can be reduced to 440. 

But® 1560 powers suffice from 3b to 5 X 10”, 2405 powers suffice from 2b to 
3b, and I powers from 1 to 2b. 


10. Table M of minimum decompositions. We first constructed table M, 
here omitted. The part with B = 0 required 240 equations in addition to those 
used for the corresponding part of our above table T. The parts with B = 
1, 2, 3 required 45 further equations. Still further equations were used to prove 
that if B = 10, A + m S 189, and if B = 22, A + m & 186, both for every A, 
while 224 twelfth powers suffice from s to 


dd +g +h=s+8d = 8 + 1953 125 000. 


The discussion following (10) applies also here. Hence 245 powers suffice from 
s to (11) increased by 10’. The remarkable Theorem 3 cannot be improved 
by use of M. 


UNIVERSITY OF CHICAGO. 


6 Bull. Amer. Math. Soc., vol. 39 (1933), pp. 709, 713. 














GROUPS OF CREMONA TRANSFORMATIONS IN SPACE OF 
PLANAR TYPE. II 


By Artuur B. CoBLe 


1. Introduction. We have defined in part I' of this account the meaning 
to be attached to the phrase ‘‘of planar type” and have given one example of a 
group G of this type. It is the purpose of this article to give further examples 
of groups G, which differ in some essential respects from the first. 

The stable character of the elements of G is due to the fact that all of the 
elements have a common F-curve of the first kind. In the example given (ef. 
footnote 1) the elements had in addition variable isolated F-points. In the 
first three examples given below the elements have also an F-curve of the first 
kind, which may vary with the element, and whose nature is dependent upon 
that of the variable isolated F-points. In the fourth example given below 
there is also a fixed isolated F-point. 

In order to ensure that the elements of G have a common F-curve of the first 
kind, it is convenient to define G by means of involutorial generators. For the 
first three of our groups G we use generators of types given by Sharpe and 
Snyder.2, We develop anew the properties of these generators by a mapping 
process. 

In such a provisional exploration as this, it is convenient to avoid the com- 
plications of contact singularities. Simplifying assumptions in this direction 
are sometimes made. 

The first group G developed in §4 is considered in more detail than the later 
ones. Since the groups G are all associated with linear groups g generated by 
involutorial elements of a particular arithmetic character, it would seem prefer- 
able to discuss the groups g more generally before making applications to the 
Cremona groups G. This the author hopes to do in an early paper. 


2. Webs of cubic surfaces of degree two. Since the generic surface of a 
web can have only fixed singularities, we distinguish two cases: (a) the generic 
surface has no singularities; (b) the generic surface has a node. If, in case (a), 
K, is a generic surface of the web, and \2Ke + A3sK3 + A4K, a residual net, then 
this net must cut K;, in a fixed curve C and a variable net of degree two. If this 
net on K;, has fixed base points, we require them to be simple base points. For 


Received February 24, 1936. 

1A. B. Coble, Groups of Cremona transformations in space of planar type, this journal, 
vol. 2 (1936), pp. 1-9. 

2? F. R. Sharpe and V. Snyder, Certain types of involutorial space transformations, Trans- 
actions of the American Mathematical Society, vol. 21 (19380), pp. 52-78. 
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a double base point would imply a contact for surfaces of the web, which we 
wish to avoid. 

Let K, be mapped from the plane FE by the system of cubic curves (q; - - - g«)* 
on six points g. The net of curves on K; is the map of a planar net (qj - - - q2)°. 
If the net on K, has a fixed curve C, the net on E must decompose into a fixed 
curve c and a residual net of degree two. Moreover, the fixed base points of 
this residual net outside q, --- , g must be simple base points if the net on K, 
has simple base points as required. 

Planar nets of degree two are either the Geiser net of cubics (17)* or the trans- 
form of such a net by a Cremona transformation, such as (271°)*, (251)5, ... . 
In the case of these nets of higher order, all of the multiple base points must be 
found among qi, --- , qs. Moreover, there are at least three of such multiple 
base points except in the case (271°)*. In this case, one of the simple base points 
also must be in qi, --- ,q. Otherwise the fixed part c of the net (q} --- q3)° 
would be a (qj --- 93959)", which cannot exist. Thus the three multiple base 
points of highest order are always in q, --- , gs, and a quadratic transformation 
with F-points at these three points will reduce the order of the net. The residual 
net can, therefore, be reduced eventually to the Geiser net (17)* by a process 
which merely shifts the mapping double six on K,. 

Thus we find just four essentially distinct degenerations of (q} --- q3)* into a 
fixed curve c, and a Geiser net, namely: 


fized curve c Geiser net 
I (qi --+ q6)° (q +++ er)? 

II (9192 es q8)° (q2 +--+ GerT2)* 
Ill (919293 non qs)° (93 +--+ QeTiT2rs)® 
IV (19293949596)° (qaqsqer «++ 74). 


The cases break off because a fixed curve c of type (q? --- qiqzq3)® cannot 
exist. In these four cases the fixed curve c maps into respectively a space sextic 
of genus four, a space quintic of genus two, a rational space quartic, and a set 
of three skew lines. The points 71, re, - - - map into simple base points py, po, - - - 
of the web. 

In case (b) let the generic cubic surface K, have a node fixed for the entire 
web at N. The surface K,; can be mapped on the plane E by cubies (q: - - - q)*, 
where qi, --- , g are on a conic whose points map into directions on K, at N. 
A cubic surface on N cuts K; in a curve whose planar map (qj - -- ¢2)° contains 
the conic (q: --- qs)’. If the surface also has a node at N, the conic appears 
twice in the map, and the residual curve is a (q --- q)*5. Since the net 
AoKe + A3sK3 + AsK, with node at N must meet K, in a net of curves with a 
fixed part C and a variable part of degree two; (q: --- qs)® must have a fixed 
part c and a variable net of degree two, which must be the Geiser net (17), 
since the multiplicities at q are all unity. Thus c is a conic. If ¢ is not on 
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four points q, it will meet the conic (q --- qs)? in points not at g, whence C 
on K;, will pass through N. Since we wish to avoid, as far as possible, incidences 
of isolated F-points with F-curves of the first kind, we take c on four points q, 
and obtain a fifth web for which 


V: c: (q. --+ ga); Geiser net: (Qsq671 --- 1s)*. 


The conic c maps into a conic on K,, and the points r into simple base points p 
of the web. 

In each case the existence of the net of degree two on a surface of the web is a 
consequence of the existence of the corresponding net in a planar map, whence 


(1) There is a web of cubic surfaces of degree two defined by each of the following 
bases: 


I: a space sextic C, of genus four and a simple point p,; 

II: a space quintic C; of genus two and two simple points pi, p2; 
III: a rational space quartic C, and three simple points p,, p2, Ps; 
IV: three skew lines C; and four simple points py, --- , Pa; 

V: a conic C2, a node N, and five simple points p,, --- , ps. 


Each base may be chosen in generic position in space. The net of surfaces of the 
web on a point x is also on x’, and x, x’ are correspondents under a Cremona involu- 
tion I, Tie, Lies, Lies4, and Ih2345, respectively. 


There is a variety of particular cases of each of the above involutions which 
depends upon various degenerations of the curves C. These we do not pursue 
but note merely that no one of the cases I, --- , V isa particular case of another. 
The group G generated by involutions of type J; has been considered in Part I.! 
We proceed to examine the remaining involutions and the groups G generated 
by them. 


3. The Cremona involution J,.. Let C; be a space quintic curve of genus 
two on a quadric Q, and let p:, pe be two points in generic position with respect 
to C; but, in any case, not on Q. Then the Cremona involution J;2 of 2 (1) is 
determined by the web of cubic surfaces (Csp:p2)*. The web contains surfaces 
(Cspips)*, (Cspp3)*, which, from the definition of Ji2, are the P-surfaces of the 
isolated F-points pi, p2, respectively. 

If the generic cubic surface K, of the web is mapped as in 2 upon the plane EZ, 
Cs maps into the sextic (qjq3 --- q3)*®, and the residual net of curves on K, is 
mapped into the Geiser net (q2 --- gerir2)®°. The Geiser octavic involution G 
determined by this net is the map on E of J,,0n K,. The members of the Geiser 
net with nodes at 7, re, respectively, are unique. Hence the above P-surfaces 
are unique. 

All points z on a generator of Q trisecant to Cs; are on the same net of the 
web K, and thus determine the same point zx’. The locus of these points 2’ 
determined by a variable trisecant generator is an F-curve L of the first kind 
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whose P-surface is Q. Since K, contains one trisecant variable with K,, then 
K, must meet L in one variable point. This trisecant is mapped on EF by the 
directions at q:, and these pass by G into the directions at s,, where s; is the 9-th 
base point of the pencil (qe - -- gerireq1:)*. But this is the map of the pencil on 
K, cut out by the pencil of planes on p,, p2, each of which with Q is a surface of 
the web K. Thus §S, is the map on £ of the further intersection of the line pipe 
with K,, and L is the line pipe. Since a plane 7 meets each trisecant once, the 
F-curve L is a simple curve on the homaloidal web Hj: of Diz. 

A generic plane z cuts K, in a cubic curve which passes by J), into the section 
of K, by H, that member of H,. which corresponds to 7 under Jiz. In E the 
section + is a (q: --- qs)* which passes by G into a curve of the web (q} --- 
qzrirgs,)®. If we add to this planar web the fixed curve {(q?q3 --- q2)*}*, ie., 
the map of C; taken three times, it becomes a web (q} --- giriris,)”, which is 
the map on E of the homaloidal web Hi. = (LC3pip?)’. 

The quadric Q is a P-surface containing only the F-curve C;. It must, there- 
fore, pass by J;2 into the P-surface of C;, but on this P-surface L must have a 
multiplicity one greater than the normal multiplicity to indicate that Q is 
the P-surface of L. Hence the P-surface of C; is an (L’C§p{p})*. 

To verify the completeness of the enumeration of isolated F-points and F- 
curves of the first kind, we transform Hj. = (LC{p{p})® back into the web of 
planes by another application of Jj: The transform is an (L°C{"p{*p}*)™. 
From this the four P-surfaces must separate, respectively, once, three times, 
four times, four times. There is left only (L°C$p{p3)', the web of planes. 

We observe also that, if 72. is any plane on 7, ps, 712, Q is a member of K in- 
variant under J;2, Hence the variable part 72 of this surface is also invariant. 
The web K cuts m2 in the net of curves (p:pojs - - - jz), where js, --- , jz are the 
meets of m2 and C;. Hence Jz on 72 is the Geiser involution G’ determined 
by this net. Under G’ the line pyp2 passes into the conic (j; - - - jz)®, the section 
of Q by m2. The directions on 72 about js correspond under G’ to the cubic 
(pyPejaja +++ Jq)®. The five cubics of this sort, and L taken three times, make up 
the complete intersection of 72. and the P-surface of C;. Thus the P-curve 
which corresponds to an F-point j on C; is a plane cubic in the plane (jpip2)! 
with node at j and simple points at p,, p2, and the remaining four intersections 
of the plane with C;. 

We seek now the F-curves of the second kind of J;2, curves contained on every 
surface of Hi. = (LC’pip$)*. The four bisecants [)) from p, to C; and the 
four 1?’ from p2 to C; are each F-curves of the second kind. The two P-sur- 
faces P,, Py, meet outside C; in an elliptic quartic 8-secant to C; with nodes 
at p, and pe. This degenerates into two conics k2, k, each on p,, p2 and 4-secant 
to C;, and therefore in the base of Hiz. Since these conics are cut by planes x 
in two points, they are double on the surfaces of Hi2. Also the two trisecants 
g, g’ of C; on the points where L cuts Q are in the base of Hi2. For the same 
reason as indicated above for L, g and g’ are three-fold on the P-surface of Cs. 
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The completeness of the above enumeration of F-curves is verified by taking 
from the common curve of order 81 of two surfaces H the common F-curves 
indicated. There is left a curve of order 9, the proper transform under J). 
of the common line of the two planes which correspond to the surfaces H. 
Hence 


(1) The homaloidal web and P-surfaces of the involution I,. have the following 
description in terms of the F-system: 


His (LCSpip2)9Uy UY kek 2799’) ; 
Q(C5)*(g, 9’) ; 

(L9C§ pips) Bly Te "kp ke‘g'g”) ; 
P,, = (Cspipa) (ly keak:) ; 

P;, = (Cspip2)°(UP kak) 


Pi 


Pe, 


It is important for the sequel to note the subordinate réle played by the 
F-curve L of the first kind. Both it and its P-surface Q are determined by the 
choice of the other F-loci C5, pi, pe. 


4. The Cremona group G generated by involutions /,. and its linear group 
g. Let pi, po, Ps, ps be points in generic position with respect to C;. There 
are three types of products that may be formed from two involutions J;; 
(i,j = 1,---, 4). The first type, Ji2/12, is the identity; the second, J)2/;3, 


4,48 


has a homaloidal web of the form (C$q{q2q3)", where 9:, 93 = Pi, Ps and qe is the 
transform of ps by Ii3; and the third, Ji: Iss, has a web (Ciqiqiq}*qi*)”, 
where qs, 74 = Ps, Pa and qi, g2 is the transform of pi, pe by Js.. 

Let G be the Cremona group generated by involutions J,2 for variable ~, pe 
but fixed C;. We shall be interested mainly in the types of Cremona trans- 
formations in G. In forming a generic element II = II’ J,, I1"’ of G, the F-points 
Pr, Ps Of I,, may fall, wholly or in part, among the isolated F-points of the web 
of II’, as in the very special cases above. We do not consider the case when one 
or two of the points p,, p, fall on P-surfaces of the web of II’. Products of this 
latter kind have coalescent F-loci. They may be regarded as particular cases 
of the more general products. 

The direct and inverse homaloidal webs of any product II will each have one 
/’-curve Ly of the first kind whose P-surface is Q. If ll = J,,-Tl’, then Ly is 
the transform of the line p,p, by II’; if Il = I11’’-J,,, then Lyg- is the transform of 
prps by (Il’’)-*. The knowledge of these F-curves is not necessary to express an 
element of G as a product II. 

With respect to J), a linear system of surfaces has a characteristic {y}, i.e., 
an order y, a multiplicity y on L, yo on Cs, and y;, y2 On pr, Po. This linear system 
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is transformed by J): into a linear system of characteristic {y’}, where {y’}, 
according to 3 (1), is expressed in terms of {y} by 


y’ = My — 29 — 18y — 3x — 3y2, 
y= y — 3y, 

(1) yo = 3y— G— by — ym — ye, 
yi = 4y — 8y — 2y —y, 
y2 = 4y — 8y— yi — 2ye. 


This linear transformation of determinant —1 is involutorial, as we should 
expect. It is clear that y’ — 3y, = 9,9’ = y — 3yo. If we make the change 
of variable 

(2) y—3y=2, 9Y=2%, Yy-—Ww=%, YW= 4, Yo = 4x2, 


the linear transformation (1) interchanges z, Z and yields the foliowing trans- 
formation on 2, 21, 22: 


Ip = 320 — 4x, _ 472, 
, 
a, = Mo — 24, — 22, 
(3) ue. C, 
Le = tom ai 222, 
, ° 
Zz, = 7; (j > 2). 


The last equation in (3) merely expresses that other multiplicities are unaltered. 

The transformation 7): has period two and determinant + 1. We observe 
that if our linear system has no special relation to L, i.e., if 9 = Z7 = 0, then 
y’ — 3y, = 2' = 0. Ifalsoy — 3y =z =0,then2’ = 0. For the homaloidal 
webs mentioned above, those of Ji2, Zielis, Liels4, y — 3yo does vanish. More- 
over, when these webs are transformed by further involutions J;;, the webs 
have no special relation to L,;, the subsidiary F-curve of J;;. We are therefore 
entitled, in transforming these webs, to set z = Z = 0, and then have, in the 
transform, z’ = 2’ = 0. Thus the significant effect of J ;; is that represented by 
tis (i,j = 1,---,; ¢ # J) on the variables 2, z;, z; alone, and we pass from 
the characteristic {x} to the characteristic {y} as in (2). A single exception 
is the web of planes for which 2, 21, 72 = 1,0, 0 and z,Z = 1,0. However, this 
exception disappears immediately under transformation by 72. It is perhaps 
better to remove the exception entirely by adding to the web of planes the fixed 
quadric Q = (C;)*, in which case the web has the characteristic y, 9, yo, Yi, Ye = 
3, 0, 1, 0, 0, which yields z, Z = 0, 0; 2, 21, 2 = 1,0,0. This is the transform 
by (1) of the characteristic of the web (LC{p{p})? plus the F-curve L with mul- 
tiplicity — 1, i.e., of the web (CZpip2), the homaloidal web with the L-inci- 
dence disregarded. It is obvious that the multiplicity —1 must be ascribed 
to L, to Cs, to m:, and to pe, in order that (1) may furnish the corresponding 
P-surface. 
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We now state the theorem: 


(4) Let g(p) be the linear group generated by involutions ix, (0 < j,k S p;j # k). 
If the generic element of g(p) is 


/ 
Lo = Anoro — 4anir1 _ 4ay2X2 a tt 4a0,%,, 
, , 
9: 2, = awXo — Qn 1%) — Ai2%2 — +--+ — ay I, (l= 1,---,p), 
, 
Z—_ = In (m> p), 


then this element represents a Cremona transformation G whose homaloidal web 
has the form (LyC§" pi*” --- pi*~)8*. The P-surface of Cs is 


(LiC5* pi®™ --- pp*)°* ; 


of px is (CE* pt* --- po)ee* (k =1,---,p). The F-curve Ly of the first kind 
is determined by the other F-elements, and its P-surface is Q on Cs. 


For we observe that the theorem is true for g = iy. We find that, if it is 
true for g and G, it is also true for giz and GJj2, for gti, 41 and GJ;,,4:, and for 
Jtp+1, +2 and GI,.;,,42, the formation and comparison of these products being 
omitted here [ef. 5, footnote 1]. Since g itself is merely a product of involu- 
tions 7, the proof is complete. 

It is sufficient to examine the effect of (3) to see that 


(5) The group g(p) has the linear and quadratic invariants 


L=%2m—%—2%#— --:-—2,, Q = 25 — 42rj — 473 — --- — 47’. 
For the characteristic ao; am, --- , ao, of the homaloidal web in (4), L and Q take 
the value 1. 


A table of these characteristics which arises from the web of planes by using 
not more than three generators 7;, is as follows (the 0’s not being written): 


a: 7; 3111 15; 63311 
3; 11 7; 222 19; 6633 

” 5; 211 11; 4321 23; 87331 
9; 3311 15; 55211 27; 993311 . 


On the other hand, a table, arranged according to values of ag < 16, of 
values satisfying L = Q = lis: 


(1;0°), (3;17), (5;21%), (7;31%), (7;2%), (;41*), (9; 371%), 
(7) (11; 515), (11; 4321) , (13; 61°) , (13; 532?) , (13; 4°31) , 
(15; 71’), (15; 641%) , (15; 63712) , (15; 5°21?) , (15; 5471). 


In this latter table one characteristic, (15; 641‘), is not geometric. For, 
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transformed by iis, it yields a characteristic with negative ay. With respect 
to the possibility of reducing the order ag of a characteristic we prove that 


(8) Any characteristic for which x) > 1, and which satisfies L = Q = 1, when 
so arranged that , => 22 2--- = x, = O, satisfies also the inequality 
2(21 + 2X2) > 2. 


For on comparing 42, L = 42, with Q = 1, and noting that x22 = 23, etc., 
we have, on factoring out the non-zero xz)» — 1, the inequality 42, = x + 1. 
We have from Q = 1 that 27, < x. Let then 


(a) 4, = %+1+m (0S mS %-— 2). 


If 422 = rz + 1 — m, the theorem is proved. On the other hand, the assump- 
tion 


(B) 472 << a+1—m 


leads to a contradiction. Indeed, if we multiply L = 1 by 422 and compare 
with Q = 1, we find that 


(to — % — 1)4r, => zi — 1 — 42? 


On strengthening this by the use of (8), and on substituting for x, from (a), 
we get m? > m(x — 3). This, however, violates the inequality (a) when 
m > 0, and it is not satisfied when m = 0. 

The inequality (8) permits the statement: 


(9) Any characteristic which satisfies L = Q = 1 can be reduced by involutions i; 
to one of lower x, and eventually to (1; 0°), unless in the reduction process one or 
more of the x, --- , X, become negative. 


For if we apply i. to the ordered characteristic (8), the inequality 
2(a; + 22) > 2% implies ry <2. Hence 


(10) Any solution ao; a, --- , Xo of the equations L = Q = 1 which satisfies 
the set of inequalities conjugate to x; = 0 (ti = 1, --- , p) under g(p) (an infinite 
set when p = 4) defines as in (4) a geometrically existent homaloidal web. 


For such a solution can be reduced to the solution (1; 0°) corresponding to the 
web of planes by successive applications of involutions 7, without introducing 
negative numbers xz; The corresponding Cremona involutions /;,, carried 
out in the reverse order, yield the given homaloidal web. 

With respect to the case p 2 4, we find that 


(11) The group g(2) has the order 2; the group g(3) has the order 24, and is iso- 
morphic with the even octahedral gi2 amplified by a reflection in a plane of symmetry 
on only one of the three diagonals; the group g(p) (p = 4) is of infinite order. 


Let A, denote the set of four forms, one of which is 


(2k? + k + 1)xy — 2k? + k + 1)a, — Bk(k + 1)ze — (2k? + 1)z3 — Wx, 
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the other three arising from this by interchanging 2, 22, or also x3, 24. Let By 
denote the similar set of four forms, one of which is 


(2k? — k + 1)xo — 2(k® — k 4+ Ia, — 2k(k — 1)xe — (2k? + 1)zs — Wz, 


We find that 73, carries A, into By,;, and that t,2 carries By into Ay. Thus 
tsatig Sends A, into Axy:. Hence izq/:2 is of infinite order and g(4) is infinite. 
Necessarily then g(p) (p > 4) is also infinite. 

We prove finally that 


(12) The inverse of the element g in (4) is obtained from g by interchanging aj; 
and a. The coefficients of g and g~ satisfy the following relations in which the 
indices run from 1 to p: 


Dd aio = a0 — 1, DL a0; = a0 — 1, 
. ? 


LD. aij = 4a — 1, LD aij = 4a — 1, 
7 2 
4) a3, = ab, — 1, 4), a3; = ai, — 1, 
t 7 
Lai; = 405; +1, Dai; = 4ai, +1, 
‘ 2 
> Ana; = Apa; , » > Anji = Aoo%o , 
> aAjake = Sajak (j x k) ’ p> Qa = 4a inex no (¢ ca k) e 


For the first column of relations expresses the invariance of L and Q under g. 
If in g we interchange a; and a;; to obtain g’, this first column of relations also 
expresses that gg’ = 1; hence g’ = g~'. The second column then expresses 
that Q, L are invariant under g™. 


5. The Cremona involution /,; and its attached Cremona group G and 
linear group g. Let C, be a rational space quartic curve and p;, pe, ps 
three points in generic position. The web of cubic surfaces K, which defines 
Ti23 as in 2 (1), contains only one degenerate member, 7123 — Q, 7123 being the 
plane (pip2ps3)', and Q being the quadric on Cy. The mapping of K, on E by 
the system (q --- q)* described ,in 2 carries Cy into (qiqiq} --- 42)®, pipeps 
into ryrers3, and the net residual to C, cut out on K, by K into the Geiser net 
(q3 --- G6 :72rs)*. This net determines the Geiser involution G, the map on E 
of Ji23 on K,. 

As in 3 we find that pi, pe, ps are isolated F-points of Ii23. The P-surface of 
pi, P»,; is the surface of the web K with node at pi (C4 p? p; px) (i,j, k = 1, 2,3). 

A generator of Q trisecant to C, corresponds to a single point x’, and the 
locus of these points zx’ is an F-curve of the first kind, L, whose P-surface is Q. 
Since K, meets Q in two such variable trisecants, K, meets L in two variable 
points. The surface K = 7i23-Q is invariant under Ji2;, and Q being a P- 
surface, 7123 is itself invariant. Surfaces K meet m3 in the net of cubics 
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(pipepats --~ tz)’, where ts, --- ,t are the meets of 123 and Cy. The Geiser 
involution G’ of this net is J;23 On 7123. The trisecants of C4 cut 723 in the conic 
(123, Q). This conic is carried by G’ into L, a rational quartic on 7123 with 
nodes at pi, P2, Ps and simple points at ty, --- ,t. This curve L is contained 
simply on the homaloidal web Hy:3 of L123. 

The plane sections of K,; map into (q - - - gs)* which passes by G into the sys- 
tem (q} --- géririr$s,s.)", s;, % being the transforms of q:, gz by G. If we add 
to this the map of the fixed curve C, four times, we get a system (q{* --- q}?r --- 
r$s,8,)", the map of the curves cut out on K, by the homaloidal web, Hi23 = 
(LCip$pip$)". The points s:, s2 arise from the two variable intersections of L 
and K;. Asin3 we conclude that the P-surface of Cy is an (L'C3p}°p3°p3°)™, and 
can again verify that Hj; is transformed by J;23 back into the web of planes. 

We list the following F-curves of the second kind, each contained on every 
member of Hyo3: (a) gi, --- , ga, the four trisecants of C, from points where L 
meets Q outside C,; (b) IY’, 19, I, the three bisecants of C4 from pi, ps, ps 
respectively; (c) cs, the twisted cubic on p;, pe, ps and 6-secant to C4; and (d) 
k3®  kO®, kY*, respectively conics on pi, P2; Pi, P3} P2, Ps each 4-secant to C,. 
Since cs is on P,,, P»,, P»,, there can be only one such curve, a triple curve on 
Hy23. Since P,, and P,, meet outside C, in c; and k{!*?, there is but one such 
conic, a double curve on H,2.3. The completeness of this list can be verified as 
in 3. Hence 


(1) The homaloidal web and P-surfaces of the involution I\23 have the following 
description in terms of the F-system: 


Hyes(LCipipsps)"g, --- gals? --- Uw?” --- RP P%C5); 
P, = Q(C.)?(g1 oes G4); 
Po, = (L?Ci pi’ pe’ pa Qi --- gale --- IP Ry?” --- RP P%e§); 


(Cyp2p, Pals gk ORY”) (i, j, k = 1, 2, 3). 


P.. 


To an F-point on C, there corresponds a P-curve on Pe¢,, which is a rational 
space quartic on p;, p2, ps, cutting L once, and 8-secant to Cy. This is a conse- 
quence of the multiplicites of C, on H, on P,, on P,,, and on Pe,. We observe 
that a curve of this order and these multiplicities must be contained on Pe,. 

We consider the Cremona group G generated by involutions J;23 for variable 
~i, P2, Ps but fixed Cy. The same remarks as are made in 4 with respect to 
products of these involutions and with respect to the behavior of L apply here. 
Let {y} again be the characteristic of a linear system of surfaces with respect 
to the F-basis of Hi23. The linear transformation of this characteristic produced 
by Ji23 is easily written by using (i). If we make the following change of 
variable in this characteristic: 


(2) y— 3y% =z, y = 2, yY — 2yo = Xo, yi=5z; (i = 1, 2,3), 
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then, apart from 2’ = 2, 2’ = z, the transformation of the characteristic is ex- 
pressed by 
ty = 429 - 5x1 _ 5x2 - 523, 


(3) ‘ies: Z, = Zo — Z1 — Te — Zs — Zi (i = 1, 2, 3), 
2; = 2; (j > 3). 


This involution, 7123, of determinant —1 has the invariant linear and quadratic 
forms: 


(4) L=m—m—at%2—---—2,, Q=25 — 52} — Sr} — --- — 52’. 
If the linear group generated by involutions i: (j,k, 1 = 1, --- , ) has the 
generic element 
, 
To = Anto — 5an21 _ 5ao2 Xe — sss 5a, Xo, 
, 
(5) g: Ly = AoLo — An; — AjeLe — +++ — Ay, (i = 1,---, 9), 
, 

Km Bus (m > p), 
we prove as before that this element represents a Cremona transformation in 
G with a homaloidal web (L,C{"p{*" --- p3*”)**™. 

This linear group g becomes infinite when p = 5. An easy verification of this 
is obtained from two sets of values of x; 21, Z2, --- , 25, namely: 


Ax: 5(k? = k + 1); R, R, k? =— k + i, (k — 1)?, (k = sy; 
Bi: 5(k? — k + 1); (k — 2), (kK — 12,2 —k +1, 8, #. 


Since 734; sends A, into B,,;, and i123 sends B, into Axis, then 73452123 sends A, 
into Ax+2, and thus has an infinite period. 


6. The Cremona involution /,.;,; its Cremona group G and linear group g. 
Let C; be three skew lines on a quadric Q, and py, --- , ps four points in generic 
position. Then J;234 is defined as in 2(1) by the web K = (C3p, --- ps)*. If Ky 
is mapped upon E by (q - - - q)®, Cs mapping into (919393949596) and pr --- ps 
into 7; --- r4, the net residual to C; cut out on K, by K maps into the Geiser net 
(94959e11 --- Ts)*, which determines the involution G, the map on E of J)234 on 
K,. As before, the surfaces of K with nodes at pi, --- , ps, respectively, are the 
P-surfaces of the isolated F-points p:, --- , ps. Also Q is the P-surface of an 
F-curve of the first kind L contained simply on the homaloidal web Hj23,4 of 
Ti234, and met in three variable points by a surface K. 


The web (q --- gs)* is transformed by G into a web (q$qéqir? --- r$s,s_8)", 
where s; is the transform of q; (¢ = 1, 2,3). If to this we add the map of C; 
five times, we get a system (qj°--- qi'rf --- rs, --- s,)", whence His = 


(LC3pi --- ps)”. 
A member of H234 cuts Q in 15 cross-generators, whence the order of L is 15. 
The P-surface of p; cuts Q in 3 cross-generators, whence L has triple points at 
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Pi, +++ , pa, the three directions at p, on L not being on P,,, since these latter 
directions are self-corresponding. 

Let the three skew lines C3 be \y, As, As. The web K contains the degenerate 
member (Ayp:)!- (AcAsPepsp4)?, invariant under J934. The plane (Ayp;)' is cut 
by the web K in a net of curves of degree one only, whence J;234 transforms the 
plane into the quadric. On taking away this quadric, and the surface P,,, 
from a member of Hy234, we have a surface (LA{A}A3pj{ --- pi)", the P-surface 
of \;. This cuts Q in 10 cross-generators, whence L is 10-secant to \;. Thus 
L, of order 15 with triple points at pi, --- , ps and 10-secant to each of Aj, Az, As, 
is met by a member of K in 3 variable points as expected. 

The F-curves of the second kind of H1234 consist of 12 lines |; ; from point p; 


across the two lines yA, (¢ = 1, --- , 4; 9, k, = 1, 2, 3); and of four cubic 
curves cS") on p;, px, pr and bisecant to each of the three lines A (7, --- ,1 = 
1, --- , 4), these curves being three-fold on Hi234. On eliminating the F-curves 


of both kinds from the common curve of two members of Hi234, we have a curve 
of order 15, the proper transform of a line under Jj234. Hence 


(1) The homaloidal web and P-surfaces of the involution I\234 have the following 
description in terms of the F-system: 


Hy234(LC 3p} ibe ps) (cS Me. oh, ‘es 1,3); 


Pi = Q(C;)*; 
Py, = (LATARASPT «++ pac” ++ She «++ lala +++ Leads 
P;, = (CPi PeDs Ps) (ey cS cs Labels). 

To an F-point on \, there corresponds a P-curve on P,, which is a rational 
space quintic on pi, --- , ps, l-secant to L, 4-secant to Ay, and 3-secant to each 
of Ae, As. - 

The involutions J;23, with fixed triad of lines C3 and variable p, --- , ps 


generate a Cremona group G of ternary type. With the same change of char- 
acteristic as before (except that we set y; = 62; (i = 1, --- ,4)) we find the 
following expression of the effect of Tiss: 

ro = 5a — 6a, — Gre — 623 — 6x, 
(2) tesa? 2; = Xo — 2B — Le — Te — Me — 2% (i = 1,---,4), 


rz, =; (j > 4). 


7 


This involution 7;23, of determinant +1 has the invariant linear and quadratic 
forms 


(3) L=m—u—%—---—2,, Q= 2) — Gr} — Gr} —--- — 62%. 


In this case the form of the generic element of the group g generated by 
elements tkimn (Kk, --- ,m = 1, +--+, p) is 
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Xo = awto — Ganzi — --- — Gaopz,, 
(4) 9: 2, = awl — and — --+ — ail, (J = 1,---,p), 
ri = Im (m > p). 


To this element g of the linear group there corresponds an element G of the 
Cremona group with a homaloidal web of the form (L,C$"p{* - - - p§*)**™. 
The linear group g is infinite if p > 6. For if 


Ay = (kK? + k + 1)(ao — 23 — 24) — (K + 1)°(t1 + 22) — (as + 2), 
By = ( + k + 1)(ao — 2s — 24) — W(x + 22) — (kh + 1)°(a5 + 2%), 


then t3456 transforms A, into By,:, and i234 transforms B,.; into Ax,s, whence 
2345621234 has an infinite period. 


7. The Cremona involution /;23;;; its Cremona group G and linear group g. 
Let J12345 be the involution defined as in 2(1) by the web K = (C2N?p, - -- ps)’, 
and let Q = (N®C;,)? be the quadric cone with node at N and on the conic C2. 
The generators of Q are P-curves whose F-points run over an F-curve of the 
first kind L contained simply on the homaloidal web H,...; of I;...s. 

The sections of K, by the web K yield, under the mapping of 2(b), the Geiser 
net (45971 --- 7s)* with involution G. Plane sections of K, map into the web 
(qj: --- q)* on E, and this passes by G into the web (q§q§rj --- ris; --- s,)", the 
partial map of the sections of K; by M,...5, 8, --- , 84 being the transforms of 
m, --:,Qs by G. We add to this partial map the conic {(q --- qs)?}®, taken 
six times, in order to have like multiplicities for each g. We add also the conic 
{(q. --- qe)?}, taken 12 times, in order to bring the multiplicities of the q’s up 
to one third of the order of the web. The resulting web of order 54 shows that 
H,...5 = (LC3N™ pj --- pi). The simple points s, --- , sy above arise from 
the four variable intersections of K, and LZ. As before, the P-surface of p, is 
P,, = (C,N* pip, --- ps)*. These P-surfaces, and all surfaces K, meet Q outside 
C. in four generators. A member of H;...; meets Q in 24 variable generators, 
in addition to C2, whence L has the order 24. 

The web K contains the degenerate member (C,)'-(N*p, --- ps). Since 
the plane is not cut by K in a net of degree two, the plane and quadric inter- 
change under /,...;, the P-surface of C2 separating from the homaloid which 
corresponds to the plane. Hence the P-surface of Cz is an (LC}N™ pj} --- p§)”. 
Moreover, the quadric passes into the plane. On taking this plane and the 
surfaces P,,, --- , P,, from twice a homaloid we have twice the P-surface of 
N. Thus the P-surface of N is an (LC}N'p} --- pj)”. 

The P-surfaces of C; and N meet Q outside C, in 20 and 14 generators respec- 
tively, whence L is 20-secant to C; and has a 14-fold point at N. These multi- 
plicities account for the four variable intersections of L and the web K. We 
can verify as usual that H,...; is transformed by J;...; back into the web of planes. 

The F-curves of the second kind include the five lines 1 from N to p, 
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(i = 1, --- , 5) and the 10 conies c{'” on N, p;, p; and the two points on C2 cut 
out by the plane (Np;p;)' (¢, 7 = 1, --- ,5; 7 #9). Another F-curve appears 
from the following considerations. Surfaces K cut the plane (C2)! in the net of 
straight lines, and the cone Q in a net of sextic curves with a 4-fold point at N, 
On Pi, --+ , Ps, and 4-secant to C2. For example K,, mapped on £, yields the 
line qsq_ and the conic (r; --- 75). This conic maps into the sextic curve men- 
tioned. On the particular surface K = P,,, this sextic consists of the generator 
Np, and a residual quintic curve c; with a triple point at N on pi, --- , ps and 
4-secant to Cz. Obviously cs is on every P,,, and also on H,...5, necessarily 
5-fold. We verify now that two surfaces of H;...; meet outside the F-curves in 
a curve of order 18, the transform of a line by J;...;.. Hence 


(1) The homaloidal web and P-surfaces of the involution I,...5 have the following 
description in terms of the F-system: 


1.s(LCSN™® 7 ... pZ)BT) ... eft 2* ... elt eS); 
= Q(N?C;)*; 
Pc, = (LCSN™ pS --- p8)(c{!2” ... eft *e8); 
Py = (LC{N'pj --- pg) «-- Ue «-- eft 8); 
Pp, = (CN pips «+= PIM EY «Ley, 


To any F-point on C, there corresponds a P-curve on Pe,, which is a sextic 
curve with a triple point at N, simple points at pi, --- , ps, cutting C2 and L 
six times and once, respectively. 

The elements of the three types of Cremona groups @ just discussed, with 
fixed F-curves C5, C4, C3, respectively, have had an F-curve of the first kind, Ly, 
which was variable with the element, and which was determined by the other 
F-clements. Thus the element II of G could be suitably described without a 
particular description of this curve Ly. An essential feature of this situation 
is that Ly has a P-surface Q = (C,)?, which is the same for all the elements of G. 

In the case of the involution J;...; above, the P-surface of L is the quadric 
cone with node at the isolated F-point N. In order to keep this P-surface 
fixed, we consider the Cremona group G generated by involutions J;2345 with 
fixed C, and N, and with variable positions of p, --- , ps only. 

We examine first the effect of Ji234; upon linear systems of surfaces. Let 
y be the order, 7, yo, 8, Yi, --- , ¥s the multiplicities of L, C2, N, pi, --- , ps for 
such a system. After transformation by J234; we obtain the new characteristic 


y = 18y — 29 — 16y0 — 10s — 3y, — --- — dys, 
f= y — yo — 8, 
(2) yo = 6Gy—g — by —3s—ym—--- — Ys, 
s’ = 12y — 29 — 10y — 7s — 2y, — --- — 2ys, 
yi = Ty — 6y —4s—y—----—ys—y (i =1,---,5). 
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If we set y — yo — 8 = 2, 9 = 2, we observe that 
2’ = §, a’ = 2, s’ — 2yy = —(s — 2y). 
The remaining equations (2) are then covered by 
y’ — 2ye = Gly — 2ye) — 4(¢ — 2p) — 91 — --- — Ys, 
yi = Ty — 2yo) — 4(s — 2yo) —y—--- -—ys—ys (6 =1,---,5). 


We shall be interested only in the transforms of the web of planes for which 
s — 2yo = 0, and therefore s’ — 2y5 = 0. On dropping this term and setting 


y — 2y = To, Y= q2; (i = 1, ss , 9), 
we obtain 
, . -— - = 
Yo = OX) — (2, — (Xe — +--+ — 62s, 
- - ’ ; - 
(3) ties: FT; = To — Hh — Feo—- ++ — Xe — Zi @Q= a sae , 9), 
' ° - 
r; =7; is > Sp. 


This involution ¢,234; has the determinant —1 and the invariant linear and quad- 
ratic forms 


(4) L=xmy—-%m—%—--- —7f,, Qu se. - 78, — 7, — +>. — Fz". 


If the generic element of the linear group g generated by tjximn (k, --- ,m S p) 
has the form 


Lo = aXo — Tat, — --- — Tao, 2x, , 
~ , ‘ 
(5) J: 2, = Anlo — Anti — +++ — AigL, (i _ 1,--- + iP 
, 
Xm = Im (m > p) ’ 


there corresponds to it an element G of the Cremona group with a homaloidal 
web of the form 


(Ly Cf» N% pit... pinion 

We find that this linear group g is infinite when p => 7. For if 
Ax 
By 


(Pe +k+ 1)(x» — r3 — 2a — 25) — (kK + 1)2(a1 + 22) — (x5 + 2), 
(kK? + k + 1)(to — x3 — a4 — 25) — (ti + 22) — (K + 1)7(25 + 20), 


then 72345 transforms B, into Ax. :, and tg4567 transforms A, into By.,, whence 
t12345 tsase7 transforms B, into B,,2, and thus has an infinite period. 
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CRITICAL POINT THEORY UNDER GENERAL BOUNDARY 
CONDITIONS 


By Marston Morse AND GEorGE B. VAN ScHAACK 


1. Introduction. MM. Morse has previously treated the theory of the critical 
points of a function of class C?, whose critical values are isolated and the neigh- 
borhoods of whose critical sets admit a special type of deformation, ref. [7, 8]. 
In the present paper it is assumed merely that the function f has continuous 
first partial derivatives which satisfy Lipschitz conditions and that the critical 
values (not critical points) of f are isolated. In spite of the fact that the 
critical sets may not be locally connected or possess neighborhoods which are con- 
tractible, it is shown that the type numbers of the critical sets are finite and 
depend on the definition of f only in an arbitrarily small neighborhood of the 
critical set. We emphasize the fact that this part of the treatment does not 
depend at all upon the definition of f in the large. 

The authors have also taken up the theory of critical points on an abstract 
metric space, ref. [10]. The present treatment, although less general, is simpler 
and more suitable for most applications in analysis. The reader may also refer 
to the papers of A. B. Brown [2], Lefschetz [5], and Birkhoff and Hestenes [1]. 
See [12] for a later abstract by Morse. 

The second part of this paper contains the first treatment under general 
boundary conditions. The principal theorem here was announced by Morse 
in [6]. This part of the paper has important applications in the theory of 
harmonic functions, as will appear in a subsequent paper by Morse. 

Finally, various group theory aspects of the problem are brought out. 


I. Type numbers 


2. The function and the critical set. Let (x) = (x, --- , z,) be the rectangu- 
lar coérdinates of a point in euclidean n-space. Let R be a limited, open, n- 
dimensional region of this space. Let f(x) be a real, single-valued function of! 
class C'L defined on R. A point of R at which all of the first partial derivatives 
of f vanish will be called a critical point of f. The value of f at a critical point 
will be called a critical value of f. We assume that the critical values of f are 
isolated. Neighboring each ordinary (non-critical) point of f the differential 
equations of the trajectories orthogonal to the loci f = constant may be given 
the form 


Received February 26, 1936. 

1 A function defined on an open region will be said to be of class C'Z on this region if 
it is of class C' and if its partial derivatives satisfy Lipschitz conditions neighboring each 
point of the region. 
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dz; 


ri x (i= 1,---,n). 


Since f is of class C'L on R, there is a unique trajectory through each ordinary 
point on R. 

By a critical set o of f will be understood any closed? set of critical points on 
which f is a constant ¢ and which is at a positive distance from all other critical 
points of f. A critical set may or may not be connected. In general it will 
not be a finite complex. By a neighborhood of ¢ we mean an open set of points 
of R which includes all points of R within a small positive distance of ¢. We 
shall admit only those neighborhoods of « whose closures contain no critical 
points at which f = c other than the critical points of «. A neighborhood of o 
will be termed arbitrarily small if all of its points lie within an arbitrarily small 
distance of o. 


3. Deformations and maximal sets. In §§3-7 we shall be concerned with a 
single critical set ¢ of f on R on which f = c. We begin by considering two 
deformations of R. We shall make use of the trajectories 


a = ~ is (§ = 1,---,n), 


orthogonal to the manifolds f = constant. We make the convention that there 
is a trajectory coincident with each critical point at all times ¢. 

The deformation D(t). Let p be a point of R and let p, be the point corre- 
sponding to t = 0 on the trajectory which issues from p when ¢t = 0. It is 
understood that p, does not necessarily exist for all values of ¢. Under the 
deformation D(t) the point p shall be replaced by the point p, at the time ¢. 
The value of f at p, is a non-increasing function of ¢ at all values of ¢ for which 
pis defined. Critical points are held fast under D(¢). 

The deformation A(t). Recall that c is a critical value of f. Let p be a point 
of R at which f > c, and \ the trajectory which issues from p when ¢t = 0. 
Under the deformation A(t), p shall be deformed as under D(t), provided f re- 
mains greater than c on dX. If f = ¢ at a point p, on X, p shall be deformed 
as under D(t) for t S 7, and shall be replaced by p, for all values of t > 7. The 
points of R at which f < c shall remain fixed under A(t). We distinguish be- 
tween a deformation such as A(t) and the final image P, of a point P under 
A(t). The deformation A(¢) shall refer to the set of all images P, of P for which 
0 < r S t, while the final image of P under A(é) shall refer to the image P,. 

We introduce the following definition. 

A neighborhood Y of ¢ will be said to be A-contractible if there exists a neigh- 
borhood X of o such that the images of Y under A(é) lie on a closed subdomain 
of X for all values of f 2 0. We term Y A-contractible on X. This type of 
contractibility does not imply that Y can be deformed on X into an arbitrarily 
small neighborhood of c. 


2 Closures shall be taken relative to the entire space (z). 
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Let VW be an arbitrary pair of neighborhoods of ¢ of which WC V. We 
shall distinguish between two types of cycles* neighboring ¢. We shall refer 
to these cycles as belonging to «. We shall say that a point of R at which 
f < cis below c. 

By a spannable k-cycle corresponding to VW we shall mean a k-cycle on W, 
below c, ~ 0 on W, but # 0 on V below ec. 

By a critical k-cycle corresponding to VW we shall mean a k-cycle on W, # 
on V to a k-cycle on V below c. 

Null cycles are naturally admitted, and we make the convention that on a 
null eyele the value of f may be chosen at pleasure, and in particular may be 
taken below c. By virtue of this convention no cycle which is dependent on V 
can be a critical cycle corresponding to VW. 

We introduce the following definition. 

Let the cycles of a class C of k-cycles be distinguished by the possession of 
certain properties. By a maximal set A of cycles of C will be meant a finite 
set of cycles of C, every proper linear combination (always mod 7) of whose 
cycles belongs to C, and which is such that there exists no set of cycles of C, 
every proper linear combination of whose cycles belongs to C and which con- 
tains A as a proper subset. 

The following theorem is of fundamental importance. 

NEIGHBORHOOD THEOREM. There exists a fixed neighborhood N* of o, and 
corresponding to any neighborhood X of « on N* a neighborhood M(X) of oa, 
A-contractible on X and with the following property. Corresponding to any two 
pairs of neighborhoods X M(X) and Y M(Y) of o, of which X and Y lie on N*, 
there exist on any arbitrarily small neighborhood of « common maximal sets of 
spannable and critical k-cycles. 

In §§4—6 we shall be occupied with the proof of this theorem. 

The maximal sets of the theorem are of importance in that they depend 
only on the neighborhood of ¢. We shall subsequently define type numbers of 
o with their aid. 


4. a-admissible neighborhoods. We begin the proof of the Neighborhood 
Theorem with the following lemma. ; 

Lemma 4.1. Corresponding to an arbitrary neighborhood X of o there exists a 
neighborhood A(X) of « which is A-contractible on X. 

Let Y be a neighborhood of « whose closure lies on X. If the lemma is 
false there must exist an infinite set of points P, (n = 1, 2, --- ) which tend to 
a point on o as n becomes infinite and which under A(t) possess images Q, 
respectively on the boundary of Y. Let Q be a cluster point of the points Q,. 


3 We deal with the case mod xz, where 7 is any prime > 1. When the restriction mod x 
is omitted, the development holds for the absolute case as well. The notation w = 0 is 
used to imply that nw ~ 0, where n is an integer # 0 mod x, Lefschetz [4]. More generally, 
the whole theory is valid for coefficients in an arbitrary commutative field, chains, cycles, 
and homologies being defined with coefficients in that field. 
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It is clear that f = cat Q. The trajectory which passes through Q has a posi- 
tive distance from o, as do the trajectories which pass through points Q,, suffi- 
ciently near Q. This is contrary to the nature of the points P,, which lie on 
trajectories through the respective points Q,. Hence the lemma holds as 
stated. 

The deformation A(t, 7). We introduce the product deformation 


A(t, r) = A(t)-D(r) (t >0,7 20). 


We understand thereby that an arbitrary point P on R is deformed into a 
final image P,,, as follows. The point P is first deformed under A(é) into a 
point P,. We take P,,, as the final image of P, under D(r). This deformation 
deforms points below c through pojnts below c. 

If X is an arbitrary set of points on R, the subset of points of X below ec will 
be denoted by X.. 

We come to the following lemma. 

Lemma 4.2. Corresponding to arbitrary neighborhoods X and N of oa, any 
sufficiently large value of t = 0 and sufficiently small value of + > 0 will have 
the following property. Under A(t, r), A(X) is deformed on X onto N + X-. 

Let M be a neighborhood of ¢ such that‘ M CN. According to the preceding 
lemma there exists a neighborhood Y of o such that Y C X and such that A(X) 
is deformed on Y under A(t). It follows from the definition of A(t) that for ¢ 
sufficiently large the final image U of A(X) under A(t) will lie on M + Y.. 
If the deformation D(r) now be applied to U, the ordinary points of U at which 
f = c will thereby be deformed into points below c. If this deformation be 
terminated at a sufficiently small value of t > 0, M and Y will not thereby be 
deformed off N and X respectively. Thus the resultant deformation will 
deform A(X) on X onto N + X.. 

We shall prove the following lemma. 

Lemma 4.3. There exists a A-contractible neighborhood T of o such that the 
Betti numbers of TY. are finite. 

Let 2 be an arbitrary neighborhood of ¢. If e is a sufficiently small positive 
constant and N a sufficiently small neighborhood of o on which f > c — e, 
the following statements will be true. From each point of N. an orthogonal 
trajectory will lead on R to a point Q on f = ¢c — e. The set S of these points 
Q will have a closure which lies on R and which consists of ordinary points of 
fon R. There will exist® a finite complex K on f = c — e covering S. If K 
forms a sufficiently small neighborhood of S on f = c — e and K’ is the set of 
points on orthogonal trajectories issuing from K on which ¢c — e < f < ¢, 
N + K’ will lie on A(Q). 

We set l = N + K’, and observe that I is A-contractible on 2. We shall 
show that I, has finite Betti numbers. 


‘ The closure of a point set E will be denoted by E. 
5 The methods of Cairns [3] suffice to prove this statement. 
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To that end we note that [, = K’. With the aid of the orthogonal trajec- 
tories we see that the Betti numbers of K’ are those of K, and accordingly finite. 

The proof of the lemma is complete. 

We introduce the following definition. 

An ordered pair of neighborhoods VW of « will be termed a-admissible if V CT 
and W is A-contractible on V. We abbreviate the phrase ‘corresponding to any 
a-admissible pair of neighborhoods VW’ by the expression a-adm VW. 


5. Existence of maximal sets a-adm VW. In this section we shall prove the 
following theorem. 

TueoreM 5.1. There exist maximal sets of spannable or critical k-cycles 
a-adm VW. The number of cycles in such sets is less than a finite constant inde- 
pendent of pairs VW which are a-admissible. 

Before coming to the proof of the theorem we define certain subclasses of 
spannable cycles. 

A spannable (k — 1)-cycle ux, a-adm VW, will be called linkable a-adm VW, 
if bounding on [. If wx; is linkable, there exists a chain \; on [’, such that 


Nn? — Man (on [,) . 
By virtue of the definition of a spannable (k — 1)-cycle, there also exists a 
chain A; on W such that 
(5.1) dy —> Maa (on W). 
We set 
(5.2) Me + Me = Ae, 


and term A, a k-cycle linking up», a-adm VW. We shall say that A, belongs 
to o. 

A spannable k-cycle a-adm VW which is independent on [, will be termed a 
newly-bounding k-cycle a-adm VW. 

We begin the proof of Theorem 5.1 with the following lemma.* 

Lemma 5.1. If A is a A-contractible neighborhood of « and V C A, there exists 
no homology of the form 


(5.3) mrAx + ne, + we ~ O (on A), 


where ; and cy, are respectively linking and critical k-cycles a-adm VW and w;, 
is a k-cycle below c, unless m = n = 0, mod x. 

Suppose there were an homology of the form (5.3). We shall prove that 
m =n = 0, mod z. 

The deformation A(t, r) applied for a sufficiently large time ¢ and sufficiently 
small time 7 will not only deform A onto V + R, but also deform W only on V. 


6 [t follows from this lemma that there exists no relation of the form mA, + neg + we =O 
(on A) unless m =n =0. 
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Let AZ, cz and w;, be, respectively, the final images of Ay, cz and w, under A(t, 7). 
We have the homologies 


MA —A\, ~0 (on V + R.), 
(5.4) Cc. —c, ~0 (on V), 
we — we ~ 0 (on R,) . 


Moreover, under A(t, 7), (5.3) yields the homology 
(5.5) mdi + nce + we ~O (on V + R,). 
Substituting the homologies (5.4) in (5.5), we find that 
(5.6) mrAx + nee + we ~ O (on V + R.). 
We can write (5.6) in the form 
Zeer + 241 > MAz + NCE + We (on V + R.), 


where z,,; is a chain on V and z;,; isachain on R.. Let z, and z, be respec- 
tively the boundaries of z;,,; and z,,,. Let us; be the linkable (k — 1)-cycle 
a-adm VW linked by Ay. Recalling (5.2) and using the preceding bounding 
relations, we see that 


(5.7 Zp +2, = md, + mdi + nce + (mod 7) . 
We find that the chain 
mr; + Nk — 2 (on V) 


reduced mod 7, lies on R,, since the remaining chains in (5.7) lie on R,. From 
(5.1) we see that 


(5.8) mdr; + Ne, — 24 > MuUg-1 (on V). 


Proof thatm = 0. If m ¥ 0 in (5.8), the spannable cycle mu,_, bounds on 
V below c. This is contrary to the nature of a spannable cycle. Hence m = 0. 

Proof thatn = 0. Since m = 0, it follows from (5.7) that the chain z; — wz, 
reduced mod z, lies on V. Observe that z, ~ 0 on V. Hence we see from 
(5.7) that 


(5.9) NCk ~ 2, — We (on V). 


But z;, — w, is a cycle below c. Hence (5.9) is contrary to the nature of the 
critical cycle c, unless n = 0. Thusn = 0. 

The proof of the lemma is complete. 

Proof of Theorem 5.1. Since T is A-contractible, there will exist a neighbor- 
hood A of « such that A is a finite complex, which covers I’, and approximates 
lr so closely as to be A-contractible. We make such a choice of A. 

(a) Critical k-cycles. Let (c), be a finite set of critical k-cycles a-adm VW, 
every proper linear combination c, of whose cycles is of the same type. It 








226 MARSTON MORSE AND GEORGE B. VAN SCHAACK 


follows from Lemma 5.1 that c, # 0 on A. Hence the number of cycles in 
(c), is at most the k-th Betti number R, of A. But R;, is finite. It follows 
that there exists a maximal set of critical k-cycles a-adm VW, and the number 
of cycles in such a set is at most R;, a number which is independent of a-admis- 
sible pairs VW. 

(b) Linkable (k — 1)-cycles. Let (u),_; be a finite set of linkable (k — 1)- 
cycles a-adm VW, every proper linear combination u,_, of whose cycles is of 
the same type. Let (A), be a set of k-cycles which link the respective (k — 1)- 
cycles of (u)x1. Let u,x_; be a proper linear combination of the cycles of (u).—1, 
and A; the corresponding linear combination of the cycles of (A),. Recall that 
dx is a linking k-cycle a-adm VW. It follows from Lemma 5.1 that A, # 0 
on A. Thus the number of cycles in (A);, and hence in (u),_1, is at most the 
preceding number R;. Hence there exists a maximal set of linkable (k — 1)- 
cycles a-adm VW and the number of cycles in such a set is bounded by Rx. 

(c) Newly-bounding k-cycles. Let (v), be a finite set of newly-bounding 
k-cycles a-adm VW, every proper linear combination of whose cycles is of the 
same type. It follows from the definitions of such cycles that they lie on I., 
and are independent on T... Hence the number of cycles of (v); is at most the 
k-th Betti number P; of T.. It follows from Lemma 4.3 that P; is finite. Hence 
there exists a maximal set of newly-bounding k-cycles a-adm VW, and the 
number of cycles in such a set is at most P,, a number which is independent of 
a-admissible pairs VW. 

Theorem 5.1 follows from (a) in so far as it refers to critical cycles. With 
respect to spannable cycles, observe that a maximal set of spannable k-cycles 
a-adm VW will be afforded by the sum of maximal sets of linkable and newly- 
bounding k-cycles a-adm VW, as follows from the definitions of the cycles 
involved. The theorem accordingly holds for spannable cycles as well. 


6. Proof of the Neighborhood Theorem. In this section we shall establish the 
Neighborhood Theorem of §3. Our first lemma is the following. 

Lemma 6.1. Corresponding to an arbitrary neighborhood V of o on IT, there 
exists a neighborhood B(V) of « on A(V) such that if 


WC BV), 


a maximal set of critical or spannable k-cycles a-adm VW is also a maximal set 
a-adm V B(V). 

The lemma will be proved for the case of critical cycles. The proof for the 
case of spannable cycles is similar. 

Let W and W’ be any two neighborhoods of o for which W C W’ C A(V). 
Let (c), be a maximal set of critical k-cycles a-adm VW. It follows at once 
from the definitions of critical k-cycles and of maximal sets that (c),; is a subset 
of a maximal set of critical k-cycles a-adm VW’. But the number of cycles in 
a maximal set of critical k-cycles is finite. Hence if W’ is taken as a sufficiently 
small neighborhood B(V) of o, the number of cycles in (c), will be independent 
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of W, provided W C B(V), and (c), will be a maximal set of critical k-cycles 
a-adm V B(V). The proof of the lemma is complete. 

We continue with the following lemma. 

Lemma 6.2. There exists a fixed neighborhood H of o on V such that if 


(6.1) ’caz, WC B(A)-B(V), 


a maximal set of critical or spannable k-cycles a-adm HW is a maximal set 
a-adm VW. 

We consider the case of critical k-cycles. Suppose V and V’ are any two 
neighborhoods of ¢ for which V C V’ C TI, and W is any neighborhood of ¢ on 
B(V’)-B(V). The pairs VW and V’W are then a-admissible. Theorem 5.1 
applies, and maximal sets of critical cycles exist. Moreover, it follows from 
the definition of critical cycles that a maximal set (c), of critical k-cycles 
a-adm V’W is a subset of a maximal set of critical k-cycles a-adm VW. Recall 
that the numbers of cycles in such sets a-adm V’W are bounded with respect 
to all pairs V’W which are a-admissible, and with the above choice of W de- 
pend only on V’, by virtue of Lemma 6.1. If then V’ is taken as a sufficiently 
small neighborhood H of ¢ on T, the set (c), a-adm V’W will be a maximal set 
a-adm VW, and the lemma holds as stated. 

We introduce the following definition. 

A pair of neighborhoods VW of o will be termed 8-admissible if (6.1) holds. 
The phrase B-adm VW will have the obvious meaning. 

We come to the following theorem. 

THEOREM 6.1. Corresponding to any two B-admissible pairs of neighborhoods 
VW and V'W’ of o there exist on any arbitrarily small neighborhood of « common 
maximal sets of spannable and critical k-cycles. 

Let N be any neighborhood of o such that NC W-W’. Let (c), be a maximal 
set of critical k-cycles B-adm HN. Such a set exists by virtue of Theorem 5.1. 
It follows from Lemma 6.2 that (c), is a maximal set of critical k-cycles both 
B-adm VN and f-adm V’N. It then follows from Lemma 6.1 that (c), is a 
maximal set of critical k-cycles both B-adm VW and B-adm V’W’. Hence the 
theorem holds for critical cycles. 

The proof for the case of spannable cycles is similar. 

Proof of the Neighborhood Theorem (§3). If we choose the neighborhoods N* 
and M(X) of the Neighborhood Theorem respectively as the neighborhoods 
H and B(H)-B(X) defined above, we see that the Neighborhood Theorem is 
an immediate consequence of Theorem 6.1. 

We introduce the following definition. 

An ordered pair of neighborhoods VW of o will be termed admissible if they 
satisfy the conditions 


VC M(N*), WCM(V), 


where N* and M(X) are the neighborhoods of the Neighborhood Theorem. 
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The phrase adm VW will have the obvious meaning. 

It follows at once from the Neighborhood Theorem that the total number, 
say m,, of cycles in maximal sets of spannable (k — 1)-cycles and critical k-cycles 
adm VW is independent of the choice of admissible pairs of neighborhoods VW. 
We define the number m;, to be the k-th type number of the critical set ¢. 

The type numbers of « depend on the definition of f only in an arbitrarily small 
neighborhood of o. It is by virtue of the Neighborhood Theorem that the type 
numbers exist and are finite. 


7. Group aspects in the small. In this section we shall discuss the group 
theory aspects of cycles neighboring co. 

We shall deal with a quotient’? G = A/B of groups of cycles A and B in which 
the operation is addition mod x. This will be understood throughout and need 
not be repeated. In the group G the operation will again be addition. The 
elements of G will be classes of cycles, two cycles belonging to the same class 
if their difference mod z belongs to the class which is the zero-element of G. 
Any class which is not the zero-element of G will be called a proper class. A 
cycle belonging to a class C will be termed a representative of C. We shall deal 
with groups which possess a finite number of generators. 

Let VW be an arbitrary pair of neighborhoods of ¢ of which W CV. We 
define the following groups corresponding to VW. 

Let A be the group of k-cycles on W below c, ~ 0 on W, and B the subgroup 
of cycles dependent on V below c. We term A/B the k-th spannable group 
corresponding to VW. 

Similarly let A be the group of k-cycles on W, and B the subgroup of cycles = 
on V to cycles below c. We term A/B the k-th critical group corresponding 
to VW. 

We have the following theorem. 

THEOREM 7.1. A cycle is a spannable (critical) k-cycle corresponding to VW 
if and only if it is a representative of a proper class of the k-th spannable (critical) 
group corresponding to VW. 

Let VW be an a-admissible pair (§4) of neighborhoods of ¢. We define the 
k-th linkable group a-adm VW to be the group of those classes of the k-th 
spannable group a-adm VW whose representative cycles are dependent on I.. 

The quotient of the k-th spannable group by the k-th linkable group a-adm 
VW will be termed the k-th newly-bounding group a-adm VW. We can and 
shall regard the elements of the latter group as classes of cycles. 

We have the following theorem. 

THEOREM 7.2. A cycle is a newly-bounding k-cycle a-adm VW if and only 
if it is a representative of a proper class of the k-th newly-bounding group a-adm VW. 


7 For additive groups, quotients such as A/B are frequently written as A mod B. 
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II. The fundamental relations 


8. The function and the region. We turn now to the problem of determining 
the fundamental relations between the type numbers of the critical sets of f 
and the Betti numbers of the domain of definition of f. 

Let > be a limited, open, n-dimensional subregion of the region R of §2, 
whose closure 5 lies on R, and whose boundary B is a closed point set consisting 
of a finite number of connected, regular, non-intersecting (n — 1)-spreads* of 
clas§ C*. Let f, denote the directional derivative of f on the normal to B in 
the sense that leads from points on = to points not on >. 

A function f will be termed A-admissible on > if it satisfies the following 
conditions. 

A I. The function f shall be of class C'L on an open region containing = and 
shall have only a finite number of critical values on >. 

A Il. The function f shall be of class C? neighboring B. The directional deriv- 
ative f, of f shall be positive on B. 

We assume that the function f is A-admissible on =. As in [9], we can alter 
the definition of f neighboring B so that the resulting function, which we will 
again call f, is A-admissible on =, but in addition is constant on B, its value on B 
being greater than at any point of =. This alteration can be made without 
introducing any new critical points. We assume that f has been altered in 
this way. From this point on we consider f only on 3. 

If a and b are two ordinary values of f, with no critical values between them, 
the domains f < a and f < b are homeomorphic, [9]. When there are critical 
values between a and b this will not in general be so. We are concerned in 
what follows with the topological differences between the domains f < a and 
f <= 6, and the manner in which these differences depend on the critical points 
of f. 

We recall the definitions of critical sets and their neighborhoods made in §2. 
Let D be a closed subdomain of = on whose boundary f has no critical points. 
The set w of all critical points of f on D at which f = c will be termed a complete 
critical set on D. In general the points of w may be grouped into a finite number 
of disjoint critical sets in several ways, but it follows from the definition of 
critical sets that it is not possible to group the points of » into an infinite en- 
semble of disjoint critical sets. Suppose w is the sum of disjoint critical sets 
@1, --- ,@m. It follows from the definition of critical sets that the admissible 
pairs of neighborhoods used in defining the type numbers of the several sets 
w; may be chosen so small that pairs belonging to distinct sets w; are disjoint. 
Hence the k-th type number of w equals the sum of the k-th type numbers of 
the several sets w;, and is thus independent of the way in which w is broken 
up into a sum of a finite number of disjoint critical sets. 


8 An (n — 1)-spread is said to be regular and of class C’ (r > 0) if in the neighborhood 
of each of its points one of its coérdinates can be represented as a function of class C’ 
of the remaining coérdinates. 
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9. Classification of cycles. Let ¢ be a critical value of f, and 6 > can ordinary 
value, such that between c and b there is no critical value of f. In this section 
we shall give a maximal set of k-cycles on f < b, independent on f < 5, in terms 
of k-cycles belonging to the complete critical set w of f on f = ec. 

Let o be any critical set of f on which f = c. Linkable, linking and newly- 
bounding k-cycles adm VW are formally defined as in §5, the domain I, being 
replaced by the domain f < c. A linking k-cycle 4, adm VW which links a 
linkable (k — 1)-cycle ux, adm VW may be represented in the form 


(9.1) Me = An +, 


where \, is a chain on W and X{, a chain on R,, the boundaries of \;, and X being 
Up and —u,_; respectively. 

We come to four lemmas on linking and critical cycles. We begin with the 
following. 

Lemma 9.1. Let (u)x_1 be a set of linkable (k — 1)-cycles adm VW, and (A), 
a set of k-cycles linking the respective (k — 1)-cycles of the set (u)x. adm VW. 
A necessary and sufficient condition that (u),_, be a maximal set of linkable (k —1)- 
cycles adm VW is that (A), be a maximal set of linking k-cycles adm VW. 

The proof of this lemma is essentially the same as that of Lemma 6.1, [8]. 
One need only replace the symbol ~ by #, the word “sum” by the phrase 
“proper linear combination,” together with certain changes of sign depending 
upon the substitution of mod zw for mod 2. 

Our next lemma follows. 

Lemma 9.2. Jf VW is an admissible pair of neighborhoods of o, any k-cycle 
on V is = on N* to a linear combination of critical k-cycles adm VW and cycles 
below c. 

It follows at once from the definitions that a k-cycle on V is = on N* toa 
sum of critical k-cycles corresponding to N* M(N*) and cycles below c. But 
it follows from the Neighborhood Theorem that a maximal set of critical k- 
cycles corresponding to N* M(N*) may be taken as the cycles of a maximal 
set of critical k-cycles adm VW. 

The proof of the lemma is complete. 

Lemma 9.3. Let (A), be a maximal set of k-cycles linking adm VW. Then 
any k-cycle on W + R, is = on N* + R, to a linear combination of cycles of 
(A)x, critical k-cycles adm VW and k-cycles below c. 

Let (u),1 be the set of (k — 1)-cycles linked respectively by the cycles of 
the set (A),. Let z, be an arbitrary k-cycle on W + R.. If sufficiently finely 
divided z, can be represented in the form 


(9.2) Ze = 2, + 2h, 


0 ° » ” ° . P 
where z, is a chain on W and z, a chain on R,. Suppose z,_; is the boundary 
, . . . 
of z,. The cycle z,_, is below c. It accordingly satisfies an homology 


(9.3) Ze = TUny (on V below ec), 


where u,_; is a proper linear combination of cycles of (u),.1, and r = 1 or 0. 








rh 
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Let ux: be a proper linear combination of cycles of (u),1, and A, the corre- 
sponding linear combination of the cycles of (A)x. By virtue of (9.3) there 
exists a chain w; on V below c and an integer m such that 


Wi —> MZ~-1 — MTUE_1 (m # 0). 
Upon using (9.1) and (9.2) we find that 
(9.4) mz, — mrr, = (mz, — mrr;, — w,) + (mz, — mrr;, + w,). 


The first parenthesis contains a k-cycle on V and the second a k-cycle below c. 
But k-cycles on V are = on N* to sums of critical k-cycles adm VW and cycles 
below c. Since the modulus z is prime, the congruence (9.4) yields the lemma.® 

We return to the complete critical set w of fon f < b, on which f = c. We 
define a new set of cycles. A k-cycle below ¢ independent below c, and inde- 
pendent below c of the spannable k-cycles adm VW is termed an invariant 
k-cycle adm VW. 

We come to a basic theorem. [8, Theorem 6.1.] 

THEOREM 9.1. A mazimal set of k-cycles on f < b, independent on f < b, 
is afforded by maximal sets of critical, linking and invariant k-cycles corresponding 
to an admissible pair of neighborhoods VW of the complete critical set w on which 
Sf = ¢C. 

We shall prove the theorem by proving statements (a) and (b). Statement 
(a) follows. 

(a) Any k-cycle z, on f < bis = onf < b to a linear combination of the k- 
cycles of the maximal sets adm VW of the theorem. 

By means of the deformation D(t) of §4, the domain f < b can be deformed 
on itself onto the domain W + R,. Hence we lose no generality if we suppose 
that z, lieson W + R.. It follows from the definition of invariant cycles that 
cycles below c are = on f < b to a linear combination of invariant cycles. 
Statement (a) follows at once from Lemma 9.3. 

(b) The cycles of the maximal sets of the theorem are independent on f < b. 

Suppose that there existed an homology of the form 


(9.5) mrx + nex + Tin ~ O (on f < b), 


where m, n, r = 1 or 0, and where Xx, cx, and 7; are respectively linking, critical 
and invariant k-cycles adm VW. 

It follows, as in the proof of Lemma 5.1, that the homology (9.5) may be 
taken on V + R,. Continuing as in the proof of Lemma 5.1, we infer that 
m=nz=0. Hence (9.5) implies the homology 


(9.6) rin ~ 0 (on f < b). 


® Were xz not a prime, the congruence derived from (9.4) by replacing multiples of the 
parentheses on the right by the desired cycles might contain a leading coefficient on the 
right of the form mn =0, and the proof would fail. 

10 In what follows it is convenient to suppose that all neighborhoods of w which are ad- 
mitted lie on f < b. 
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Since 7; is on R., we can deform f < b on itself onto W + R., keeping 7, fixed. 
Hence (9.6) implies the homology ri, ~ 0 (on W + R,). Let 2x4; be the chain 
on W + R, bounded by ri,. We can write 


- , ” . 
(9.7) Zhaa Ht Ze41 > My, 


, ° ‘ - ” ° , ” ° 

where z,,, 1s a chain on W and z,,, achainon R,. Let z, and z, be, respectively, 
. ’ ” 

the boundaries of z,,, and z,,,. From (9.7) we see that 


° , ” 
rh, — 2, — 2, =0. 
- . ” 0 ” 
Since 7, and z, are on R,, the cycle z, ison R,. Andz, ~0onR,. Hence 
° , 
(9.8) TY, ~ Zk (on R,). 


If z, ~ 0 on R,, r must be zero in (9.8), for invariant cycles are independent 
below c. If z; # 0 on R,, 2, is a spannable k-cycle adm VW. We again infer 
that r = 0, since invariant k-cycles are independent below c of spannable 
k-cycles adm VW. 

Thus in (9.5), m = n = r = 0, and the proof of (b) is complete. The theorem 
follows directly. Cf. note 6 following Lemma 5.1. 


10. The relations. We come to the relations between the type numbers of 
the critical sets of f and the Betti numbers of >. 

Recall that w is the complete critical set of f on = on which f = c. Leta 
and b (a < b) be ordinary values of f between which c is the only critical value. 
Relative to the critical value c and the constants a and b, a new k-cycle shall 
mean a k-cycle on f < b, not = onf < b to k-cycles on f <a. Relative to the 
critical value c and the constants a and b, a newly-bounding k-cycle shall mean 
a k-cycle on f < a, independent on f < a, but bounding on f < b. 

Since the Betti numbers of the domains f < a and f < b are finite, it follows 
that maximal sets of new k-cycles and newly-bounding (k — 1)-cycles exist. 
Moreover, the numbers of cycles in such maximal sets are independent of 
ordinary values a and b (a < b), between which c is the only critical value. 
Let mj, m;, be respectively the numbers of cycles in maximal sets of new k- 
cycles and newly-bounding (k — 1)-cycles relative to the critical value ec. 

Recall that a newly-bounding k-cycle adm VW is a spannable k-cycle adm VW 
which is independent below c. It follows from this and from the definition of 
invariant (k — 1)-cycles adm VW that a maximal set of (k — 1)-cycles inde- 
pendent below ¢ consists of maximal sets of invariant and newly-bounding 
(k — 1)-cyclesadm VW. Of these cycles the invariant (k — 1)-cycles adm VW 
remain independent on f < b, according to Theorem 9.1. Hence m; equals 
the number of newly-bounding (k — 1)-cycles adm VW in a maximal set of 
such cycles. It also follows from Theorem 9.1 that mz is the number of critical 
and linking k-cycles in maximal sets of such cycles adm VW. It follows from 
Lemma 9.1 that the numbers of cycles in maximal sets of linking k-cycles and 
linkable (k — 1)-cycles adm VW are the same. Observe finally that maximal 
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sets of linkable and newly-bounding (k — 1)-cycles together form a maximal 
set of spannable (k — 1)-cycles. 

Combining the above statements with the definition in §6 of the type numbers 
mx of w, we see that m, = mi; + m;. We are thus led to the following theorem. 

THEOREM 10.1. Let a and b, a < b, be two ordinary values of f between which 
c is the only critical value. Let AR, denote the k-th Betti number of the domain 
f < b minus that of the domain f < a. Let m, be the k-th type number of the 
complete critical set w on which f = c. Finally, let mi; and m; be, respectively, 
the numbers of cycles in maximal sets of new k-cycles and newly-bounding (k — 1)- 
cycles relative to the critical value c. Then 


AR, = mi — mia m, = mi +m, (k = 0,1,---,n), 


where my = m,,, = m, = 0. 

The following theorem, which is an immediate consequence of Theorem 10.1, 
affirms the validity in the present case of the earlier relations of Morse [8, 
Theorem 1.1]. In it f is a function which is A-admissible on 2. 

TuHEoreM 10.2. Between the Betti numbers R; of = and the sums M; (i = 
0, 1, --- , ») of the i-th type numbers of the critical sets of f on 2, the following 
relations hold: 


Ro S Mo, 
Ro — R, = Mo — M,, 
Ro — Ri + Re S Mo — Mi + Mo, 


Ro — Ri + Re - cae + (—1)*R, = Mo — Mi + M2 — — + (—1)"M,. 


11. Group aspects in the large. We shall examine the results of §9 from the 
standpoint of groups. 

Let VW be an admissible pair of neighborhoods of the complete critical set 
w of §9. We make the following definitions. 

Let A be the group of k-cycles on W + R, and B the subgroup of cycles 
= on N* + R, to cycles on V plus cycles below c. We term A/B the k-th 
linking group adm VW. 

Similarly, let A be the group of k-cycles below c and B the subgroup of these 
cycles = 0 onf < b. Relative to the critical value c we term A/B the k-th 
invariant group. 

Finally, let A be the group of k-cycles on f < a[f < 6], and B the subgroup 
of cycles = 0 onf < a[f < bj. One terms A/B the k-th Betti group of f < a 
If < 6}. 

We have the following theorems. 

THEOREM 11.1. A k-cycle on W + R, is a linking k-cycle adm VW if and 
only if it is a representative of a proper class of the k-th linking group. 

TuHEoreM 11.2. A k-cycle below c is an invariant k-cycle adm VW if and only 
if it is a representative of a proper class of the k-th invariant group. 





234 MARSTON MORSE AND GEORGE B. VAN SCHAACK 


Theorem 9.1 takes the following form. 

THeEorREM 11.3. The k-th Betti group of f < 6b is isomorphic with the direct 
sum of the k-th linking group adm VW, the k-th critical group adm VW, and the 
k-th invariant group. 


III. General boundary conditions 


12. Admissible functions. We come to a generalization of the conditions A 
of §8. Let the region = and its boundary B be defined as in §8. 

Let P be any point on B. It follows from the regularity of B that B can be 
represented neighboring P in the form 


Zi = 9; (ul, --- , u*")  . ee 


where the functions g; are of class C* neighboring a point (uo) that determines P, 
and where the matrix of the first partial derivatives of the functions g; is of 
rank n — 1 at (vw). Parameters (u) in such a local representation of B will be 
termed admissible. 

The function f° defined by f on B will be termed the boundary function defined 
by f on B. 

We shall represent f° in terms of parameters of B, using, however, only those 
parameters of B which we have termed admissible. 

Let (u) be a set of parameters admissibly representing B neighboring a point 
P on B, and let ¥(u) be the value of f° in terms of these parameters (u). The 
critical points of f° neighboring P are defined by the critical points of y(u). 
The critical points of f° are clearly independent as points on B of the parameters 
(u) which are used locally to represent B. The critical sets of f° on B are 
defined in the same manner as the critical sets of f on R. 

We shall subsequently refer to neighborhoods of the critical sets of f°. We 
understand that these neighborhoods are open in B, in the point set sense. 

A function f(x) will be termed B-admissible on > if it satisfies the following 
conditions. 

B I. The function f shall satisfy condition A I. 

B II. The function f shall be of class C* neighboring B and shall have no critical 
points on B. The boundary function f° shall have a finite number of critical 
values. 

By making use of the trajectories orthogonal to the manifolds f° = constant, 
we can define a deformation on B similar to the deformation A(t) of §3. A- 
contractible neighborhoods on B of a critical set o° of f° can then be defined 
essentially as in §3 and a theorem established similar to the Neighborhood 
Theorem in §3. Admissible pairs V°W° of neighborhoods of o° on 8 and the 
type numbers of o° can be defined essentially as before. Complete critical sets 
of f° on closed domains D of B are defined as were complete critical sets of f 
on closed domains D of &. 

Let 8 be a connected boundary spread of B. We admit the possibility that 
the boundary function f° be identically a constant on 8. Suppose then that 
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f° =con8g. The spread 8 is a critical set of f°. A sufficiently small neighbor- 
hood of 8 on B will be identical with 8. Neighboring 8 on B there are no points 
below c and hence no spannable cycles belonging to 8. Each cycle on 8 which 
is non-bounding on 6 will be a critical cycle belonging to 8. The number of 
critical k-cycles in a maximal set of such cycles belonging to 6 will equal the 
k-th Betti number of 8. 

Recall that f, is the directional derivative of f on the normal to B in the sense 
that leads from points on = to points not on =. The set of points on B at which 
tf, = 0 will be termed the negative boundary of >. 


13. The principal theorem. The following theorem gives the main results 
of this part of the paper. In it f(z) is a function which is B-admissible on 2. 

THEOREM 13.1. Let M; (i = 0,1, --- , n) be the sum of the i-th type numbers 
of the complete critical sets of f on = and of the boundary function on the negative 
boundary of S, and let R; (j = 0, 1, --- , n) be the j-th Betti number of . The 
numbers M ; and R; satisfy the relations in Theorem 10.2. Cf. Theorem 1.1, [8]. 

This theorem is obtained by applying the results of Theorem 10.2 to the 
function F(x) described in the following lemma. In this lemma we let o° denote 
an arbitrary complete critical set of f° on the negative boundary of >. 

FUNDAMENTAL LemMA. On a suitably chosen open region including > there 
exists a function F(x) which is identical with f(x) on 5 except neighboring B, and 
which is A-admissible on >. 

The critical points of F(x), other than those of f(x), may be grouped into critical 
sels o(a°) which correspond in a one-to-one manner to the critical sets o°, and possess 
type numbers mo, --- , Mp, which equal the corresponding type numbers of o(c°), 
while m, = 0. 

The remaining sections of the paper will be occupied with the proof of this 
lemma. In §14 we shall modify f at the points on = neighboring the critical 
sets of f° on the negative boundary of = in such a way that on B neighboring 
these critical sets the new function will have a constant negative normal direc- 
tional derivative. In §15 we shall extend the definition of the new function 
to points neighboring B outside ©. The function H thus obtained will have a 
positive normal directional derivative on a spread B, neighboring B. The 
function H is of the type already studied in §§8-10. Its critical sets are de- 
termined by those of f and of f°, as we shall see in §16. 


14. Modification of f within >. In this section we shall make use of special 
coordinates neighboring B. Let (2°) be the rectangular coérdinates (x) of a 
point P on B. It can be shown that the normals to B neighboring P form a 
field neighboring P. Let (u) be a set of admissible parameters in terms of which 
B is locally representable neighboring P. Let (uo) be the parameters of P. 
On each normal to B neighboring P let s be the are-length measured positively 
in the sense of the outer normal, with s = 0 on B. There exists a positive 
constant » and on B a neighborhood M of (uo) such that on the normals to B 








236 MARSTON MORSE AND GEORGE B. VAN SCHAACK 


at points of M the points at which | s | < » cover a neighborhood M* of (x°) 
in a one-to-one manner. Sets (u, s) for which (u) is on M and | s| S » may 
be considered as new coérdinates for the points of M*. The transformation 
from the coérdinates (x) to the coérdinates (u, s) will be of class C? and will 
have a non-vanishing jacobian. We shall call such codrdinates (u, s) admissible 
neighboring P. We can suppose the constant 7 is so small that the points on 
the normals to B at which | s | S » cover a neighborhood of B in a one-to-one 
manner include no critical points of f, and include only points at which f is 
of class C%, 

It is convenient at this point to define several functions. 

The invariant function g on B. In terms of local codrdinates (u) on B let 


do? = gi; du' dw (i,j =1,---,n—1) 
be the differential quadratic form defining the metric on B. Let ¥(u) be the 


local representation of the boundary function f° and let y; = ~. The function" 


g = 9 Vili (i,j = 1,---,n—1]) 
is an invariant function on B and assumes a proper minimum on each critical 
set o° of the boundary function f°. 


The functions I(z) and X(z). Let U(z) be a function of class C* for all z such 
that 


(0) = 0, 
l(z) = 0, 4s 2, 
V’'(0) = 1, 


l’(z) 20, 0< 2 
-asl’(z)s 


1’’(0) — 


| 

se 
— 
A 
nx 
nw 
IA IA 
_ - 


where a@ is an arbitrary positive constant. Let A(z) be a function of class C? 
for all z such that 


A(z) = 1, 0s2 <1, 
0 Ss A(z) S$ 1, l<# <4, 
A(z) = 0, 4s 2’. 


Functions l(z) and A(z) can easily be constructed. 

The functions g(a, --- , tn) and domains U,. Let o° be a complete critical 
set of f° on the negative boundary of =. If r is a sufficiently small positive 
constant, the points of B connected to o° at which the invariant function g < 3r 


1 See Eisenhart, Riemannian Geometry, Princeton, 1926, p. 14. 
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form a neighborhood U? of o°. Let U, be the domain of points neighboring B 
which are on the normals to B at points of U° and for which 


02s> —3r (3r < n). 


We suppose the constant r is so small that the domains U, corresponding to the 
complete critical sets o° with different critical values are disjoint and that 
f, < 0 on the closure of each of these domains. Such values of r we term 
admissible. 

On each domain U, we define a function g(x). Let f? be the value of f, on B 
and let M < 0 be a lower bound of f? on B. Let P be a point of U, on the 
normal to B at a point Q. Let (u, s) be admissible coérdinates neighboring P. 
Let &(u, s) be the local representation of f neighboring P and ¥(u) the local 
representation of f? neighboring Q. Then neighboring P at the point (zx) 
determined by the codrdinates (u, s), g(x) shall have the representation 


g(x) = (u, s) + n(2)(e) [M—W(u)] (—3r<s <0), 


r 


where [(z) is defined as previously with a taken as r. 
One readily verifies the fact that g(x) is of class C?. The directional deriva- 
tive of g along the outer normal to B is 


(14.1) g = % + v(*)a(2) [M — ¥]. 
On U°, s = 0, and for¢g Sr, (2) = 1. 


Hence at points of U° neighboring o° for which ¢ 
J, = M ° 


IIA 
3 


For r sufficiently small the corresponding function g has no critical points. 
For on U, the function f, is bounded above by a negative constant, say —m. 
On the same domain the second term in the right member of (14.1) is bounded 
above by the positive number —7?M. Thus if we choose an admissible value 
of r so small that | r?>M | < m, we shall have g, < 0 on U,. Hence g will have 
no critical points. : 

Finally, on the subdomain of U, exterior to the domain 


g < 2r, —2r<ss0, 


the function g(x) is identical with f(z). 

The function G(x, --- , 2). On = we define a function G(x) which is a 
modification of f(z). At a point (x) of = not on any of the several domains 
U, we set G(x) = f(x). At a point (z) of a domain U, we set G(x) = g(z), 
where g(x) is the function corresponding to U,. 

One readily verifies the statements in the following lemma. 
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Lemma 14.1. The function G(x) is of class C\L on = and is identical with f(x) 
on B. In general G(x) is identical with f(x) except in neighborhoods of the com- 
plete critical sets o° of f° on the domain f? < 0. At points on B neighboring these 
critical sets o°, G, is the negative constant M. 


15. Modification of f outside >. Let s; and s2 be two arbitrarily small posi- 
tive constants with 0 < s; < se < 7, where 7 is the constant used in §14. Let 
>; (¢ = 1, 2) denote the points of = together with the points neighboring B for 
which 0 S s < s;._ Let B, denote the boundary of >). 

The function h(x, --- ,2,). Let S denote the points neighboring B for which 
0 < s < s. OnS we shall define a function h(x). Recall that M is a negative 
lower bound of f? on B. Let G° be the value on B of the directional derivative 
of G(x) along the outer normal to B, where G(z) is the function of §14. Let L 
be a positive constant larger than —M. Let P be a point on S on the normal 
to B at a point Q. Let (u, s) be admissible codrdinates neighboring P. Let 
¥(u) be the local representation of f° neighboring Q and x(u) the local repre- 
sentation of G? neighboring Q. Then neighboring P at the point (x) determined 
by the coérdinates (u, s) the function A(z) shall have the representation 


3 3 
(15.1) h(x) = f° + sG? + a = ¥(u) + sx(u) + ie (OS 8s < &). 
Sy 8} 


One sees that h(x) is of class C?. 
The function H(x). On 22 we define a function H(x). To that end we set 


H(x) = G(z) (on =), 
H(x) = h(x) (on S). 


We shall prove the following lemma. 

Lemma 15.1. The function H(x) is A-admissible on 2,. The functions H(z) 
and f(x) are identical neighboring their respective critical sets on >. 

The function H(z) is of class C? on = neighboring B and on S. One readily 
shows that H(z) is of class C'L neighboring B and hence of class C'L on 2. 
The normals to B are also normals to B;. On B, the directional derivative H, 
along the outer normg! to B, is positive. For from (15.1) we find that 


H,=G2+L (on B,). 
But since G° = M, we see that 
H,=M+L>0 (on B,). 


The proof of the lemma is complete. 


16. The critical sets of H neighboring B. It is clear that the only critical 
points of H(x) other than those of f(x) occur at points neighboring B for which 
0 < s < gs. In this section we shall discuss the existence of such critical 
points. 
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Let o° be a complete critical set of f° on the negative boundary of =. Let 
the constant r have the value used in defining the function G(x) of §14. The 
points of B connected to o® for which the invariant function g¢ < r form a 
neighborhood U* of o°. Let U denote the domain of points neighboring B 
which project orthogonally by means of the normals to B into U* and for which 
058 < 8. 

We now set 





and prove the following lemma. 

Lemma 16.1. The points of the point set o* on U for which s = s* and which 
project orthogonally into the points of o° on B are the only critical points of H(x) 
on U. 

Let a neighborhood of each point on U be represented by admissible coérdi- 
nates (u,s). The conditions which define the critical points (u, s) of H neighbor- 
ing a point of U are, according to (15.1), 


aH _ af | 3G! 


(16.1)’ 3 = = 0 ({=1,---,n—ljg<r), 
” dH ' Ls? 

(16.1) ap = Ge + ~~ = @ (p <r). 
8 81 


The points of o* satisfy (16.1)’ as follows from the fact that at each point of 
oon B 


Tw 0, G? = M. 


au 


The second condition (16.1)’’ bears upon s alone and is satisfied by s = s*. 
Hence the points of o* are critical points of H(z). 

Moreover, the condition (16.1)’’ is satisfied only by s = s*. On the other 
hand, (16.1)’ is satisfied only when (u) represents a point of o°, because for 
0O<¢e<r 


0 
aG? _ aM _ 4 


aul ltt 





and for at least one value of 7 


0 
x = ~ (O<e<r). 
The proof of Lemma 16.1 is complete. 
The point set o* of Lemma 16.1 will be termed the critical set of H(x) corre- 
sponding to the critical set o° of f°. 
We continue with the following lemma. 
LemMa 16.2. For 82 sufficiently small the function H(x) has no critical points 
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for which 0 S s < 8» except the points of the critical sets o* corresponding to the 
complete critical sets o° of f° on the negative boundary of >. 

Let B’ denote the complement on B of the sum of the preceding neighborhoods 
U*. With each point Q on B’ let there be associated an arbitrarily small open 
neighborhood W°® of Q on B admissibly represented by parameters (u). Let W 
denote the domain of points (u, s) for which (uw) is on W° and 0 S s < 8». 

If Q is not a critical point of f° at least one of the partial derivatives aH /du‘ 
does not vanish at Q. These derivatives are independent of s; and so. Ac- 
cordingly the neighborhood W° of Q and the constant s2 can be chosen so small 
that at least one of the derivatives 0H /du‘ does not vanish on the corresponding 
domain W. 

If Q is a critical point of f° on B’ it follows from the absence of critical points 
of f on B that at Q, H, = f? # 0. Moreover at Q, H, = f? > 0, for otherwise Q 
would be on B — B’. But H, depends upon s; only through the term Ls*/s}. 
Hence a diminution of s,; merely increases the value of H,. Accordingly, the 
neighborhood W°® of Q and the constant sz can be chosen so small that H, will 
not vanish on the corresponding domain W. 

It now follows from an application of the Heine-Borel theorem that B’ can 
be covered with a finite number of open neighborhoods W? (h = 1, --- , A) such 
as W° and that accordingly one choice of the constant s2 can be made such 
that on each of the corresponding domains W = W,, at least one of the partial 
derivatives of H does not vanish. The function H(z) has no critical points 
on these domains W;, and the lemma holds for the above choice of 8». 


17. The type numbers of the critical sets of H neighboring B. We shall 
prove the following lemma. 

Lemma 17.1. The critical points of the function H(x) other than those of f(x) 
may be grouped into critical sets o* which correspond in a one-to-one manner to 
the complete critical sets o° of f° on the negative boundary of =. The first n type 
numbers of the sets o* are equal to the corresponding type numbers of the corre- 
sponding sets 0°. The (n + 1)-th type number of each critical set o* is zero. 

The first statement in the lemma is merely a rephrasing of the results obtained 
in §16. We proceed to the proof of the second statement. 

Let o° be a complete critical set of f° on the negative boundary of 2. Let 
o* be the critical set of H(x) which corresponds to the set o° in the sense of §16. 
Recall that s* is the value of s at points of c*. Let B* be the manifold defined 
by s = s*. We define a deformation J which deforms the points neighboring 
B* through such points onto B*. If s(p) is the value of s at a point p neighbor- 
ing B*, the point p shall be deformed under J along the normal to B through p 
in such a fashion that as the time increases from 0 to 1 the difference | s(p) — s* | 

_ decreases to zero at a rate equal to its initial value. 

Let c° be the value assumed by f° on o° and let c* be the value assumed by 
H(x) on o*. We shall show that J deforms points neighboring o* below c* 
through such points. Recall that . 
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(17.1) H(z) =p + (ate +) (OS 8s < s). 
38} 

Among points neighboring o* the parenthesis in (17.1) depends only upon s 
and assumes a relative minimum value, say c', on o*. Let P be a point neigh- 
boring o* and Q its projection on B. Let fQ be the value of f° at Q. Suppose 
that H(z) < c* at P. Observe that c* = c® + c', and hence fg < c®°. During 
the deformation J, Q remains fixed, while the function H(x) decreases mono- 
tonically to the value H = fg + c'. Hence J has the property stated. 

Let f* be the function defined by H on B*. Then f* = f°+c'. Hence f* isa 
function of the same type as f*. The set of points o* is a critical set of f*. Let 
us denote o*, thought of as a critical set of f*, by w*. Since f* and f° differ by 
only a constant, it is clear that the type numbers of w* are equal respectively 
to the first n type numbers of o°. 

Let VW be an admissible pair of neighborhoods of o* on R. For V suffi- 
ciently small and corresponding to V for W sufficiently small, the final images 
V’ and W’ of V and W respectively under the deformation J form an admissible 
pair of neighborhoods of w* on B*. Since J deforms points below c* through 
such points, it follows that maximal sets of spannable and critical k-cycles adm 
VW (k = 0,1, --- ,m — 1) on R are deformed into maximal sets of spannable 
and critical k-cycles adm V’W’ on B*. 

Thus the first n type numbers of o* are equal respectively to those of w* 
and hence to those of o°. The type number m, of o* is null, since H does not 
assume & maximum on o*. [8, p. 166.] 

The proof of Lemma 17.1 is complete. 


18. Proof of Theorem 13.1. The function H(z) of the preceding sections has 
a positive normal directional derivative on the boundary B, of =;. The spread 
B, is of class C? while the spread B is of class C*. Were B, of class C*, we could 
apply Theorem 10.2 to establish Theorem 13.1. We avoid the difficulty by 
transforming 2; into = and B, into B in accordance with the following lemma. 

Lemma 18.1. There exists a one-to-one non-singular transformation T* of 
class C? which carries =; into = and Ez into a subdomain of 2. 

The transformation T* may be defined in essentially the same way as the 
transformation T* in [9, §23] was defined. 

We come to the proof of the Fundamental Lemma of §13. Suppose that 
under the transformation T* the region 2- is carried into a region ~’ and that 
the point (y) on 22 is carried into the point (x) on =’. On D’ we define a function 
F(x) by the identity 

F(x) = H(y). 


It follows from the properties of H(z) enumerated in Lemmas 15.1 and 17.1 
and the properties of T* that the Fundamental Lemma is valid for the func- 
tion F(z). 

Theorem 13.1 then follows as stated. 
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PROOF THAT EVERY POSITIVE INTEGER IS A SUM OF FOUR 
INTEGRAL SQUARES 


By R. D. CarmIcHAEL 


The proof here given of the named classical theorem is a little longer than that 
offered by L. E. Dickson! in 1924, but has some elements of interest on account 
of the elegance of the method. Moreover, the reciprocal relations employed are 
of interest in themselves. Of the known proofs ours is most closely related to 
those of Dickson and Euler. 

Let us write? 


(1) a* + ab? + Be? + afd? = pq, 


where a, b, c, d, a, 8, p, q are integers, p, q are positive and a, 8 are not negative. 
Let 2, y, z, t, 4, wu, p, « be integers, and write 


(2) Aq = {qd+ (ax — aby — Bez — aBdt)}* + a{qu + (bx + ay + Bdz — Bet) }* 
+ Biqp + (cx — ady + az + abt)}? + aB{qo + (dx + cy — bz + at)}?. 


The sum of the squares of the parenthesis quantities within the braces, multiplied 
by the indicated outside factors, is 


(3) (a? + ab? + Be? + afd*)(x* + ay? + B2* + at’), 


or pq(x? + ay? + B82? + aft*), in accordance with the usual (and readily verified) 
product theorem for the forms in question. Hence A is an integer, and we have 


(4) A = q(\? + ap? + Bp? + aBo*) + p(z* + ay*® + B2* + aft’) 
+ 2a(dr + any + Bpz + aBot) + 2ab(—dry + ux + Apt — Boz) 
+ 28e(—drz — aut + px + acy) + 2aBd(—dM + wz — py + o2). 


The expression for A is invariant under the transformation 
(5) (p, q)(z, A)(y, u)(z, p)(t, o)(a, a)(b, —b)(e, —c)(d, —d)(a, a) (8, B). 


So is equation (1). Hence we may perform on (2) the transformation (5) to 
obtain the relation 


(6) Ap = (px + ad + aby + Bep + aBde)? + a(py — bd + au — Bdp + feo)? 
+ B(pz — ch + adu + ap — abc)? + aB(pt — dd — cu + bp + ao)’. 


It may also be verified directly that (6) is implied by (1) and (4). In the pres- 
ence of (1) equations (2) and (6) are equivalent. They constitute the reciprocal 
relations on which our proof is based. 


Received January 9, 1936. 

1 L. E. Dickson, Amer. Journ. of Math., vol. 46 (1924), pp. 1-16; see esp. pp. 2-5. 

2 We here need the immediately following results (two paragraphs) only fora = 8 = 1, 
but it seems well to put the more general formulas on record. 
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We shall now prove the following theorem. 

I. If p is a prime factor of the sum of four integral squares and is not a factor of 
each of them, then p is itself a sum of four integral squares. 

Since 2 = + 1? + 0 + O°, we assume in the following proof that p is an odd 
prime. 

By hypothesis, there is some multiple pm of p which is a sum of four integral 
squares not all divisible by p. Then let pq be the least positive integral multiple 
of p which is a sum of four integral squares not all divisible by p. We then have 
a relation of the form 


(7) av+bh+ece+d = pq, 


where a, b, c, d are integers not all divisible by p. We are to establish the theo- 
rem by proving that qg = 1. 

That q < p follows at once from the fact that if a, b, c, d are replaced by their 
residues, modulo p, of least absolute value we have an equation of the character 
of (7) with q < p. 

The numbers a, b, c, d in (7) have the greatest common divisor 1, since by 
dividing through by the square of their greatest common divisor we could other- 
wise replace (7) by another equation of like character and with a smaller value of 
q, contrary to the hypothesis that q has the least value possible. 

We next show that we are led to a contradiction if we assume (as we now do 
for the moment) that g > 2. Since a, b, c, d have the greatest common divisor 1, 
integers 2, y, 2, t exist such that 


ax — by — cz — dt = 1. 


We employ these integers x, y, z, t. Use equation (2) with a = 8 = 1, taking 
\ = 0 and choosing uy, p, ¢ so that the bases of the squares, after the first, in the 
second member of (2) are in absolute values not greater than q. Since qg > 2, we 
have 1 < 4q. Therefore, A in equation (2) is positive and is less than q if q > 2. 
Then from (6) we have an equation of the form 


ai +b] + ci +d? = pA O<A<q<p) 
Since A < p, it follows that not all the numbers a, bi, ¢:, d; are divisible by p. 
Since A < gq, we have a contradiction on assuming that q > 2. Hence q S 2. 
But if we assume that ¢ = 2 we have 
e’+bh4+ c+ ad = 2p. 


Since p is odd, the numbers a, 5, c,d are not alleven. Then at least two of them, 
say a and b, are odd, and the other two are both odd or both even. In either 
case we may write 


p = {3(a + b)}? + {3a — 6}? +13(¢ + dD}? + [3 — @)}?, 


thus representing p as a sum of four integral squares. It is obvious that they 
are all prime to p. Hence q # 2. 
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It follows therefore that gq = 1, and we have 


p=@+RP4+C42@, 
a sum of four integral squares, as was to be proved. 

We show next that every prime p satisfies the hypothesis in Theorem I, by 
giving the usual proof of the following long-known theorem: 

II. Every prime p is a divisor of a number of the form u2 + v? + 1, where u and v 
are integers. 

For p = 2take u = 1,v = 0. Henceforth, in the proof, assume that p is odd, 
say p = 2k + 1. Give to u successively the values 0, 1, 2, --- ,k. The least 
non-negative remainders of the corresponding numbers u?, after division by p, 
are k + 1 in number and are all different since if two of them, say those corre- 
sponding to uw and ue, are equal, we have uj — u3, and hence uw — we or 
Ui + Ue divisible by the prime p, p = 2k + 1, and this is impossible when u; and 
ue take values from the set 0,1, 2,---,k. Likewise from —v*? — 1 we obtain 
k + 1 different least non-negative remainders by giving to v the values 0, 1, 2, 
--- , k and dividing by p. Some number in one of these two sets, each of k + 1 
remainders, must be equal to a number in the other set, since otherwise we 
would have 2k + 2 different remainders all in the set 0, 1, 2, --- , 2k, and this 
is impossible. For the corresponding numbers u and v we have u? + v* + 1 di- 
visible by p. Hence the theorem is established. 

Now the number 1 is a sum of four integral squares: 1 = 1° + 0? + 0? + 0°. 
From Theorems I and II it follows that every prime is a sum of four integral 
squares. Hence if there is any positive integer which is not the sum of four 
integral squares, it must be composite. If we suppose m to be the least positive 
integer which is not a sum of four integral squares, then we may write m = mmo, 
0 < m < m,0 < mz < m, whence it follows that m, and likewise me, is a sum of 
four integral squares. From the product theorem for a sum of four squares by a 
sum of four squares (implied in our derivation of the reciprocal relations) we see 
that m itself must be a sum of four integral squares, contrary to the hypothesis 
that m is the least positive integer which is not a sum of four integral squares. 
Thus we have the following theorem. 

III. Every positive integer is a sum of four integral squares. 


UNIVERSITY OF ILLINOIS. 











TOPOLOGICAL FOUNDATIONS IN THE THEORY OF CONTINUOUS 
TRANSFORMATION GROUPS 


By P. A. Smirx 


1. Introduction. This paper contains an account of some elementary 
topology connected with the notion of continuous group of transformations. 
From the modern point of view the group structures which are studied in the 
classical Lie theory are frequently not groups at all and have group-like proper- 
ties only in restricted regions! of definition. This circumstance is due partly 
to the nature of the analysis involved and partly to the topological inadequacy 
of the ordinary types of codrdinate systems and is therefore unavoidable. As 
a result, the group concepts of the classical theory are necessarily rather nebulous. 
We have attempted, however, to bring these concepts into sharper being, to 
crystallize their topological properties by means of definitions formulated 
postulationally in the spirit of the modern theory. We have not tried to obtain 
the highest degree of generality, or abstraction; our object is rather to define as 
simply as possible the types of group structures that one actually encounters 
and to study some of their simplest topological properties. We shall appeal 
occasionally to results in the Lie theory, but for the most part we have made it 
a point to treat situations which are essentially topological with topological 
methods.” 

After the preliminary definitions we consider the question whether partial 
structures are essentially more general than total or completely defined struc- 
tures or whether on the contrary every partial structure can be considered as 
being simply a piece of some total structure. In the most general case the 
answer is as yet unknown. We have, however, settled the question in certain 
special cases. In this connection we require a preliminary examination of the 
notion of transitivity, particularly transitivity in the neighborhood of a point, 
and this in turn requires the study of the topology of spaces whose elements are 
cosets of a given partial subgroup. In the final section we obtain new relations 
between the fundamental group of a given continuous group @ and that of a 
space ¥ in which @ operates transitively. Some results of this sort have already 
been obtained by Cartan;* our relations, however, have a more quantitative 
character, since they involve the ranks of certain subgroups of the fundamental 


Received April 26, 1935. 

1 For this reason expositions of the Lie theory which do not define constantly the regions 
in which the objects dealt with have their existence would appear to have little validity 
beyond a purely formal analytic one. This systematic preoccupation with domains of 
definition seems only to occur in the classic treatise of Lie and Engel [9]. 

2 In the study of Lie groups the advantages which are gained from an interplay between 
topology and analysis are excellently illustrated in Cartan’s monograph [2]. We shall 
draw frequently from the ideas suggested in this work. 

$ [2], p. 27. See also Ehresmann [3], p. 399. 
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groups and the dimensions of @ and ¥. They are essentially an application of 
the results of a recent paper of the author [13]. 


2. Notation and use of terms. By a space we shall in general mean a 
Hausdorff space which satisfies the first denumerability axiom. This axiom 
asserts that there exists a complete system of neighborhoods which is of the form 
{V,.(p)} (n = 1, 2, --- ), p being an arbitrary point of the space. 

Let S be a space and q a point in S. We shall say that the sequence Aj, 
Ag, --- of open sets in S closes down on q if the following conditions are satisfied: 
(1) A, 2 Ao --- and (2) every point sequence a, a2, - - - such that a; € A; con- 
verges to q. For example, let {V:(p)} be a complete system of neighborhoods 


for S and let A, = II V .(q); then obviously the sequence A, Ag, --- closes 
i=1 


down on q. 

We shall denote the intersection of the sets H and K by H A K and the closure 
of H by H. The letters a, b, --- , 1, m, ai, a2, --- will consistently denote points 
in a space @ and 2, y, z, u, --- will denote points in a space ¥. A, B, --- are 


open sets in @ and X, Y, --- are open sets in X. 


Group structures 


3. Among the points a, b, --- of a space © we shall introduce “‘products” of 
the form ab and identify some or all of them with points of G according to various 
assumptions to be considered. If ab is a point of G, we shall say that “the 
product ab is defined”. For convenience, we shall refer to G together with its 
product definitions as a structure. 

Suppose that in the structure G there exist open sets Ai, --- , A, (m = 2) such 
that for h = 2,3, --- , n the products 


Gn—1Any Ap—2(Anady), «++ , Qi(A2(--+ (Q,aa,) --- )), 
AnAn+1, (GnQn41)On42, >> + » (+++ (Qndays)Onge = + -)dn 


(where a; is an arbitrary point of A;) are defined. The sets A; then determine 
a system (Ai, ---, An) of degree n for G. To assert that (A, B) is a system 
merely implies that every ab (a « A, b € B) is identified with a point of G. If 
(A, B, C) is a system, then ab, be, a(be), (ab)c are points of G, and so on. If G 
possesses a system of degree n, we shall say that @ is of degree = n. If Gis of 
degree = k for each k = 2, then we say that G is of infinite degree. If (Ai, --- , 
A,) is a system in G, it is clear that (A,, Anu, ---, Ax) (1 SA <k Sn) is 
also a system in @. 


4. We introduce the assumption 

I. If ab, be, a(bc), and (ab)c are defined, then a(bc) = (ab)ec. 

Suppose the structure @ satisfies I and is of degree = 3; then G possesses a 
system (A, B, C), and hence (§3) systems (A, B) and (B,C). The totality of 
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points ab (a « A, b « B) will be denoted by AB, and BC is similarly defined. The 
products a(be) (c eC) and (ab)c are defined because of the definition of system, 
and they are identical by I. The point a(bc) = (ab)c will be denoted by abe 
and the totality of these points by ABC. 

In case G is of degree = 4, there is a system (A, B, C, D) and therefore (A, B), 
(B, C), (C, D), (A, B, C), (B, C, D) are systems and therefore the sets AB, BC, 
etc., are defined. In addition, it follows immediately from I and the definition 
of system that 


(1) a(bed), (abc)d, (ab) (ed), a(be)d 


are defined and are identical. We denote the point (1) by abed and the totality 
of these points by ABCD. In what follows we shall have to deal largely with 
systems of degrees 2, 3 and 4, contained in various types of structures satisfying I. 


5. We introduce the further assumptions: 

II. ab and a,b can be identical points of @ only if a = a. ab and ab, can be 
identical only if b = by. 

III. If a, — a and b, — b in G, and if ab, ab, aebe, --- are defined, then 
a,b, — ab. 

A structure @ which is of degree = 2 and which satisfies I, II, III will be called 
a (continuous)* semi-group.5 

A semi-group can be of infinite degree. For example, if there exists a system 
of the form (G, G), then obviously there are systems of the form (G, --- , G) 
and of arbitrarily high degree. A simple example of a structure which is a 
semi-group of this type, but which is not a group, is the following: let G be the 
real numbers in the open interval (0, 1) and let ab be the arithmetic product of a 
and b. The characteristic property here is that all products are defined. In 
general, however, not all products need be defined, even though @ be of infinite 
degree and every point of G be contained in some system. 


6. Consider now the additional assumptions: 

IV. @ contains an identity, that is, a point e such that ae = ea = a for every a 
in @. 

V. @ satisfies IV and possesses a system (EF, E) where e « E. 

VI. G satisfies IV and there is an open set H containing e such that each a in 
H has a (right and left) inverse ain @. If a, ain H, then a,' > a-. 

A structure @ which satisfies assumptions I, III, IV, V, VI and for which 
E = H = @isa (continuous) group; if G satisfies these assumptions but is not a 
group, it will be called a partial group. In both cases the identity is unique, and 
inverses are unique when they exist. A semi-group, as we have defined it, may 
be a group or partial group; conversely, every group is a semi-group, since it 








4 Since we shall consider only structures which are continuous, that is, which satisfy III 
or similar assumptions, the word continuous will generally be omitted in the text. 
5 Cf. Eisenhart [4], p. 15. 
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obviously satisfies II. A partial group, however, need not be a semi-group 
although (§9) there is always a neighborhood of ein which II holds. A structure 
which is a group, partial group or semi-group will be called a group structure. 


7. Let @ be a group or partial group and let G,, Ge, --- be a sequence of 
neighborhoods closing down (§2) on e and all containedin EAH. SinceG, ¢ E, 
the product G,G, is completely defined as a point set in G for every n. Since 
G, & H, each a in G, has an inverse a~ and the totality of inverses will be 
denoted by G;'. 

For every neighborhood A of e, there is an N such that 


GG, & A, G;'cA (n > N). 


If this were not the case, for n = 1, 2, --- there would be points a,, b, in G, such 
that either a,b, @ A ora,'@Aor both. But this is impossible because a,b, — e 
and a,' — e, since a, — e€, bn — e. 

Groups and partial groups are structures of infinite degree. Let n be chosen 
so large that G,.G, ¢ E. Then if a, b, c are in G,, ab and be are points in EF and 
hence a(be) and (ab)c are defined. Hence (G,, G,, G,) is a system of degree 3. 
Now choose m so large that G,, Gn & Gn. If a, b, c, d are points of G,,, since 
Gm & G,, ab, be, ete.; a(be), b(ed), ete., are defined and in fact are all contained 
in E. Hence a(b(cd)), etc., are defined and (G,,, Gn, Gn, Gm) is a system of 
degree 4. It is clear that this process yields systems of arbitrarily high degree. 


8. There exists an N such that G;' is an open set when n > N. We need only 
choose N so that G;' ¢ H forn > N. For suppose that a~' is not an inner 
point of G,*. There exists a sequence b; — a such that b; «eH and b;@G,". 
Then b;' — a so that almost all the b,’s are in G,. This is a contradiction. In 
similar manner, G,G, will be open if n is sufficiently large, likewise G,G,G,, etc. 

If we observe that the set J, = G, + G," has the property that J;' = J,, we 
see readily from the preceding remarks that the sequence G,, Ge, --- can be 
chosen in such a way that G,' = G, for each n. 


9. If G is a group or partial group, there exists a neighborhood G of e such that II 
holds for all points a, b, a,, bi, in G. 

Proof. Let n be chosen so large that G,G, & E and G,;'¢ E. We may take 
G = G,. For if a, b, a are in G, and ab = a,b = ¢, say, thence. Since 
b- « E, (ab)b and (a,b)b~ are defined, and hence by I, a = a. Similarly, if 
ab = ab,, we have b = by. 


10. A structure G which satisfies I, LI, 1V, V and in which e has a neighborhood 
K which is an n-cell, is a group or partial group. 

Proof. Let the points of K be referred to a euclidean coérdinate system and 
let K, © K be a spherical neighborhood of e with radius r, and J, its boundary. 
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Let p be a value of r so small that K, + J, C #. On account of the continuity 
assumption III, we can choose p; < p so small that 


(1) d(a, ka) < p/2 (d = distance) 
whenever aeJ, andkeK,,. Now the totality of points 
kJ, = {ka,aeJ,} 


is a single-valued continuous image of J, and on account of (1), kJ, fails to 
contain e when ke K,,. Now let k be fixed in K,, and different from e. If a 
point k’ moves from e to k along a radius, the successive images k’J, consti- 
tute a deformation of J, to kJ, and since none contains e, the looping coeffi- 
cient® » of kJ, with respect to e equals that of J, and hence » # 0. Moreover, 
if r varies from p to 0, the successive images kJ, constitute a deformation of 
kJ,, and since the final image is merely the point ke = k # e, its looping coeffi- 
cient is 0. Hence some intermediate image kJ; (9 > # > 0) contains e. This 
means that e = kk, where k is a point of J; C K,. Hence each point in K,, 
has a right-inverse in K,. By the same reasoning there exists a K,,, each point 
of which has a right-inverse in K,,. Thus, if ae K,,, there is a point b in K,, 
such that ab = e, and a point b- in K, such that bb-' = e. Hence a = 
a(bb-!) = (ab)b- = eb-' = b-', so that ba = e and b is also a left-inverse of a. 
Hence every point a in K,, has a left- and right-inverse a“ in K,,; a~ is 
unique, for if aa = aa’ = e, then a-(aa) = aaa’) and a = a’. 
Suppose that a, — ain K,,. Since a," « K,,, we can choose a converging subse- 
quence, say a, — c, where ce K,, + J,,, and since p, < p, c is contained in EF 
so that ac is defined. Hence e = a,,; a, — ac, so that e = ac, and in the same 
way e = ca. Hencec = a", anda,'—a. Therefore, if we take K,, for H, 
@® satisfies VI and is a group or partial group. 

We may remark that a sufficient condition that © be a group, or at least con- 
tain a substructure which is a group, is that there exist a system’ of the form 


(G, @). 


11. Realization structures. Let @ and X¥ be spaces. We introduce prod- 
ucts of the form a-x (ae@, x «X), and identify some or all of them with 
points of ¥. This new type of structure will be denoted by (G, ¥). A product 
a-x which is identified with a point of X is “defined’’. 

Suppose that @ itself is a structure and possesses* a system (Aj, --- , An) 
(n = 1) and that X is an open set in ¥ such that 


Qn-XZ, An1-(Qn-X), +++, Gy: (Ge-(--+ (Gn-2) ---)), (a; ¢€ Ay, ee X) 


are defined. There is thus defined a realization system (A,, ---, An; X) of 
degree n in (G, ¥). In particular, (A, X) is a realization system (of degree 1) 


6 For the definition and theory of looping coefficients, see Brouwer, [1]. 
7 A proof can be found in the author’s note [12]. 
8 In case n = 1, (Ai) means merely the set A;. 
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if and only if a-z is defined for each ae A, xe X. We shall have occasion later 
to consider realization systems of the form (A), --- , An; 2) when 2 is a point 
in X. 

We now introduce the assumptions 

I,. If ab, b-z, and a-(b-x) are defined, then so is ab-zx (i.e., (ab)-x), and 
a-(b-x) = ab-z. 

II,. a-x and a-y can be identical points of ¥ only if z = y. 

Note that if G is a group or partial group, and (A, A; X) a system in (G, ¥) 
such that e ¢ A, thene-x = x, for everyxe X. Forlete-x =y. Thene-(e-x) = 
e-yande-x=e-y. Hencey = z by II,. 

Ill,. If a, ~aand x, — 2 and if a-z, a,-x%, --- are defined, then a,-x, — a-xr. 


12. Suppose that @ is a semi-group of degree = n, and that the structure 
(G, ¥) satisfies I,, II,, III, and possesses a realization system (Aj, --- , Am; X) 
of degree m (2 S m <n). Then (G, ¥) is a (continuous) realization of degree 
=>mofG. If X = X we shall call (G, ¥) a total realization. If G is a group or 
partial group, every realization is to satisfy the further condition that there be 
a system of the form (A, A; X), where ee A. It is easy to see (cf. §7) that 
realizations of groups and partial groups are of infinite degree. If a; « Ai, 
then since a@j-(@,41-(@i42 --- (@n-X%) ---)) is defined, it follows from I, that 
OnOn41- (Ange +++ (Qn-%) ---), +++ 5 GnQnga --- Gn-X are defined and equal. We 
shall denote the totality of these points by A,A, --- A,-X. 


13. Let G be a group structure and (A, B, C) a system in @. Special realiza- 
tions of G can be defined as follows. Let ¥ = G, and let X = C. If xis an 
arbitrary point of X, say x = ¢ (ceC), let b-x = be, a-(b-x) = ab-x = abe. 
We have thus created a realization (G, X¥) with a system (A, B; X) of degree 
2. Insimilar fashion, we can form a realization of degree = n — 1 if G possesses 
a system of degree n. Realizations of the sort we have just described are usually 
called “first parameter groups’. The second parameter groups are defined as 
follows. First form the conjugate group structure @* with multiplication 
denoted by axb, and defined by the relation axb = ba whenever ba is defined. 
If G has a system of degree n, (n = 3), so does G* and the corresponding first 
parameter group of (* is the “second parameter group” of G. 


14. (G", X¥") denotes a realization of a semi-group @’, where the spaces @" and 
X" are euclidean spaces of r and n dimensions. If (A, B) is a system in G’, the 
set AB is open. For let ao, bo be points in A, B and let V © B be a bounded 
neighborhood of bb. The correspondence b — ab is (1, 1) and continuous 


® The definitions of first parameter group in the literature make no reference to the 
degree of G and therefore, strictly speaking, do not define; for the first parameter group 
does not exist unless G is of degree = 3. This criticism does not apply, of course, when @ 
is explicitly assumed to be a group or partial group (i.e., to have an identity and inverses) 
since © is then of infinite degree (§7). 
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because of II, III, and therefore when extended over the closed compact set V 
it is a homeomorphism. Hence aoV is homeomorphic to V and hence aoV is 
homeomorphic to V. From the invariance of regionality (Brouwer), aoV is an 
open set and since dob € @a2V & aoB | AB, abo is an inner point of AB. But 
since dobp is an arbitrary point of AB, AB is open. By similar reasoning, A-X 
is an open set if (A, X) is a system in (@’, ¥"), and G~ is open if G" is a group 
or partial group and G © H where H is defined in VI, §6. 

As a consequence of the preceding remarks, we note that if (A, B, C) is a 
system in a group structure @’ and if (A, B; X) is a system in (G’, X¥"), then 
(AB, C), (A, BC) are systems in @" and (A, B; X), (A; B-X) are systems in 
(G", ¥*). 


15. Structures in the classical theory. Let (” be a structure and (G’, ¥") 
a realization. We shall say that the systems (B, A) and (A; X) in @ and 
(G", ¥") are of class C™ if 


(ba); = gil, --+ , Gy, by, --- , d) (@@=1,---,n), 
(a-x); = f(z, st y Eny Qi, --> , ay) (¢ ” 1, = n), 


where the ¢g’s are defined for ae A, be B and the f’s for re X, ae A, and 
both sets of functions possess continuous derivatives of the first n orders through- 
out their regions of definition. The Lie theory of continuous groups deals largely 
with systems of degree 2 and of class C®, C® or C®, and systems of degree 3 in 
which the various systems of degree 2 which can be formed from such systems 
(§14) are of class CM, C®, or C®, 

Consider for example the first part of the “first fundamental theorem” of 
Lie. The statement of this theorem seems at first sight to imply only the 
existence of a system of the form (A; X). An examination of details, however, 
shows that both the statement and proof require a system of the form (B, A; X). 
The theorem asserts, in fact, that if (B, A; X) is a system in a representation 
(G", ¥") of a semi-group ’, and if the systems (B, A), (A; X) and (B; A-X) are 
of class C™, and if, moreover, 








™ 
| a(ba); £0 (ae A, be B), 
ab; 
(1) (b ") | 
a” ~ 0 (be B, x’ = a-reA-X = X’, say), 
x 





i 
where the expressions on the left are functional determinants, the functions 
(a-x); = x, satisfy equations of the form 


Ul 


Ox ; 


(2) aa; b> Vix (a)érj(x’), 
k=1 


© The proof to which we refer is that given in [9], vol. 1, p. 28. 
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where y’s are defined throughout A and the é’s throughout X’ and where the 
determinant of the y’s is not identically zero. 

Suppose in particular that (B, A, C) is a system in a semi-group @" such that 
(B, A), (A, C) and (B, AC) are of class C™ (so that (BA, C) is also of class C™), 
and write C = X,c = x, ac = a-z, thus forming a first parameter group (§13). 
Now since the transformations b — ba and x’ — b-z’ are (1, 1) and all partial 
derivatives are continuous, there exist open sets A,, B,, X,; contained in A, B, X 
such that relations (1) hold for a € A,, b e¢ B,, x’ € A,-X;. Hence the functions 
(a-x); satisfy equations of the form (2) within A;, X;. Since the transforma- 
tions a — a-z are (1, 1), the determinant of the derivatives in (2) cannot vanish 
identically in A; for any z in X;, and hence from (2) there exists a point dp € A; 
such that 


(3) | ix(ao) | #0. 


We now write 
a-x = t,(zx) (a € Ay, x € Xj). 


Then ¢, is a (1, 1) continuous transformation which carries z to a-zx. Let 
Tt. = tata’, and y = ao-x. Then 


ta(y) = Ta(Qo-X) = ta(x) = a-z (y € ao-X;). 


Hence the functions (ra(y)); satisfy the equations (2) when substituted for 2}. 
The family 7, contains the identity, since 7.,(y) = y. 

Let us now assume that the systems (B, A) and (B; A-X) are of class C®. 
An examination of the proof” of the Lie theorem shows. that the functions (a) 
and ¢(x’) have continuous derivatives. Because of (3) and the fact that r,, is 
the identity, it follows from the second part" of the Lie theorem that the 
functions 7.(y) “define a group’. What this means is that there exists a par- 
tial group @* defined in the same space as @ and in which a is the identity 
and for which there is a realization system of the form (A2, Az; Ye), where 
@ € As & Ai, V2 = ao-X2 | ao- Xi, and where Aj’ is defined and a-y = ra(y). 


16. For later application (§32) we need to define certain additional functions. 
The transformation ¢,,(z) is (1, 1) and continuous over X. Hence if X; is a 
bounded open set such that X; C X, then t,,(x) will be bi-continuous over X; 
so that ¢;) is continuous over ao-X3. Now choose X; such that X; © Xe, and 
let x» be a fixed point in X3. Since ao-x € a9-X3 and ao- Xz is an open set, we 
can choose a neighborhood A; © Az of ap such that a-xo € @o-X3 when ae Aj. 
Hence there is a uniquely determined x in X; such that a-x = a-x. Hence 
we may write z = (a), where \(a) is defined and single-valued over Aj. Since 
a@- xo € d9-Xo, t2'(x’) is defined and continuous whenever xz’ = a-2 (a Aj). 
Hence we may write \(a) = f,)(a-z»), and the transformation a — (a) is (1, 1) 

11 [9], vol. 3, p. 563. The proof given here holds when the é’s and y¥’s have continuous 
derivatives. 
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and continuous. Finally, let u(a) = a-2 and let A; © Ag be a neighborhood 
of do such that the transformations a — (a) and a — u(a) are bi-continuous 
over A; (see the beginning of this section). 


17. The definition of group structures. It is generally taken for granted 
in expositions of the Lie theory that if a family of transformations possesses the 
group property, there is determined a group structure for which the given 
transformations constitute a realization. So far as we are aware, this theorem 
has been never stated with complete precision, and a proof has been given only 
in case all the defining functions are analytic." It seems therefore worth 
while to formulate a statement and proof which will justify an almost universal 
presumption. 

Let (G, ¥) be a structure in which the spaces G and X are euclidean. Suppose 
that a-x and b-x are defined for all points x in some open X. We then write 
a = b. 

We introduce the assumptions 

I’. If b-X, a-(b-X) and c-X are defined and if c-x = a-(b-x) for every x « X, 
the same relation holds for every z for which b-x, a-(b-x) and c-z are defined. 

II’. Each point a of G possesses a neighborhood G, within which the relation 
a = b can hold only if a = b. 

III’. There exist open sets A, B, C, C being compact, and an open set X such 
that B-X, A-(B-X), C-X are defined, and for each a and b in A and B there are 
one or more points ¢ in C such that a-(b-x) = c-x for every x « X. 

TueoreM. If (G, X) satisfies II,, III, and I’, Il’, III’, products of the form ab 
can be defined so that & is converted into a semi-group for which (G, X) is a realiza- 
tion with a system of the form (A*, B*; X) where A* & A, B* & B. 

Proof. Let ao, bo be fixed points in A, B and let {c},,, be the set of c’s in C such 
that ao-(bo-x) = c-a (x eX). This set is discrete; for if co were a limit point, 
there could be no G,, satisfying II’. Since C is compact, {c}.,», is finite, say 
(Chay) = (C1, - ++ yen}. 

Let Fi, --- , Fx be non-overlapping neighborhoods of «, --- , c, so small that 
F,oG., A C (i = 1,---,h). There exist neighborhoods Ao, Bo of ao, bp con- 
tained in A, B such that {c}., C =F; when ae Ao, be Bo. If this were not the 
case, we could choose sequences a, — do, b, — bo in Ao, Bo such that at least one 
point c, of {c}.,s, would fail to lie in =F;. Since c, ¢ C and C is compact, there 
is a converging subsequence Sa, —c,eC. Now An, (bn;-) — ao- (bo-x) and since 
An(bng-X) = Cn,-2, we have d-(by-2) = c,-x so that c, € {c}.,s,; hence, infinitely 
many ¢,’s lie in ZF;. This is a contradiction. 

Let a X b be an arbitrary point in the topological product Ay X Bo. To each 
a X b there corresponds the set {c}. in =F; Let H; be those points of Ay X By 
whose corresponding sets have at least one point in F;. Then Ap X Bo = =H. 
Moreover, H; is closed in Ay X Bo. For suppose a’ X b’ is a limit point of M,. 


12 [9], vol. 1, p. 16. 
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Choose a sequence a, X b, a’ X b’. This implies that a, — a’, b, — b’. 
There exist points c, in F,; such that a,-(b,-7) = c,-2. We may choose a 
converging subsequence c,;— c’ « Fy © C, and then, reasoning as above, we 
have c’.x2 = a’-(b’-x) so that c’ € {c}ay and a’ X b’ e€ Hi, and H, is closed. It 
follows that not all the sets H; can be nowhere dense in Ay X Bo; we may 
suppose that H, contains an open set Hy, © Ao X Bo. We can choose open 
sets Ao & Ao and Bo © Bo such that Ao XK Boo S Hu, and we shall have, for 
every a € Aw, b € Bo, a point c in Hy such that a-(b-7) = c-x (xe X); € is 
unique, for if a-(b-x) = c’-x for every xe X, then c = c’; but since c and c’ 
are in G,,, ¢ = c’. 

Let Aco, boo be fixed points in Aw, Boo and let A*¥ = Aw A Gans Bt = Bo A G,,: 
The unique c which corresponds to every pair of points a, b in A*, B* we denote 
by ab. Thus @ is converted into a structure with a system (A*, B*) and for 
(G, ¥) there is a realization system (A*, B*; X). We shall now show that © is a 
semi-group. 

We first verify I. Suppose lm, mn, l(mn) and (lm)n are defined. This im- 
plies that 1 « A*, me B*; me A*, ne B*; lm e A*, mne B*. (This incidentally 
implies that A* A B* # 0; hence I is trivially satisfied if A* and B* do not meet.) 
Let z be an arbitrary point of X and let y = n-z. Then m-y is defined, since 


(1) mn-x = m-(n-x) = m-y. 
Furthermore, lm-y is defined, since 

(2) (lm)n-x = lm-(n-x) = lm-y. 
Finally, 1-(mn-x) is defined and from (1), 


(3) l-(mn-x) = l-(m-(n-x)) = I-(m-y), 
so that l-(m-y) is defined. Since l(mn-x) = l(mn)-2, we have from (3) 
(4) l(mn)-x = l-(m-y). 


Since Im-y and l-(m-y) are defined, we have from I’ lm-y = l-(m-y). Hence 
by (2) and (4), l(mn)-x = (lm)n-z, for every xe X. Hence l(mn) = (lm)n, 
since these points are both in G.,. Hence I is proved. 

Suppose that ab = a,b. This implies that 


(5) a-(b-x) = a,-(b-z) (a, a; « A*, b € B*) 


for every re X. By the argument of §14, b-X is an open set and therefore (5) 
implies that a = a,. Since A* ¢& G,,,, we havea = a. Again, if ab = ab,, then 
a-(b-x) = a-(b;-x). Hence from II, )}-2 = b-x. Hence b; = b and since 
B* © Gry, b1 = b. This verifies IT. 

Finally, we establish continuity (III). Suppose a, — a in A* and b, — b in 
B*. Then a,-(b,-7) — a-(b-x) from III,. Since a,b, ¢ Pi: & G., A C, there is 
a converging subsequence a,,b,; > ¢ € Fy, and hence c-z = ab-x. Since ¢ and ab 
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are both in G.,, we have c = ab. Thus c is independent of the particular con- 
verging subsequence chosen, and a,b, — ab. 

Thus © is a semi-group. (@, ¥) is obviously a realization of © and our 
theorem is established. 

Remarks. In Lie’s proof of the existence of the function ab, the assumption 
II’ is replaced by the assumption that the “parameters are essential’’ and at the 
same time the functions which define a-z are assumed to be analytic; but it is 
not easy to see from the proof how the assumption of essentiality can be made 
to replace II’ in any situation short of the analytic one. To be sure, in certain 
cases, essentiality implies II’; but we can prove this only if the theorem in 
question is assumed proved, (§38). It is of course obvious that II’ can be 
omitted in our theorem if it is assumed in III’ that for every a, b there is a 
unique c. This seems to be a tacit assumption in most treatments." 

Transitivity 

18. Let (G, ¥) be a realization of a group or partial group ©. A system 
(G, --- , G@; x) (e « G) will be called a transitive system if G ¢ H, G = G, and 
if G-z is an open set." 

TueoremM. If (G, G, G; x) is a transitive system and if there is a system (G; X) 
such that GG-2 & X, then GG-x is an open set. 

Proof. Let ggi-%o (g, g: « G) be an arbitrary point in GG-2», and let x, be an 
arbitrary sequence of points such that z, — ggi-%. Since ggi-% «GG-x & X, 


almost every zx, is in X and hence almost every product g~-z is defined (since 
g''e«G" = G). Hence, 


J *-Lm > F"G9i-Xo = Ji-Xo (m > ni, say). 


Since gi-2% €G-x» and G-z» is open, almost every g™-z,, is in G-x. Hence 
almost every 2» is in gG-2x» and this last set therefore contains gg;-2» as an inner 
point. This proves the theorem. 


19. Let G be a group and (G, ¥) a total ($12) realization of G, and (G, G, G; x) a 
transitive system. It follows from the preceding theorem that GG-z» is open. 
As a matter of fact, since the degree of a system no longer plays any part in the 
argument, it is clear that if G is any open set containing e, and if G-zo is open, 
then GG-2, GGG-x, --- areallopen. Let G™ = GG --- G(n factors). Then 
GO SG ¢ --- , and GM.x & G™-x | ---. Suppose that G is connected. 
From a theorem of Schreier [10], lim G™ = G. Hence lim G™.z9 = G-2. 


na no 


Suppose further that @ is compact and Xis connected. We assert that G-xo= ¥. 


13 There are various ways of stating assumptions which will lead to the existence of a 
group or partial group. We shall not, however, discuss these questions further at present 
nor shall we discuss the possible relations between the various determinations of the 
function ab in the case where it is multiple-valued. 

14 The last condition is the essential one; the first two are only a convenience. 
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Since @- zo is open, we need only show that G-z» is closed. Let x, — x, where 
rn, €@-x. We may write 2, = gn-X% (gn €@). Since G is compact, there is a 
converging subsequence g,; — g; then g,,;-%) —> g-%o, and hence g-x%) = x and 
xe@-2xo. Hence G- 2 is closed. 

As a consequence of the relation G-z) = X, we note that for any pair of points 
x, yin X there is a point c in G such that c-x = y, that is, the realization (G, %) is 
transitive. Since x = a-% and y = b-2 for properly chosen a, b, then ba~'-x = 
b-2%) = y, and we may take c = ba. We have the theorem 

If (G, &) is a total realization of a compact connected group, if X is connected, and 
if for some X and open G (e € G) G- a is open, then (G, %) is transitive. 

This theorem states, in effect, that complete transitivity is a consequence of 
transitivity in the neighborhood of a point. We shall now turn to the converse 
problem, and show what type of local transitivity is implied by complete transi- 
tivity. Our results will be used later in the proofs of certain imbedment 
theorems. Although we are able to avoid the assumption that @ is compact, 
an assumption which would make the proofs fairly obvious, it will be expedient 
to assume from now on that @ is separable, metric, and semi-compact, and that 
X is metric and complete. The particular properties which we shall require 
and which follow from these assumptions are 

(a) every open set G in @ can be expressed as the sum of a denumerable set of 
closed, compact sets; 

(b) if X, + X_+ --- isan open set in %, not every X; can be nowhere dense 
in = X;.% 


20. Suppose that G is a group or partial group, (G, ¥) a realization. Suppose, 
further, that (G, ¥) possesses a system (G, G, G, G; 2) such that e eG, G' = G. 
All the points in GG have inverses. In fact, if g, h eG, then g~, h-' € G, since 
G-= G. The product (gh)(h~'g~) is defined and equals e. Hence (gh) = 
hg and (GG) = G" G = GG. 

We shall construct a new space’ whose elements are certain subsets of G. 

Let g consist of all points g in GG such that g-z) = x. We assert that 
(1) agg AGG = 
(2) Ifag AG = bg A G (a, b, € G), then ba € g. 

(3) If hy-29 = he-2o (hi, he €G), then hy ehag A G. 
(4) (hg)g AG = hg AG, if heG,g eq, hg €G. 
(5) Ifh, ehg A G (heG), then hig A G = hag A G. 

Proof. (1) gg is defined because g ¢ GG and G is part of a system of degree 4 
Since 99-2 = 2, then certainly (gg A GG)-x) = x, and hence gg A GG & gq. 
Obviously, g & ag A GG; hence (1) is proved. 


46 See Hausdorff, [5], pp. 136-141, particularly X, p. 141. 
16 Cf. Cartan, [2], p. 26. 
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(2) Since a eag A G (because e ¢ G), then aebg A G; that is, a = bg where 
géeqg. Hence ba = g. 

(3) Obvious. 

(4) An arbitrary point in (hg)g A Gis of the form (hg)g:, where qi € g, (hg)gi € G. 
Now (hg)gi-%0 = (hg)-(gi-%0) = hg-to = h-(g-%) = h-a. Hence, by (3), 
(hg)g: ehqg AG. Hence (hg)qg A GQ ha AG. Conversely, consider an arbitrary 
point in hg A G; it is of the form hg, where gz eq and hg2eG. We have hg: = 
(he)g2 = (h(gg-))g2 = ((hg)g)ge, since hg « G, and since g~! « GG and gz « GG, 
it follows that g~'ge is defined, so that hge = (hg)(g~g2). Now hge e G and 
(hg)! «G"' = G. Hence (hg)~(hge) is defined, and (g~'g2) = (hg)~'(hge) € GG. 
Since g-'ge € qq, we have g"ge = gg A GG = gq by (3). Hence hg. = 
(hg)(g~"g2) € hgg, and since hgs e G, we have hge « hgg A G, and (4) is proved. 

(5) Since hy € hog A G, we may write h; = hg, geg. Hencehg A G = 
hogg A G = hog A G by (4). 

From (5) it follows that if hig A Gand heg A Ghave a point in common, they 
are identical. Moreover, if h is a point of G, thenhehg AG. Hence the points 
of G fall into a family of mutually exclusive sets which we shall denote by b:, 
b,, ---. Let us choose in each of these sets, once and for all, a single point, 
denoting the point in h; by h:. By (5) we have bh: = hg A G. 


21. Let be the totality of sets b;, b,, --- . We shall introduce a topology 
into § as follows. Let G., Ge, --- be a sequence of neighborhoods contained in 
G and closing down on e. Let 


Vn(be) = (AiG, A G)g A G. 


V..(b:) obviously consists of a set of 5,’s and we may therefore think of V(b:) as 
a subset of . The totality of these subsets is to constitute a complete set of 
neighborhoods for . We shall show that the Hausdorff axioms are satisfied. 

First, it is clear that if V,(b:) and V,,(b:) are two neighborhoods of 6; and if 
k > max(m, n), we have Vi(b:) & Valbsr) A Vm(be). 

Next, suppose §}: + 5,. We show that for n sufficiently large, V,(b:) A 
V.(6,) = 0. If this were not so, we would have for each n points a,, b, in G,, 
such that hang A G = hybag A G, hea, € G,hyb, eG. Then ha, €h,b, A G, and 
therefore we may write 


(1) han = hybrGn (Gn € Q). 


Hence (hybn)"(hidn) = (Rgbn)—"(Agbngn). Since he, an, hy, b, are in G and hybrgn € 
GGG, and since G is part of a system of degree 4, we have (h,b,)—(hybrgn) = 
(habn)(hybn)gn = gn. It follows from (1) that hyan-xo = hybngn-%o = (habn)- 
(b, +20) = h,b,-x0, and on letting n — © and observing that a, — e, b, — e, we 
see that hz-% = h,-%. Hence h; € h,g A G by (3), §20, and by (5), §20, 
hg A G = h,g A G; that is, hb: = 6,. This is a contradiction. 

Finally, let 5, be an element in V,,(:). We shall show that for & sufficiently 


CC 
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large, Vi(b,) & Valbe). We have 6, C (bh G, A Gg A G, and therefore we 
may write 


(2) hb, = (heha)g A G (h € Ga, Agha € G). 


Let m be chosen so large that h,G,, © G, and h:h,G, GG. Now 6, = hg A @ 
and on comparing this with (2) we have, from (2), §20, (h:h,)“h, = g, say, 
where g eg. Since (A;h,,)~' « Gt = G, we may write h, = (h:h,)g. Hence 


Vi(6,) = (h, G:. A G)q A G = (((heh,)g)Gi A G)q A G ° 


Since h:h, « G, 9 & GG, Gi | Gand G defines a system of degree 4, we have 
(heh, )gG. = heh,(gGi). Let k be chosen so large that gG, © G,g. Then 
((hehn)g)Ge & heha(Gng) = ((heh,)Gu)g and Villy) & (((hehaGng A Gg A G, 
and since h:h,Gm € G@ by choice of m, we can apply (4), §20, to the expression 
on the right; it results that Vi(b,) S ((hehn)Gn A Gg AG. Since (hh,)G, = 
h:(hnGm) and this last set is contained (by the choice of m) in h:G,, we have 
Vi(b,) & (AG, A G)g A G = V,(by). 

We have shown that § is a Hausdorff space; obviously it satisfies the first 
denumerability axiom. 


22. The fact that g satisfies the relation gg A GG = g shows that g is a sort 
of (partial) subgroup of G, and it is possible to start with this property, together 
with the assumption that g is closed in GG, and construct without reference 
to a space X. In case G is a group and G = G, g is a closed subgroup, and the 
elements of © are the left cosets g. The verification of the Hausdorff axioms 
is now quite simple. For example, suppose ); ~ 6,. We have hy = heq, 
b, = hag, Valhs) = A:G.ig, Va(h,) = hyG.g. Suppose that for each n, V,(b:) A 
V.(b,) = 0. We could then choose a,, b, in G, such that h:a,g = h,b,g. Hence 
(b,h,)(hea,) = g, where g ¢ g. Since a, — e, b, — e, we have h, — h>‘g:, 
and since g is closed, h;'h; eg. Hence h; = hg (g € g) and hig = higg = hg, 
or §: = §,. This is a contradiction. 

One further remark concerning the present case (G = @ = a group): if 
a, — a, and if ag = by, ang = bg,, then obviously hy, — bh: in $. Conversely, 
if h: = ag and if b:, — b;, there can be chosen a sequence a, such that a,g = 
h:, anda, — a. This is perhaps not quite obvious, but the proof offers no real 
difficulties, and the details closely resemble those in the proof of a later theorem, 


(§25). 


23. Let us now return to the system (G, G, G, G; x) of §20, and assume that 
it is transitive. Let the open set G-2» be denoted by Xo, and let b, (x « Xo) con- 
sist of all points h in G such that h-2) = x. If h, is one of these points, we have 


(1) b. = hg AG. 


For, let hig (g € g) be an arbitrary point of hig AG. Then (hig)-x%0 = hy(g-2) = 
hi-% = x. Hence hg ¢ bz and hg A G ¢& bz. Conversely, if h is an arbitrary 
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point in b,, since h-zy = x = hy-x%, we have h « hig A G by (3), §20. Hence 
bh. | hig A G, and (1) is proved. 

It follows that each b, is a b;; conversely, it is obvious that each }; is a be. 
Thus the elements of § can be denoted by 6., b,, --- and the corresponding 
points h;, h,, --- by hz, hy, ---. If « ¥ y, then h, ¥ h,, for otherwise we 
would have x = h,z-% = hy-% = y. We conclude: there is a (1, 1) corre- 
spondence x — h, between the points of § and those of Xo; for each z, b. = 
hig A G. 

We shall now show that the correspondence is bi-continuous. Let us denote 
bz by h(x). We shall first show that if (z,.) — b(x), thenz, — zx. Let V be 
an arbitrary neighborhood of z. Choose 7 so large that h,G;-%9 © V. Almost 
every 6(z,,.) isin Vi(bz) = (hzGa A G)g A G, and is therefore of the form hz ang A 
G (am € Gi, heim €G). Hence, for almost every m, hza,,-%) = Im, and therefore, 
by (1), almost every z,, isin V. Hence z,, — 2. 

Conversely, if z, — 2, then 5(z») — 6(x). To prove this we first establish 
the following 

Lemma. Let x, 2%, Xo, --- be points in Xo such that for each n, hz-t, € Xo. 
If 6(am) —> b(xo), then b(hz-tn) — b(hz-X0), and conversely. 

Proof. Let m be chosen so large that h.G; ¢ G. Let m be an arbitrary 
integer > m. Then h.G, Gh:Ga GG. We have also V,.b(%0) = Gnh(xo) A G, 
since Gm & G, Vnb(hz-%0) = Vn(bz) = (AzGm A G) b(t) A G = hz G,.b(x) A G. 

Now suppose that b(x,) — (2). Then for almost all n b(z,) € Vn(b(2o 
so that we may write 


(tn) = Cnb(%o) A G (Cn € Gm) , 
(n sufficiently large). Since m > m, we have h.c, e G. Since c, € h(z,), we 
have ¢n:%o = Zn. Hence hien-%o = hz-(Cn-%o) = he-Xn. Consequently, 


Q(he-2n) = (hitr)g A G C V.(bz) for almost every n. Hence b(hz-2,) — 
h(x) = b(hz-x%). The proof of the converse offers no difficulty, and amounts 
essentially to reversing the preceding argument. 


24. Suppose now that I is a closed compact subset of G. Let §" be the 
totality of §.’s which meet T. Then §° is closed and compact in ©. For 
suppose 6(z,) is a sequence of elements in §" such that h(z,) > b(x). Let a, 
be a point in h(z,) A T. Since T is compact, we can choose a converging 
subsequence a,,— a, where ae IT. Clearly a,,q A G— ag A G, so that b(z,,) — 
h(a-a) € S". Since h(a-2o) = b(z), H" is closed in §. The compactness of 
" follows in similar fashion. 

Let X° be the totality of points x such that h(x) CS". Then since the corre- 
spondence 6(x) — z is continuous, X" is closed and compact in Xo. 

Let Ti, T's, --- be closed compact subsets of G such that G = ZT; (see §19). 
Then =X": = Xp, and hence by (6), §19, at least one set in the summand, say 
X", must contain an inner point, say z. 

Now suppose z, — 2% in G. Then h,-x, — h,-x%) = z. Hence almost all 
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the points h,-x, are in X", and the corresponding elements b(h.-z,) of © are 
then in ©". Therefore, for almost all values of n, 6(h.-2,) A IT; contains at 
least one point b,. Since b, ¢« T; and 1, is closed and compact, we can choose 
a converging subsequence b,, — 6, where be T;. Since bp, € b(h.-2n,), we have 
bn;-%o = hz -Xn,. Consequently, b(h.-tn,) = b(bn;-%0) = bang A G— bg A G. 
Since b,,-2%9 — b-xo and h,-2,, — hz-%, we have b-x = h,-%. Consequently, 
bg A G = b(h.-2o), and hence b(h.-2n,) — b(h.-2o). Hence, by the lemma 
(§23), b(a,;) — b(xo). Since the correspondence h(x) — z is (1, 1) and con- 
tinuous, it follows immediately that §(2,) — b(2o). 

Suppose finally that zx, 2 = hz-%. Thenh;'-z,— 2. Hence §(hz'-2,) 
h(zo), and hence, by the lemma, 6(z,) — b(hz-%) = b(x). This establishes the 
bi-continuity of the correspondence §(z) —> z. 


25. THroreM. Suppose (G, G, G, G; 2) is a transitive system. If x = b-2X 
(b € G) and if x, — x (x, € Xo = G-x), there exists a sequence b,, such that b, — b 
and b,-% = Xn. 

Proof. Assume first that b = e. Then since z, — 2, we have b(z,) — 
b(2o). Let N(m) be an integer such that N(m) < N(m + 1) (m = 1, 2, --- ) 
and such that 


(1) b(rn) © Vn(b(t0) ) = Gung A G when n > N(m). 
(1) implies 
H(tn) = Camg A G, (Cam € Gm, 2 > N(m)). 
Since Cam € b(z,), we have 
CamZq = Za (n > N(m)). 


Now let b, be an arbitrary point in G, when 1 < n S N(1), and let db, = Cam, 
when N(m) < n S N(m + 1), m = 1, 2,---. Then clearly b, — e and 
b,-X9 = 2», and the proof is complete for the case b = e. For the general 
case, we note that b-'-z, = a. Hence there exist points g, such that g, — e 
and g,-% = b!-z,. If we let b, = bg,, we have b,-% = z, and b, — b. 

Corotiary. If B is any open subset of G, then B-2xo is open. 

For let b-zo (b « B) be an arbitrary point in B-z. Let z, be an arbitrary 
point sequence in X» converging to b-2, and let a sequence 6b, be chosen in G 
such that b, — 6b and b,-% = z,. Since almost all the b,’s are in B, almost 
all the z,’s are in B-a. Hence b-2 is an inner point of B- 2p. 

The corollary states in effect that transitivity over a given region implies 
transitivity over any arbitrarily small portion of that region. 


26. Partial subgroups. Let G be a group and g a subset of @ such that 
qt = g and such that gg A G = g A G, for some suitably chosen neighborhood 
G of e. We shall call g a partial subgroup of G. The set [g] = 9 + ag + --- 
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is obviously a subgroup of G. For [g] [g] = [g] and [g]-' = g''+q"g¢'+--- = 
g+og+--- = [gl 

Suppose that g is a partial subgroup of G, and that there is a neighborhood 
A of e such that g A A is closed in A, while [g] is not closed in G. We shall 
say that g is recurrent. For example, let G be the 2-dimensional toroidal 
group; its elements are then number pairs a = (a), a2), where a = b if a; = hy, 
az = be (mod 27). Group multiplication is given by (ab); = a; + b; (mod 27) 
(¢ = 1, 2). Let @ be incommensurable with 27, and let g consist of the pairs 
(a, da) (—1 < a <1). Then g is a partial subgroup of © and is recurrent, 
since [q] fills G densely but is closed in no region in G. If we drop the modulus 
2x, G becomes a vector group and is simply connected, but g is no longer re- 
current. We do not know whether or not simply connected groups, even if 
they are Lie groups, can contain recurrent partial subgroups. 


27. Lemma. Let g be a partial subgroup in the group G, let G,, Ge, --- be 
neighborhoods closing down on e, and let gq. = Ge Ag (k = 1,2,---). Ifgisa 
point of [g] and n a positive integer, there exists an M [= M(n)] such that gng | 
Gn, whenm > M. 

Proof. Suppose that g = gige --- gp (gi € g). If our assertion is false, we 


can choose an increasing sequence 7m, m2, --- anda point hn, € Gm, (¢ = 1,2, --- ) 
such that hng @gqn. Now let 
Dm, = Fn I = Ip --- G2 GT hmgG2 ++ Gp - 


Then we may write 
Bin; = 9," ss ga hy go eee Jp, h’ = G1 hmgr e 

Since h,,, — e, it follows that for almost all m,’s, hngi ¢ G, where G is the open 
set which occurs in the definition of partial subgroup. Since h,,, € g and gi: € 4, 
it follows that hngi ¢ gg. Since G A gg = G A gq, it follows that for almost all 
m’s, Ji'hmgi egg. Since gy’ ¢g-! = g and since, for almost all m,’s, gz‘ (Am,gi) «@ 
(because h,,, — e), it follows similarly that for almost all m’s, hm, = gi! (hm,g1) 
eg. Hence almost all the hy,’ are in gn. By exactly the same argument, 
we can show that for almost all m,’s, G2 ‘hn. go € Gx, and on repeating this we 


finally obtain the result that almost all the b,,’s are in g,. Since hn; = gbm,, 
it follows that almost all the h,,,’s are in gg,. This is a contradiction. 


28. We now recall that @ is separable (§19). If g is a partial subgroup, 
there exists a denumerable set A © g which is everywhere dense in g. We 
assert that every set cgn, where c ¢ [g] and g. = G, A g, contains points of the 
denumerable set [A] = A + AA + ---. To prove this, we shall first introduce 
a new topology into [g]. The neighborhoods of an arbitrary point g in [g] are 
to be the sets of ggi (kK = 1, 2, --- ). The Hausdorff axioms can be immedi- 
ately verified. Moreover, products ab remain continuous in the new ([g]- 
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topology. To see this, let abg, be a neighborhood of ab. Choose N so that 
GG San When p > N,q > N. Let k be an integer > N. Choose (by the 
lemma, §27) an integer M such that gb € bg., when 7 > M. Now let h be 
an integer > max (j, N). Then gb © bai, hence giba, S barge & bg,, and 
ag,ba, | abg,. If a, > a and b, — b relative to the new [g]-topology, almost 
all the a’s are in ag,, and almost all the b’s are in bg,. Almost all the points 
agb, are in ag,bg, & abg,. Hence a,b, — ab. Our assertion follows immedi- 
ately. For suppose that c = gigs --- g: (gieg). There are points d{”’, --. , d\” 
in A such that d\* — g; (¢ = 1, --- , ). Hence in the new [g]-topology 


d@ = d\? dy Beis d\? —¢. 


Hence an arbitrary neighborhood cg, of c contains almost all the d‘"”’s, and since 
d@ CA, our assertion is proved. 


29. Let @ still be a group, and let (G, ¥) be a realization, and 
(G, G, G, G; x) a transitive system in (G, ¥). The set g defined in §20 is 
obviously a partial subgroup on account of (1), §20, and we shall now show: 

If g is non-recurrent, there exists a neighborhood H of e such that H A q = 
H A {g).” 

Proof. We assert first that there can be chosen a denumerable set of points 
{x,} in Xo = G-2zo such that (see §23) 


G A [ql] = Blas) + Blas) + =. 
For let A (A € q) be a denumerable set which is dense in g. Let 


L = [A4) = th, hl, ---}, K = b(h-2) + b(le-20) + ---. 


Since 1; € [g], we have lig & [g] (¢ = 1, 2, --- ). Consequently, liq A G & 
[a] A G; that is, h(li-20) & [g] A G, and hence K ¢ [g] A G. On the other 
hand, let f be a point of [g] A G and let n be chosen so large that fg, € G, where 
Qn = G, A g (§27). Now by §28, fg, contains points of [A], and we may sup- 
pose that one of them is . Since l; € fan G fan A G & fa A G, we have 
l,-x9 = f-%o, and consequently fg, A G = b(l,-20), [(3), §20]. Hence f € h(,-2), 
f € b(L-20), [9] A G GK, and [g] AG = K. If we let x; = 1-2, our assertion 
is proved. 

Let x = {%1, 22, --- }. We assert now that x is closed in Xo. For suppose 
that z,, — x (tn, € x, © € Xo). Then (§23) b(z,,) — h(x), which we may also 
write b(l,;-%) —> b(l-ao) (l ¢ G, l-a = x). By the theorem of §25, we may 
replace I, by * such that . — l. Since X. € h(l;,,- 20) = h(l,,-%) & Ial, 
we have 4 e [a] A G. Since g is obviously closed in G and is non-recurrent, 
[9] is closed in G. Consequently I ¢ [g] A G, and g(l-xo) €& [g]. Hence x = 
l-ao € x, and x is closed in Xo. 

17 This theorem probably holds not merely for g, which requires for its definition the 


existence of a realization, but for every partial subgroup of @ which is closed in some 
neighborhood of e. 
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Suppose now that our theorem is false. If we assume, as we may, that 
G, |G (n = 1, 2, --- ), G will surely contain a point of [g] which is not in g, 
and therefore G, is intersected by some b(1;-2)—-we may say b(1,,-20)—different 
from h(a) = g A G. Since b(l,,-2) is closed in G, we can choose mz > 1 so 
large that G,,, fails to meet b(/,,-20). But since the theorem is being denied, 
G,,, is also intersected by b(1,,-20) different from 6(x). By the choice of me, 
b(1,,:%) is also different from (,,-20). On continuing in this manner, we 
obtain a sequence b(/,,-20) (¢ = 1, 2, --- ), and B(In;-20) A Gn, AO (1 = m < 
me <--- ). Let a; be a point in b(l,;-20) A Gn, and let a = {a, de, --- }. 
Since the sets 6(/,,-20) are mutually exclusive, the a’s are distinct, and since 
a, — e, a has e for a limit point. Clearly, then, the set [a] = a + aa + --. 
is dense in itself, and so is a A G;. This last set is denumerable, say [a] A G,; = 
{bi, be, ---}, where the b’s are distinct. Let x» = {bi-x0, be-x, ---}. Then 
obviously x» & x. Moreover, x is dense in itself. For if §(b;-20) is a limit 
point of {6(b;-xo)}, then by §23, b;-2x0 is a limit point of {b;-20} = xs. 

Since G is separable, we may assume the G,’s so chosen that G; C G. Then 
G,-2%) & G-x. Since G,-2x is closed, while G-2» is open, it follows that G,-29 C 
G-x. Therefore, since x, C G,-x9 C G,-2 C G-a, every limit point of x, 
is in G-2o; and since x» © x C G-2, and x is closed in G-2xo, every limit point 
of x» is in x. Hence % & x. But the set % is perfect, and therefore non- 
denumerable (theorem of Cantor for complete spaces), whereas x is denumer- 
able. This contradiction proves the theorem. 


30. Lf g is non-recurrent, there can be chosen a neighborhood A ¢& G of e such 
that if a ¢« A, 


(1) alg] A A = b(a-x%) AA. 


In fact, we need only choose A such that A~' = A and AA ¢ H, where H 
is defined in the last theorem. Then if a « A, g « [g], and age A,gea"A C 
AA (H,g¢«H Ag = Q, and alg] € ag A G = BH(a-x%). Since ag is an arbi- 
trary point of ag A A, we have proved that a[g] A A & b(a-x). Since b(a-2») & 
[a], we have A A b(a-2) A A [gl], and (1) follows. 


Imbedding theorems 


31. We have already remarked that the classical theory of continuous trans- 
formation groups deals with partial structures and their systems; the modern 
theory, on the other hand,—or at least that part of it which studies properties 
in the large—deals with total structures, that is, (total) groups and total reali- 
zations of them. We shall now show that many structures of the first type can, 
in a sense, be imbedded in structures of the second type, so that in all probability 
the partial structures do not constitute an essentially more general class than 
the total ones. : 

DerFINiTIon. Let (A,B) and (A*,B*) be systems in the group structures & 
and @*, and suppose there exist homeomorphisms between A and A*,B and 
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B*,AB and A*5* such that if a — a*, b — b*, then ab — a*b*. We shall say 
that (A,B) and (A*,B*) are isomorphic: (A,B) = (A*,B*). 


32. Let G, be a connected (total) group in which there can be introduced a 
coérdinate system a, --- , a, extending over a neighborhood A of e, relative 
to which there exists a system (E,E£) (e « E © EE ¢ A) of class C®. G, is 
called an r-parameter Lie group.* Cartan has proved that if a partial group G 
possesses a system (E,F) (e « £) of class C®, that portion of G* which lies near 
e can be imbedded in a (total) Lie group G,.’* Stated more precisely, there 
exists a Lie group @, and a system (A,A) (e € A & £) in @ which is isomorphic 
to a system (A*,A*) in @,. We propose now to show that every system of 
degree 2 in a group structure G* can be similarly imbedded in a Lie group if 
it is part of a sufficiently regular system of degree 3. 

Tueorem. If (B,A,C) is a system in a group structure @ and if there exists a 
coérdinate system in which (B,A) and (B,AC) are of class C™, there exists a system 
(Ao,Co), where Ao & A, Co | C, and a Lie group G, such that (Ao,Co) is isomorphic 
to a system (A*,A*) in G,. 

Proof. We observe first that (A,C) is of class C™; at least, this will be so 
if we replace A and C by suitable open subsets of themselves,” as is obviously 
permissible. It follows that if we write C = X,c = 2x, ac = a-x, bac = ba-z, 
the results of §§15, 16 are applicable. Consider the system (A2,A2) of §15, 
with the identity ap. Since ao « Az © Ae (A; as in §16), (A3,A3) is a system of 
the same type. We shall show first that there are systems (Aj, A;) (ao¢€A; © 
A;) and (A3;,C;) (C3 & Cs) such that 


(1) (A3,A3) & (A;,C3) . 


Consider the homeomorphisms (§16) of A3: a— v(a) = a, a— (a), a — ula). 
Now let A; be a neighborhood of ap such that A;A3 © As3, and let C} = X; = 


18 Cartan, [2], p. 15. 

1” Cartan’s theorem ((2], p. 19) asserts that for every allowable set of structure constants 
there exists a total Lie group with these constants. An allowable set of structure con- 
stants is determined for @ when (E,E£) is of class C® and the Lie group with the same 
constants has the same structure as G’ in the neighborhood of the identity. 

20 To prove this, note first that since the transformation b — ba is (1,1), we can choose 


| A(ba); : ab ab, (ba); 

A®°C A and B° € B such that | = #~ Owhen b « B,a¢A%, Since — = —~— = 
0a; Oak A(ba); ax 

(summed with respect to the repeated index) and since (B, A) is of the class C®, the de- 

rivatives 0b,/8a, exist and are continuous for B°,A®. In the relation (ba)c = b(ac), let 

a((ba)c); _ 


ay 





(ba);, a:, and c; be taken as the independent variables. Then 0. Hence 


— o(b(ae))i _ a(b(ac)); Ab, , A(b(ac)); A(ac)s 











0 + ¢(ba, a,c, Aa) , 
da 0b, daz d(ac), Aa “ 
| 8(b(ac)); | 
where ¢« — 0 with Aa. By the same argument as above, — | ~ 0 for, say, 
h 


A(ac), 
Aa, 





aeA®C A% ce C*, Hence for these values of a and c lim exists and is continuous. 
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\(A;). Let a, a2 be arbitrary points in Aj. Then ajaz € As and X(a2) = 
Ze, where x2 « X. By the definition of A(a), a2-% = ao-x2 ($16). Hence 
M(Q,d2) = G)Q2-%) = Taja,(Ao-X0) = Ta,Ta,(Ao-Xo) = Ta,(@2-%o) = Ta,(@o-X2) = 
@\-%2 = a,-A(a2) = v(a;)-d(as), and yu(a) is a homeomorphism between A;A, 
and A,-X; = A;C3. (1) now follows from the relation u(a,a2) = v(a;)-d(a2). 
From the theorem of Cartan, there exists a system (Ao,Ao) (ao € Ao & A’) 
and a group @* such that (Ao,Ao) is isomorphic to a system (A*,A*) in GF. 
Taking Cy = (Ao), we have (Ao,Co) & (Ao,Ao) & (A*,A*); and the theorem 
is proved. 


33. Let us now consider isomorphism between realization systems. We shall 
define only a special type of such isomorphism: the systems (A ;xo) and (Az ) 
(A SG, x € ¥, x € ¥*) are isomorphic provided there exists a homeomorphism 
between A-a and A-x> under which a-z) and a-x} (for every a in A) are 
corresponding points. 

TueoreM. Let (G,X) be a realization of a group with a transitive system 
(B-x). If G contains no recurrent partiul subgroups, there exists a transitive 
total realization (@,X*), a point x} and an open set A (e « A © B) such that 
(A -a) & (A-2>). 

This theorem says in effect that the partial realization (G,%), or at least a 
characteristic portion of it, can be considered as imbedded in a total realization 
of G. 

Proof. Let G be a neighborhood of e such that GGGG ¢€ B, and G" = G. 
Then (G,G,G,G;z9) is a transitive system. Let g be the partial subgroup 
defined in §20, and let § = [gq] (§26). Let 2*, y*, --- be the left cosets of §, 
and denote their totality converted into a space as in §22, by X*. 

Let a be an arbitrary point of G, and let z* = bg. Then ab is a left coset 
of gj, and hence a point of ¥*; denote it by a-2*. Suppose a-x2* = a-y*. This 
would imply that if 2* = bg, y* = cg, then ab§ = ac§, and hence bg = c§, i.e., 
x* = y*. It is obvious, moreover, that a-(b-2*) = ab-x*, and finally, the 
continuity of a-z* is a matter of immediate verification. Hence we have a 
total realization (@,X*), which is obviously transitive. 

Let be the space constructed by the aid of g, as in §21. Since g is non- 
recurrent, there exists by §30 an open set A € G containing e and such that 


(1) aj A A = h(a-m) AA, 


when ae A. Let § be the totality of sets (4-29) which meet A, and let ¥% 
be the totality of sets z* which meet A. A (1,1) correspondence can be estab- 
lished between $, and %* as follows: Let x* be an arbitrary element of X¥% 
(it is of the form ag, where a e A). By (1) we have 


(2) zw*AA=agZrAA = Bh(a-m) AA. 


Then 6(a-2o) is a set in 4 and the desired correspondence will be obtained if 
we make z* = a§ correspond to h(a-x9). We assert that the correspondence is 
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bi-continuous. For suppose z= — 2* in ¥%. Then by §25 we may write 
r. = 4,4, 2* = aj,a,—ainA. The corresponding elements in §, are §(a- 2») 
and b(a,-2o), and by §23, b(a,-20) — b(a-2). Conversely, let 5(b,-20), 6(b- 2) 
be setsin H,4. Then we can write §(b,-2) = b(a,-20), b(b-x0) = b(a-xo), where 
a, a, € A and where, by §25, it may be assumed that a, — a. The corre- 
sponding sets in ¥4 are r~ = a,§ and z* = ag, and by §22, x* — z*. Hence 
the correspondence in question defines a homeomorphism ¥% — ©,. 

Under the homeomorphism § — G-2» of §23, S4 corresponds to a subset of 
G-2xo, and from the definition of the correspondence, that set is obviously A - 2. 
On the other hand, it is clear that ¥% is the set A-x>, where x, = §. Hence we 
have a homeomorphism A-z)— A-2z>. Finally, let a be an arbitrary point in A. 
Then a-2» corresponds, under the homeomorphism A -2 — §,, to the element 
bh(a-2o), which in turn corresponds in ¥4 = A-2x> to a§, that is, to a-2>. Thus 
under the homeomorphism, A-z» — A-2>, @-2% corresponds to a-x>, and we 
have (A 3a) & (A;z9). 


34. We shall now reverse the situation of the preceding theorem and assume 
that @ is partial and (G,X) total. In this case the notion of “imbedding” 
depends on isomorphisms of the form (G;X) > (G*;X). This relation means 
that there exists a homeomorphism between G and G* such that*® if ab = c¢ 
in G, then a*b* = c* in G*, and such that if ais an arbitrary point in G, then 
a-x = a*-z for every z in &. 

TuHeoreM. Let (G,%) be a total realization of a partial group, and assume 
that there exists a system (G;X) (e « G & H (see VI, §6)) such that a relation 
of the form g,-x = g2- (91,92 € G) can not hold for every x unless g, = go. There 
exists a total group @*, a realization (G*,X), and a system (G*;X) such that 
(G;X) = (G*;X). 

Proof. Let & be the totality of symbols of the form 


(ajag--- ap) (aceG; t=1,---,p; p2l). 


We define two operations on such symbols. (A) If a,, aay; are a pair of con- 
secutive elements in a symbol and a,a,,; is defined and equal to b, replace 
4,1 by b in the symbol. (B) If a, is an element in a symbol, if a;, a, are 
in Gand a,a, = 4, replace a, by aia... Two symbols, (a; - - - a,) and (0; - - - 6), 
one of which can be obtained from the other by a sequence of operations (A) 
and (B) will be called equivalent. This equivalence is obviously symmetric, 
reflexive, and transitive. Hence & falls into mutually exclusive equivalence 
classes, and the class determined by (a, --- a,) will be denoted by [(a --- a,)]. 
We now define products of classes a* = [(a, --- a,)], b* = [(b; --- b,)] by the 
formula 


(1) a*b* = [(a; --- ap, by --- bg)). 


21 J.e., if aand b are in G and ab is defined and in G. 
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The validity of this definition depends on the obvious fact that if (a, --- a,) 


and (b; --- b,) are replaced by equivalent symbols, the class on the right is 
unchanged. Similarly we define 
(2) a*! = [(a," --- aj')]. 


It is now seen that with products defined by (1), the totality G* of classes is 
a group, the identity is e* = [(e)], and inverses are given by (2). 

We now observe that if a,c « G and a # c, then [(a)] ¥ [(c)]. For suppose 
[(a)] = [(c)]. Then (a) and (ec) are equivalent, and there exists a sequence of 
symbols 


(a), (a’a”), sixee- (bibs aii bg Naat by), (bibs sth bib” aoe by), <9 (c’e”), (c), 
in which all the letters denote points of G, and a’a’’ = a, --- , b/b” = b,,---, 
e’c’’ = c. Now a-2, a’-(a’’-2x), ne b,- (be: ( sie bq: (- - (b,-2)) ve )), oe 
c’-(c’’-x), e-x are all defined, since (G, ¥) is a total realization, and they are 
all equal because of I,, §11. Hence a-x = c-x for every re X. This, by hy- 
pothesis, is impossible unless a = ec. 

It follows that the correspondence 


(3) a — [(a)] 


is (1,1) between G and a subset G* of G*. Moreover, if ab = ¢ in G, then 
by (1) 


(4) ((a)] ((6)] = [(@) (6)] = [©], 
so that product relations are preserved under (3). 

Let Gi, Ge, --- ( G@) be a sequence of neighborhoods closing down on e 
and Gj, G3, --- the corresponding subsets of G*. Let V,(a*) = a*G*, and 


take the totality of V’s to be a complete set of neighborhoods for G*. It is 
easy to see that * is thus converted into a continuous group and that the 
correspondence (3) is now a homeomorphism between G and G*. 

Now let a* be an arbitrary element in @*, say a* = [(a;a2 --- a,p)], and let 
a*.x = a-(a@2-(--- @)-x) ---). The validity of this product definition depends 
on the fact that a*-x, as we have seen, is independent of the particular symbol 


(a, --- ay) chosen to represent a*. In particular, if a* = [(a)] (a e G) so that 
aand a* correspond under (3), then a-2 = a*-x. It is clear, moreover, that 
a*.(b*-2) = a*b*-x. We have finally to establish the continuity of a*-z. 


Su se first that q, - e* In @. For n > N, an € G, and hence we ma 
’ ’ 
write 


ay = [(a,)] (a, €G,n > N), 


and sinée (5) is a homeomorphism which preserves products, we have a, — e. 
Hence a*-x = a,-x — x. The general case follows immediately. 


35. Lie groups. Let G, bea Lie group (§32) and let the coérdinates a', -- - , a” 
be a canonical system; in such a system, e is at the origin. Moreover, if g is 
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a closed subgroup of @,, it follows from a theorem of Cartan ([2], p. 24) that 
there exists a spherical neighborhood A» (& A) of e such that g A Ao is a flat 
p-cell (0 S p Sr). We may assume that the flat p-space which contains g A A 
isa@e*!} = --. = a? = 0. Let f be the flat space orthogonal to g at e, and let 
B; be a spherical neighborhood of e with radius 6. Let the projections of a 
point a = (at, --- , a") on f andg bea’ = (0, --- ,O,a**!, --- , a") anda” = 
(a', --- , a*, 0, --- , 0), so that (symbolically) a = (a’,a’’). Let f, = FA B, 
gs = g A B;. The totality (f., gs) of points (a’,a’’) which are such that a’ « f, 
a’’ ¢ g is homeomorphic to the topological product f, X gs and is therefore an 
r-cell. We assert that if « and 6 are small enough, the single-valued correspond- 
ence (a’, a’’) — a’a”’ (a’ « £,, a’’ € gs) is homeomorphic between (f,, g;) and 
f.gs. Tosee this, letc = (c!,---,c")=a’a’’. Thec’sare functions of a’, ---,a’, 
and on account of the continuity of their derivatives, we have only to show that 

bes ~ 0. 


' da’ a'aa*** warad 


In a canonical parameter system, the group product of two elements in an 

infinitesimal neighborhood of e is obtained simply by adding codrdinates. 

Hence ci = a‘ + e' (a', --- , a") when lim ¢'/r = 0, 7 being, say, the distance 
r—0 

from a to the origin. It follows immediately that the value of the determinant 

is 1; and this proves our assertion. The set f.q; is therefore an r-cell, and hence, 

by the invariance of regionality, an open set in G,. 

It follows further that if a’ is an arbitrary point of f,, then a’g; has only the 
point a’ in common with f,. Now the set g-q; is at a distance = 6 from e, 
and hence, if ¢ is sufficiently small, a’(g — gs) will be near enough to g — aq, 
to avoid meeting f,. Hence for such an e, a’ meets f, only at a’. Consequently 
the correspondence a’ — a’g between points and cosets is (1,1) when a’ « f,. 
If the cosets a’g are regarded as forming a subset X, of © (§22), the corre- 
spondence is a homeomorphism. Its continuity, in fact, is readily seen. As 
for that of its inverse, suppose that b/g — b’g (bi, b’ « £.). Then, by §25, 
choose a* such that a* — b’ and a*g = b’g. Now b’ = b’e is contained in 
the open set f,g;, and hence almost all the a*’s are of the form a,a‘, where 
a; ef, a’ eqs. Since a,a,, — b’, we have (a,, a”) — b’. Hence a, — b’, 
a, —0. Since big = a,a‘q = a’g, we must have b, = a,,sothatb, +b’. Let 
e and 6 now be fixed so small that the situations just described hold. 


36. Let » < min(e, 6), and let L, be the r-cell f,g,. There can be chosen 
({10], p. 19) a sequence a,, a,2, --- such that G, = 2; a,:L,. The sets 


aL, (i= 1,2, --- ;7r =9,9/2,9 3,---) 
are r-cells and obviously form a denumerable complete set of neighborhoods 


for @,. Hence G, is a topological manifold. Since g is closed, it falls into a 
discrete set of connected homeomorphic pieces, and the piece which contains e 
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is a Lie group ([2], pp. 22-24). Hence g consists of a discrete set of topological 
manifolds. 

The realization (@,, ) defined by a-bg = abg (cf. §33) is transitive, and 
hence § is homogeneous. Now since the sets L, are open subsets of G,, the 
sets L,-q are open subsets of © (§25, corollary). Since L,-¢ = f4,-¢ = 
f.q;4 = f,g, and since, by the preceding section the subset f,g of S is homeo- 
morphic to f,, i.e., to an (r — p)-cell, the sets 


aL, -Q (¢ = 1,2, ---;7 = 9, 9/2, --- ) 


clearly form a denumerable complete set of neighborhoods for §. Since G, 
is connected, § is obviously so. Hence § is a topological (r — p)-manifold. 

Suppose in particular that (G,, ¥) is a transitive total realization of G, and 
that g consists of all the points a such that a-% = 2. Then X, being homeo- 
morphic to  (§23), is a topological (r — p)-manifold (cf. [2], p. 14). 


37. Turorem. Let G be a Lie group and (G, %) a transitive total realization, 
X being connected. If a transformation x — b-x leaves invariant the points of 
an open set in X, it leaves invariant all points of %. 

Proof. Suppose X is an open set such that b-2 = 2x for every x « X, or 
symbolically, b-X = X. Let g be the totality of points a such that a-X = X. 
Let x» be a point in X and let G,, G2 --- be connected neighborhoods closing 
down on e. Choose n so large that G,-x) € X, and choose m so large that 
G;,'Gm & G,. Let g and h be points in G,, and G, respectively. Then 


(1) gag *- (gh-x0) = gg(h-x) = gh-xo, 


since h-a « X. Now for each g in G,,, the totality of points gh—that is, the 
set gG,—contains G,,; for since g"G,, & G,, we have gG, 2 G,,. Hence for 
each g we have from (1) gqg-!-x = a for every z in G,,-2%. This relation 
holds a fortiori if g eG, (k = m). 


Let 
ae = {gag g ¢ Ge}, be = Ge + Gage + --- (k =m). 
We have 
(2) beg Gar Gh Sb. (k 2m), 
(3) fe-to = 2 for 2eGm-%% (k =m), 
(4) bm 2 bmi D-: 


Each 6, is a closed subgroup of G, and is therefore a discrete equi-dimensional 
set of mutually exclusive connected pieces, and each is a topological manifold 
(§36). Let 5, and 5t be the connected parts of §, which contain e and b. If 
h is an arbitrary point of ht, we have hf = hb?. 
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On account of (4), there exists an integer g = m such that 


(5) dim §, = dim hygy1 = --- . 
Let p (> q) be so large that G,G, € G,. We assert that if c e G,, then 
(6) che & by. 


For let cje~ (7 € 6p) be an arbitrary point in ch,c~'. Then 7 is a product of the 
form 


TI ogo! QgQeGpji=1,---,, 


i=l 


and hence 
t 
cje! = I] (cigs) g(cags)—. 


Since cg; « G, (¢ = 1, --- , t), we have cje~ € h,. Hence ch,c~ © b,, and this 
relation obviously holds when we pass to the closures. 

Obviously ch?c~ is that connected part of ch,c~! which contains e, and since 
it is homeomorphic to 5%, it is of same dimension as 5%, and by (5) of same 
dimension as §?. By (6) it follows that ch?c~ is a subgroup of 52; but since 
the dimensions are equal and both are connected, it follows from a theorem of 
Schreier [10] that ch}c-! = 5%. In the same way it follows from (4) and (5) 
that 6) = 63. Hence 


(7) chtct = 6;. 


Now ch*c-' ¢ 5, on account of (6). Since c « G, and G, is connected, we can 
join e to c by a path c(t), (c(0) = e, e(1) = ec) in G,. The successive sets 
c(t)h*c(t)— constitute a deformation of §*; but they are all contained in 5,, and 
hence in the connected piece 5%. Hence 


(8) chzct & b5. 


Now ch*c™ is clearly one of the connected parts of che. Hence if d is a point 
in ch*c~!, we have ch*ce! = d(chic~). By (8), d is also in 5%, and we have 
h* = db?. Hence by (7) 


(9) ch$c* = 6%. 


Let X, consist of the points z such that §*-2 = z. On account of (3), 
X, #0. Obviously X, is closed; it is also open. For, if y « X,, then §*(c-y) = 
ch*cc-y = ch*-y = c-y, by use of (9). Since c is an arbitrary point in G,, 
c-y is an arbitrary point in G,-y. Hence §*-2 = 2, for every x ¢ G,-y, and 
G,-y | X>». Since (G, ¥) is transitive, G,-y is open by §25. Hence X, is 
open. Since X is connected, it follows that X, = %, and since b « §* we have 
b-¥ = X. 











272 P. A. SMITH 


38. A remark about essential parameters. We have asserted (§17) that the 
assumption II’ is a consequence of the assumption that the “parameters be 
essential”. We shall now prove that this holds, in a sense which will become 
obvious, when (G,, X) is a transitive total realization of a Lie group. Suppose 
on the contrary that II’ is false. There exists a point a in @, and sequence 
a, — a, b, > a (a, * b,) such that a, = by, ($17). Let c, = b;"a,. Obviously 
c, = e, andc, — e. From the preceding theorem, the relation c, = e implies 
that 


C,X =X (n = 1,2, ---). 
Let g consist of all the points a such that a-¥ = X¥. Then g is a closed sub- 
group of G. We assert that dimg>0. Letc = {a,c ---}. Sincea, ¥ b,, 
the set cis infinite. Let @ be the closure of the set ce + cc + ---. Sincec, — e, 


é contains at least one 1-parameter subgroup” of G,, and since @ € q, dim g 2 1. 
g is obviously invariant, and the group @/g may be converted into a space by 
identifying it with the space  (§22) formed with the cosets of g. Since g is 
closed, the results of §35 are applicable; hence is a topological manifold, and 
we have dim § = r — dim g, so that dim § < dim @,. On writing ag-z = a-x 
we obtain a representation (§, X). The correspondence a — ag between G, 
and © is clearly single-valued and continuous. Hence on introducing cartesian 
coérdinate systems in neighborhoods of the identities of @, and §, the corre- 
spondence a — ag assumes the form 


a—a* [a = (a, --- , a,), a* = (aj, --- , a*), p < rl, 


so that the a*’s are single-valued continuous functions of the a’s. Since the 
relation a-x = a*-zx holds identically in x, the number of parameters in the 
functions a-x can be reduced; that is, the parameters in (G,, ¥) are not essential. 
This contradiction proves the theorem. 


The fundamental groups 


39. Let (G, X¥) be a transitive total realization of a Lie group. In this section 
we obtain a relation between the ranks of certain fundamental (Poincaré) groups 
which occur in connection with (G, %). 


PRELIMINARY LEMMAS. Let [a, b,c, --- , d] denote a path in © beginning at a, 
passing successively through b, c, --- andendingatd. Let [y,z] be a path in ¥ 
beginning at y and ending at z. A relation of the form fa, b, --- , d]-x = 
[y, 2] will mean that while a point a’ traces the first path from a to d, the point 
a’-x will trace the second, from y toz. In case y = z, a’-z is to trace the closed 
path [y, y] in the positive direction. 

Lemma 1. Let = be a closed path in X defined by the function &(t) of period 1. 


22 This will be obvious on referring to Cartan, [2], p. 23. 
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There exists a 6 > 0 such that for each t there is an arc [e,a] in @ with the property 
that 


the path on the right being the subpath of & traced when the parameter is increased 
continuously from t tot + 6. 

Proof. Suppose the lemma is false. There exists a sequence t, ts, --- such 
that a relation of the form [e,a]-é(t,) = [&(t), E(t. +1/n)] fails to exist for 
n= 1,2,---. On passing to a subsequence, if necessary, we may assume that 
t, >t (mod 1). Let 2 = £(£), let g be the subgroup of G corresponding to xo, 
and form § with the cosets of q (§22). Let X, and f, be defined as in §35. 
X, is an open set containing x. Hence \ > 0 can be chosen so small that 
the path 


(1) [g(@ — A), E(@ + )] 


is contained in X,. Since X, is homeomorphic to f,, each point é£(t) of this 
path corresponds to a point k(t) in f,. The correspondence is such (§35) that 
k(t) = e, and k(t)-m% = &(t) (E —- >A StS f+). The second relation can be 
written in the form 


(2) [k(t — d), k( + d)]-E() = [EE — ), EE + Ad). 


For m sufficiently large, the path [E(t,.), (tm + 1/m)]isasubpath of (1). Hence 
there exists a subpath of the path in the left of (2), say [a’, a’’], such that 
fa’, a’’]-&(t) = [E(tm), E(tm + 1/m)]. In particular, a’-&(@) = &(t,.). Hence 


(3) [a’, a’"}-((a’)*E(tm)) = [E(tm), E(tm + 1/m)). 


The path [a’, a’’] [(a’)-']}—that is, the path traced by b(a’)~' as b traces [a’, a’’}— 
is of the form [e, a], and by (3), [e, a]-E(tn.) = [E(tm), E(tm + 1/m)]. This isa 
contradiction. 

From now on let 2 be an arbitrarily chosen but fixed point in %, so that the 
sets denoted by the symbols g, f., 5, etc., defined relative to x9 (§35) are fixed 
from now on. 

Lemma 2. Let — = [20, 2] be a path in ¥ beginning and ending at x. There 
is a path [e, a] in G such that [e, a]-29 = [xo, Xo). 

Proof. Suppose é is defined by &(t), of period 1. Choose a 6 with the property 
described in Lemma 1, and let points a = £&(to), E(t), --- , E(tea), E(t) = Zo, 
be chosen such that 


| t; — tisa | < 6 (mod 1) (¢ =0,---,k — 1). 
By Lemma 1, there are ares [e, a;] such that 
(4) le, a] - Xo = [E(to), &(t)], le, aa] - E(t) = [&(4), E(ts)], ete. 


Since &(t:) = a:-2o, E(t) = G2ai-%o, --- , the left members of (4) can be replaced 
by [e, a:]-20, ([e, @2]a1)- 20, ([e, @3]a2a;)-20, --- . The symbol [e, a;] + [e, aeja, + 
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[e, as]aga, + --- + fe, ala --- a evidently represents a path, since each 
path in the sum begins where the preceding one ends; it is of the form [e, al], 
and because of the relations (4), [e, a]-2o = [2o, xo]. 

Lemma 3. Suppose a(t) and B(t) (£ < t < t’) define paths in © such that 


(5) a(t)-% = B(t)- 20, a(?) = (2). 


Either path can be deformed to the other in such a way that its initial point remains 
fixed while its final point remains in a fixed coset of g. 

Proof. We may obviously assume that ~ = 0, t’ = 1. Because of (5), 
the point a(t), for each ¢t, lies in the same coset of g as does B(t). We may write 
B(t) = a(t) y(t) (0 S t S 1), where y(¢) defines a path ing. Let 


a(t) <t<s<)}), 
B(t,s) = 
«(S=*) te - 9 O<s 





IIA 


— t < 1). 
Observe that A(t, 0) = B(t) and B(t, 1) = a(t), while, for each intermediate s, 
A(t, s) is a path beginning at a(0) and ending at the point a(1)y(1 — s) on the 
coset a(1)g. A(t, s) therefore defines the desired deformation. 

Lemma 4. Let a set of points in X be defined by é(t, s) (0 S t, s S 1), the 
function & being continuous in (t, s) and such that £(0, s) = &(1, s) = E(t, 0) = 2%. 
Let s be a fixed number = 0 and S 1, and let a(t) define a path such that 


(6) a(t)-xo = &(t, 3) (0st 1). 


Let t be an arbitrary number such thatO < tS 1. There exists an h > 0, inde- 
pendent of t, and a continuous function a(t, s) defined for 


(7) |é-t|<h, l\§—s|<h (0st<1;0 


lA 
IIA 


8 1) 


such that for these values a(t, 8) = a(t), a(t, s)-% = é(t, 8). 

Proof. Let g(t, t) = a(f)— a(t), W(é, t, s) = a(t) E(t, s). Now off, t) = e 
when t = #, and because of (6), Y(t, t, s) = a» whent = tands = §. Hence 
by the principles of uniform continuity, there exists an h independent of t, 7 
and s such that (see §35) 

(8) o(f, t) € F.gs, when |\¢—t| <h, 
V(i, t, s) € X., when |\¢-—t| <h,|s—8|<h. 


Now let 7 be fixed. Because of the correspondence X, — f, (§35), there exist 
points x(t, s) in f, [x being continuous in (¢, s)] such that 


(9) x(t, 8)-20 = ¥(é, t, s) = a(t)&(4, 8). 


From the first of the relations (8), there exist continuous functions «(t), y(é) 
such that x(t) € f., y(t) € gs and «(é)y(t) = ¢(%, t) = a(t)“ a(t). The functions 
x(t, s), x(t), y(t) are defined for values of t, s which satisfy (7). Since «(t)-x = 
k(t)y(t)-2 = a(f)-ta(t)-2, and, by (9) and (6), «(t, 8)-ro = a(Z)“E(t, 8) = 
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a(f)~'a(t)-20, it follows that «(t)-2o = x(t, 3)-2%. Therefore, since the points 
x(t) and x(t, §) are in f,, we must have 


(10) x(t) = x(t, 8). 


Let a(t, s) = a(i)x(t, s)y(t) when s, t satisfy (7). Then a(t, s) = a(Z)x(t, 3y(t) = 
a(é)«(t)y(t) = a(é)(a(é)“a(t)) = a(t), by use of (10) and a(t,s)-x% = 
a(i)x(t, s)-%o = &(t, s), when t, s satisfy (7). Hence a(t, s) has the desired 
properties. 

Lema 5. Let &(t, s), a(t), 5, h be defined as in Lemma 4. For every s’ such 
that 


(11) 0<s' <1, |s3—38’| <h, 


there exists a path a’(t) satisfying a’(t)-% = &(t, s’) deformable into a(t), end 
points remaining fixed. 

Proof. Let to = 0, ti, te, --- , tea, te = 1 be an increasing sequence such 
that tis, — t; < h. Then by Lemma 4 there exist functions a,(t, s) (i = 0, --- , 
k — 1) such that 


(12) ai(t, 8)-% = &(t, 8) ((; St Stn, |s—8| Sh,0 58 1), 
(13) ai(t, 3) = a(t) (t; St S ti). 
Let s’ be fixed, and let 
a(t, 3) = b:, arti, 8’) = ¢;, alti, 8’) = cj, (6 = 0,---,k — 1). 
Note that if we define b, = a(1), then 

ai(tiss, 3) = aspl(tion, 3) = bis (@=1,---,k—1). 


For definiteness, let us suppose that s’ > § Then for the values of s, ¢ such 
that t; St S tin, § S 8s S 8’, the points a;(t, s) constitute a singular rectangle 
with vertices bj, ¢;,¢;, bis: One edge of this rectangle coincides by (13) with 
the subpath [6;, bi,:] of a(t). The remaining edges constitute a second path 
[bi, ¢;, ¢:, bi4:] joining b; to by,,. This path can be deformed across the singular 
rectangle to the path [b;, b:.:], end points remaining fixed. Now observe that 
as) ai(tiss, 8)-Zo = aizs(tizs, 8)-%o = E(tins, 8), 

ai(tisa, 8) = aigiltigs, 3) = a(tizs) (§Ss Ss 8’). 
Hence by Lemma 3, the path [bi,:, ¢;], defined by a;(tis:, 8), can be deformed 
to the path [bi.1, ¢:4,], defined by ai4:(tis1, 8), the initial point remaining fixed 
and the final point tracing a path [c;, c;4,] in some fixed coset of g, so that 


(15) [c’;, Ci41)-Zo = C+ Xo -™ Ciaa°Z. 
Let [c;, c;] be the path defined by a(t, s’) (t; < t < ti:). Because of (12), 


(16) [ci, c:]-t0 = [E(ts, 8’), (tina, 8”)], 
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and hence if we join the paths [bo, c4], [co, Col, [eo, ci], --~ 5 [ex—ay Cx—alyl Cea, bal 
forming a path a’, as a point a moves from bp to by along a’, a-x moves along 
the path defined by é(¢, s’), because of (16), pausing (by (15)) at each point 
£(t:, s’) while a traces [¢,, c;4,] and pausing at £(0, s’) and £(1, s’) while a traces 
[bo, cg] and [c;_,, b:]. Hence a’ can be defined by a suitably chosen function 
a’(t) such that a’(t)-% = é(t, s’) (0 St $1). Itis clear from the construction 
of a’ that it can be deformed to a(t), its end points remaining fixed. 

TueoreM.” Let a(t) (0 S t S 1) define a path in © beginning at e and such 
that a(1)-2% = x. If the closed path in X, defined by a(t)-2xo, is deformable to x» 
through a family of paths beginning and ending at x, then a(t) can be deformed 
to a path in g through a family of paths beginning at e and ending at points in g. 

Proof. The assumption that the path a(t)-zo is deformable to a point im- 
plies the existence of a function é(¢, s) satisfying the conditions of Lemma 4 
and such that £(t, 1) = a(t)-a%. For each s, é(t, s) is a path of the form [z, xo], 
and hence by Lemma 2 there exists a path defined, say, by a,(t) beginning at e 
and such that a,(t)-z9 = &(t, s). We may in particular take a;(t) = a(t). The 
family a,(t) is not necessarily continuous in s. Now by Lemma 5 there exists 
for each § in the closed interval [0, 1] an open interval o; such that, if s’ is in 
o; and in [0, 1], there will exist a path «,.(t) which can be deformed to a;(t), 
end points remaining fixed and such that a/.(t)-2») = £(s’,t). Let a finite cover- 
ing subset of intervals, o1, --- , gu, ¢v, --+* , oo, be chosen and arranged so that 
1,---,u,v,---,0isa decreasing sequence. There exists a w between u and », 
and contained in both o,, ¢,. Hence there can be chosen paths a(t) and a(t), 
the first deformable to a,(t) and the second to a,(t), with end points always 
remaining fixed and such that a,(t)-x = a(t) (= &w, t)). It follows by 
Lemma 3 that a,(¢) can be deformed to a(t) in such a way that the terminal 
point moves along a fixed coset of g, the initial point remaining fixed. Hence 
a,(t) can be similarly deformed to a,(t), and hence a(t) in a finite number of 
steps to ap(t). Since the terminal point of a;(¢) is in g, it remains in g during 
the deformation. Since ao(t)-x) = E(t, 0) = x, the path ap(é) is in g. 


40. Fundamental groups. Let the fundamental group G(@) be defined by 
the paths beginning and ending at e, and G(X) by the paths beginning and 
ending at 2. Under the correspondence a — a-2o, the image of G(@) is a sub- 
group G°(X) of G(X). Let qo be that connected part of g which contains e, 
and let G°(G) be the subgroup of G(G) consisting of all the paths which are 
equivalent to paths in go, that is, those paths that can be deformed to paths 
in go through families of paths beginning and ending at 2%. We assert that 


(1) G(®)/G(G) = G(X). 


Since the image of every path in gp is the point 2, every element in G°(@) 
corresponds to the identity in G(X). Conversely, every element of G(@G) 


23 This theorem is implied by Cartan in his remarks on fundamental groups ({2], p. 27), 
and again by Ehresmann ([3], p. 399). 
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which corresponds to the identity in G°(X) is an element in G°(G). To see this, 
let E(t) (0 S ¢ S 1) be a path beginning and ending at 2», and a(t) a path be- 
ginning and ending at e and such that a(t)-x = &t). Suppose that £(¢) is 
deformable to z) through a family of paths beginning and ending at x. By 
the last theorem, a(t) is deformable to a path in g through a family a,(t) 
(0 S s S 1, a(t) = a(t) of paths beginning at e and ending in go. Let a’(s) = 
a,(1). Then a’(s) defines the path traced in g) by the terminal point of a(t). 


Let 
2t ) 
| 8 eens (0 
Bi) =~ (=> s 


| a’(—2t + 2) (l—s/2sts 1). 


IIA 


¢s 1 — 8/2), 


It will be seen that 8,(t) defines a deformation of a(t) to a path in go through a 
family of paths beginning and ending at e. The resultant path in gp is a union 
of the paths defined by a(t) and a’(t). We have now established the rela- 
tion (1). 


41. Let the maximum number of linearly independent elements in an abelian 
group be called its rank. It is well known™ that the fundamental group of a 
connected continuous group is abelian. In particular, the subgroup G°(@) is 
abelian, and from its definition it is clear that its rank can not exceed the rank 
of G(go). Moreover, the rank of the fundamental group of a connected con- 
tinuous group of r dimensions® can not exceed r. Hence 


(1) rank G°(@) < dim gp . 
Moreover, it follows from (1), §40, that 

(2) rank G(@) = rank G°(X) + rank G°(@) , 
and from §35 that (since dim g = dim Qo) 


dim G = dim go + dim %. 


24 Schreier, [10]. 

2 Smith, [13]. Application of the results of [13] requires that @ be subdivisible into a 
simplicial complex. (Such a subdivision will automatically satisfy (a) and (b) of p. 210, 
since @ (as well as X) is a topological manifold (§36).) We can choose a coérdinate system 
about e in which the functions (ab)* will be analytic (Schur [11]). Let o be a spherical 


region with center at e, and let @:, a2, --- be chosen such that © aio = G (§36). The cells 
a,0, a9, --- and their boundaries form a subdivision of © into pieces of r, r — 1, --- dimen- 


sions, and if the radius of ¢ is small enough, this subdivision will be, at least locally, ana- 
lytic in character. Hence it can be locally further subdivided into simplexes by the methods 
of [7] or [8]. There is no essential difficulty in coérdinating the local simplicial subdivi- 
sions into a simplicial subdivision for the whole of G. The same remarks apply to Qo. 
With regard to %, we remark that among the coérdinate systems for @ in which the (ab)‘ 
are analytic there are canonical systems. It then follows readily from §35 that, such a 
system being chosen for G, there exists a coérdinate system in ¥ extending over a neighbor- 
hood of zo in which the (a-z)‘ are analytic. By an argument very much like the one just 
outlined, this leads to a simplicial subdivision of ¥. 
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Hence we have 
(3) rank G(@) + dim ¥ S rank G(X) + dimG. 


Thus, for example, an r-parameter Lie group ©, can not operate transitively in 
a euclidean space ¥ of n dimensions if the rank of the fundamental group of ©, 
exceeds r — n. 

We assert finally that rank G°(X) < dim ¥ = n. For suppose, on the con- 
trary, that there exist n + 1 paths a(t), --- , @n4a(t) beginning and ending at e 
such that the paths defined by a;(t)-2, --- , @ns:(t)-x9 are linearly independent. 
The function 


(4) f(t, ah tn) = ay (ty) ao(te) eee Oin(tn) - Xo ’ 


where t, --- , ¢, are independent variables varying between 0 and 1, defines 
a single-valued continuous mapping of an n-dimensional torus 7 on ¥ such that 
the n “parameter curves’ of + are mapped on the paths defined by ai(é), --- , 
a,(t). Let ¥ be the covering space of ¥ which belongs* to the subgroup T 
of G°(¥) generated by a(t), --- , @nsi(t). The fundamental group of % is” I, 
and is therefore abelian. Furthermore, an examination of the construction of 
¥ shows easily that with the aid of (4) there can be defined a mapping of 7 
on ¥ such that the parameter curves are mapped on the first n generators of I. 
But since the rank of Tis nm + 1, such a mapping is impossible,* and our asser- 
tion follows. From (2) and the fact that rank G°(G) < rank G(q), we have 
rank G(G) — rank G(g) < dim X¥. Thus, for example, if ¥ is n-dimensional 
and rank G(G) = n + s, then rank G(go) = s. 


BaRNARD COLLEGE, CoLtuMBIA UNIVERSITY. 
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A CLASS OF QUATERNION ALGEBRAS 
By James H. D. TELLER 


1. Introduction. Let % be a rational generalized quaternion algebra with 
basis elements 1, 7, j, ij, where 7? = —1, 7? = a, ij = —ji. Without loss of 
generality we may take a odd and either +1, or a product of distinct primes of 
the form 4n + 3 or the negative of such a product.! Latimer*® has proved 
theorems similar to those of this paper, which include the case a = 1 (mod 4). 
Hence we assume a = 3 (mod 4). Then % is a division algebra and the set G 
of integral elements containing the basal elements 1, 2, j, 77 consists of all elements 
x + py, where z, y range over the set G of Gaussian complex integers and 
p = n(1+ J), where* » = 3(1+ 7). The conjugate of X = a+ bi 4+ ej + dij 
is X’ = 2a — X and the norm of X is N(X) = XX’ = @+4+ 8B — ac? — ad’. 

We shall show that there is a one-to-one correspondence between the classes 
of left ideals in G and those classes of binary Hermitian forms arz’ + (b/2) 2’y + 
(b’/2)xy’ + cyy’ which represent positive integers, where a, c are rational 
integers, b is in G, x and y range over G, b’, x’, y’ are the conjugates of b, zx, y 
respectively and bb’ — 4ac = 2a. If a > 0, we prove two theorems on the 
existence of a g.c.d. and on the factorization of elements in G, almost identical 
with certain theorems proved by Dickson for the case a = —1, i.e., the case 
where @ is the set of Hurwitz integral quaternions. 

Latimer in the paper cited above proved similar theorems for the sets x + jy, 
where 2, y range over the integral elements of a quadratic field. The correspond- 
ing forms in his case were arz’ + ba’y + b’ry’ + cyy’, where the same assump- 
tions hold for the coefficients except that bb’ — ac = a. If b is a rational 
integer, and zx, y are restricted to the set of rational integers, Latimer’s forms 
become classic binary quadratic forms az? + 2bry + cy*, while those of this 
paper become non-classic forms az? + bay + cy, b odd. Latimer’s paper* 
extends the factorization theory of the Lipschitz integral quaternions; this 
paper makes a similar extension of the theory of the Hurwitz integral quater- 
nions. 


Received August 19, 1935. 

1 Dickson, Algebras and Their Arithmetics, p. 192. 

2 Latimer, On ideals in generalized quaternion algebras and hermitian forms, Transactions 
of the American Mathematical Society, vol. 38 (1935), pp. 436-446. 

3 Dickson, loc. cit., p. 192. 

‘ Latimer, loc. cit. 
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2. Basis of an ideal in G. The following equations which may be verified 
are used in the sequel. 


(a) P =p—e, 

(b) N(p) = «, 
(1) (c) zp = pr’ + (x — x’)n, 

(d) xp’ = p’x’ + (x — 2x’)n’, 

(e) N(X) = N(X) + eN(Y) + (i + yw — (1 — yr, 
where e = (1 — a)/2,¥ = X + pY,X =214+ wt, ¥Y = w+ ow. 

Ideal, equivalence of ideals, class of ideals, basis of an ideal with respect to G, 
and proper basis are defined as in Latimer’s paper,’ with # replaced by », 
except that the phrase “non-singular” is replaced by “not equal to zero’’ as 
is a division algebra. 

Lemma 1. Every ideal ¥ in © has a basis a, b + pd where a, b, d are in G. 

Let a; + pd; be the set of elementsof%. Thesetofd;haveag.c.d. d= 22d, 
where the z’s are in G. Then by (lc) = z,(a; + pd;) = b + pd is an element 
of Y%, where b isin G. Let the g.c.d. of the complex integers in 2 be a. Then 
by employing (1), it will be found that a, b + pd form a basis of %. We shall 
write ¥ = [a, b + pd]. 

Lemma 2. Every ideal 2 in G is equivalent to an ideal [a, b + p], where a is an 
odd rational integer and b is in G. 

Let %1 = [a:, b: + pd,] be an ideal in G. Since pa; and (p — 1)(bi + pds) 
= —ed, — b; + pb; are elements of &, a: = ad;, b; = bd,, where a, b are in G. 
Consider 2 = [a, b + p], fd: = &%:. Since N(d;) > 0, Lis equivalent to &:. 

It remains to show that a is an odd rational integer. Since pa — a’(b + p) = 
(a — a’)n — a’b and (p — b’ — 1)(6 + p) = —e + (6b — b’)n — b(b’ + 1) 


are in %, we have 


@) (a —a’)n—a’b=0 (mod a), 
e — (b — b’)n + B(b’ + 1) = O (mod a). 

Setting b = b, + bei in (2) we find 

(3) (2b, + 1)? + (2b. + 1)? — 2a = 4ac, 


where c is in G. Since a = 3 (mod 4), a and ¢ are prime to 2. Let a = ta, 
where ¢ is an odd rational integer and a; is in G and not divisible by a rational 
integer > 1. Then a; is prime to a;. Multiplying the left member of (2;) by 
1 — cand noting that a; is prime to a we find 


(4) 1+b(1 — 7) =0 (mod a). 


Multiplying the left member of (22) by 1 — 7, reducing by means of (4) and then 
multiplying by 1 + 7, we find 


(5) b(1 — i) + 2e =0 (mod a). 


5 Latimer, loc. cit. 
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From (4) and (5), 
(6) 1—-2=a=0 (mod ay). 


Since a has no non-rational complex prime factors, a7 = +1, +7. Hence 
we may assume a; = 1, and therefore a is an odd rational number. 

If an ideal & has a proper basis w, w2; #; = gia + gize (¢ = 1, 2), where the 
determinant | g,; | is a positive rational integer, then | gi; | is defined to be the 
norm of 2 and written N(2). It may be shown that N(®) is independent of 
the particular proper basis employed. If £; = taw: + tiwe (¢ = 1, 2), where 
the w’s form a proper basis of &, it may also be shown that the ¢’s form a proper 
basis if and only if the ¢’s are in G and |t,;| = 1. By the proof of Lemma 2 
every ideal % in @ has a proper basis [ad, bd + pd], where a is a positive odd 
rational integer and b, d are in G. Hence N(%) = add’. If & is an element of 
@ of positive norm, it may be shown that N(LE) = N(®) N(é). 


3. The class of forms corresponding to anideal. If a, c are rational integers, 
bisin G, x, y range over G and b’, x’, y’ are the conjugates of b, xz, y respectively, 
then 
(7) f(z, y) = axx’ + dbx'y + 9b’xy’ + cyy’ 
will be said to be a Hermitian form of discriminant bb’ — 4ac. If fi(a1, y:) is 
obtained from f by a linear homogeneous transformation of determinant unity, 
f and f; will be said to be equivalent. Equivalent forms have equal discrimi- 
nants. All the forms equivalent to a given form will be said to form a class. 

Let ¥ = x + yp be an element of G. By (1) p¥ = (x — 2’)n — y'e + 
[x’ + y’ + (y — y’)n]o. The determinant 

z y 
(x—2')n—ye xv +y'+(y—y')n 
will be found equal to N(X) as given by (le). 

Let & be an ideal with proper basis w:, w2; wi = ga + gine (¢ = 1,2). Since 
each pw; belongs to %, we have 
(8) pw; = bio: + diewe (i = 1, 2), 


where the b’s are in G. The general element of 2 is ¥ as written below, where 
z, y range over G. 


X = To, + yoo = (Gur + gay) + (git + goy)p, 
pX = lo: + lewe = (guli + gale) + (Gioli + goale)p, 
where 1; = bux’ + bay’ + (x — 2’)n, le = biex’ + dooy’ + (y — y’)n. Then 


gut + gay Git + Jy 
Juli + gale  —giali + Goole 


zr sy 
lL, l, 


gu 9Ji2 
ga 922 


N(X) = 


= f(z, y) N(Q, 
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where 


zx 


(9) f(z, y) = = berr’ + (yn — budx’y + (bee — n)ry’ — bayy’. 








1 2 


f(x, y) will be said to correspond to the proper basis a, we of &. 

We have seen that ¢; = tw: + tiewe (¢ = 1, 2) form a proper basis if and only 
if the ¢’s are in G and |t,;| = 1. The form corresponding to such a basis is 
fila, yx) = N(aihi + yate)/N(Q), where ¥ = afi + ys: = Toi + yor, 


zx 


(10) tur + tay, 
tied: + leoyi. 


y 


Hence f is equivalent to f;. Conversely, if f is transformed into f; by a trans- 
formation (10), the ¢’s being in G and | t,; | = 1, then f, is the form corresponding 
to the proper basis {1, f2, (; = tiw: + tigwe (¢ = 1,2). Hence there is a one-to- 
one correspondence between the proper bases of % and the forms in the class C 
containing f. We shall say that 2 corresponds to C. 

Multiplying f(z, y) of (9) by 2, we have 


2f (x, y) = 2bierx’ + [1 + i — bu] x’y + [2bee — 1 — i) zy’ — Aayy’. 


Since f(z, y) = N(%)/N(® is rational, 2f is rational and is in @ for every 
x, y in G, and therefore is a rational integer for every such z, y. It may then be 
shown that 26:2, 2b2: are rational integers and that the coefficients of z’y and zy’ 
are conjugate complex integers. Since the b,; are in G, bi: and ba are rational 
integers and f (z, y) of (9) is a form of type (7). 

Lemma 3. If C and C, are classes’ of hermitian forms which correspond to the 
ideals L and & respectively, then C = C, if and only if 2 and %, are equivalent. 

Let f(z, y) of (7) be aform in C. We may assume without loss of generality 
that a = 0. Suppose C = C;. Then /f corresponds to a proper basis w, w2 of & 
and to a proper basis [1, 2 of &. From (7), (8), (9) we have 


por = byw + diwe = (n — Fb)or + awe; phi = (n — 9dr + ale 
and 
(po + $b — n) w = dwn, (0 + $b — n)fi = afe. 


From N(xa, + yor) = N(Xf(z, y) it follows that N(@.) = aN(2). Simi- 
larly N(¢:) = aN(%). Then N(w,) N(f,:) > 0. We have 
Law, = [aw, ws] w; = [a, p + $b — 9] N(w). 


Similarly, Z:a¢; = [a, p + 4b — ny] N(Si). Then Law, N(o1) = Sat; N(w), and 
f is equivalent to &. 
Conversely, suppose %¢ = &1:, and N(é)N(é:) > 0. Wehave LE = [wit, wet] = 
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[f1£:, f2&:] where the w’s and ¢’s form proper bases of & and %, respectively. 
Since we may assume that ¢ is a rational integer and N(é) > 0, 


wiE = E(9u + gi2p), 
wet = £(goi + go2p), 
and the first basis of ££is proper. Moreover, iff; = hia + Asp (¢ = 1, 2), by (1c) 
Siti = (hun + Aisp)ix, = Saki = (Aer + Aevp) ir, 


and it will be found that these elements form a proper basis of %¢. Hence 
wit = (tati + tite)i: (¢ = 1, 2), where the ?’s are in G and |t;;| = 1. Let 
f(x, y) be the form in C corresponding to the proper basis w, we of %, and let 
Si(ai, y:) be the form in C; corresponding to the proper basis {1, 2 of %&. Then 
N(Q)N(E)f(z, y) = N [ (tur + tery) Sb: + (trae + teey) Fob] = ND A(t, ys) N (Ei). 
But N(ON(E) = N(LE) = N(GE) = N(VDN(E) ¥ 0. Therefore f(x,y) = 
Silas, Y1) and C = Ci. 


4. The correspondence between classes of ideals and classes of forms. We 
shall prove 

THEOREM 1. There is a one-to-one correspondence between the classes of ideals 
in ® and those classes of hermitian forms (7) of discriminant 2a which represent 
positive integers. 

By Lemma 3, for every class of ideals there is a uniquely determined class of 
forms. Also, no class of forms corresponds to two classes of ideals. To prove 
the above theorem, it is therefore sufficient to show that (a) if C is a class of 
forms corresponding to a class of ideals, then C contains a form which represents 
a positive integer and is of discriminant 2a, (b) every class of forms of discrimi- 
nant 2a which represents a positive integer corresponds to a class of ideals in @. 

From Lemma 2 every class of ideals contains an ideal £ = [a, b + p], where 
a is a positive odd rational integer and 6b is in G. The indicated basis of 
£ is proper, N(%) = a. From (8) and (2) we get bu = —b, be = a, bn = 
[—e+ (b — b’) n — b(b’ + 1)]/a = —c, bee = b’+ 1. Substituting these values 
in (9) we obtain f(z, y) = ara’ + [(2b + 2n)/2] x’y + [(2b’ + 2n’)/2] zy’ + cyy’. 
Then f represents the positive integer a, the discriminant of f is 2a and the class 
containing f corresponds to the class containing %. This proves (a). Let C 
be a class of forms of discriminant 2a which represent a positive integer a; and 
let f of (7) beaformin C. Then for properly chosen z, y, which we may assume 
are relatively prime, f(21, y1) = a1. Then fis equivalent to a form with leading 
coefficient a;. Hence we may assumea> 0. Since bb’ = 4ac + 2a = 2 (mod 4), 
b, = (b — 2n)/2 isin G. Then it may be shown that there is an ideal & = 
[a, b} + p] which correspondstof. If ¥ = ax + y(bi + p) is the general element 
of 2, N(¥) = af(z, y). Since a > 0, the above basis of 2 is proper and C is the 
class corresponding to the class of ideals containing &. 
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5. A class of algebras in which every ideal is principal. A principal ideal 
{mn} is the set of all elements &y, where £ ranges over G and 7 is a fixed element 
in ©. 

We shall show that if a > 0, every ideal in G is principal. Since a contains 
no square factors, f of (7) is a primitive form. If in addition a > 0, f is an 
indefinite form, and it may be shown that a, c, b;, and be are all odd. We shall 
prove 

THEOREM 2. Every indefinite primitive form f = arx’ + }bx'y + 3b’ry’ + cyy’ 
represents 1. 

We may assume a > 0,c < 0,aprimetoc. Forsuppose 0 < a S ¢, (a,c) = d. 
Then the transformation zx = 2, + ky, y = y: carries f into f; = ax, + 
4Beriy, + 3B’ry; + Cyyi, where B = 2ak + b, and if we set k = ki + kei, b = 
bi + bet, then C = a(k? + k3) + biki + boke + ¢. By proper choice of ki, ke, 
the number bk; + bek2 + c may be made prime to a, while | 2ak,; + b:| S a, 
| 2ake + be| S a. Then C is prime toa. If we set B = B, 4+ Bei and note 
that | B;| = | 2ak; + b, |, then the discriminant of f is BB’ — 4ac = Bj + 
B? — 4aC = 2a > 0. But B? + B} < 2a? < 4a’, so that Bi + B} — 4a? < 0. 
Hence C < a. If C > 0 we may set zr = —y:, y = 2%, and then repeat this 
process. We eventually get a form f of type (7) with a > 0,c < 0 and a 
prime to c. 

The transformation x = 2 (nz + it), y = u + imt carries f into 


o(z, t, u) = H2 + 4af? + cu? + 2bynzu + Whymzt + hetu, 


a ternary form of determinant D = —2aH, where H = 4an? + cm? — 2bemn. 
H is an indefinite primitive binary quadratic form and for proper choice of m, n 
represents k, the negative of a prime, prime to 2a. Then ¢ is a primitive 
indefinite form. The adjoint ® of ¢ is a ternary form of determinant D*. 
Since D contains no square factors, ® is a primitive form. The coefficient of 
2 in ® is 4ac — bj, an odd negative number. Hence ¢ and ® are both properly 
primitive indefinite forms. Then # is the negative of the reciprocal of ¢, and 
the invariants are’ Q = —1,4 = D. Since D = 2 or 6 (mod 8), a binary quad- 
ratic of determinant D has h + 1 characters where h is the number of odd prime 
divisors of D. Hence f represents 1.’ 

Therefore every form (7) is equivalent to a form with leading coefficient 
unity. To every such form by the second part of Theorem 1 corresponds the 
principal ideal [1, b; + p] = {1}. Hence every class of ideals is equivalent to 
{1}. Since it may be shown that every ideal equivalent to a principal ideal is 
itself principal, we conclude that every ideal in G is principal. 


6. Existence of a g.c.r.d. and factorization of elements of G. Employ the 
definitions of unit and greatest common right divisor given by Latimer.® 
* Dickson, Studies in the Theory of Numbers, p. 10. 


7 Dickson, loc. cit., p. 63, Theorems 52, 54 (with m = 1). 
8 Latimer, ioc. cit. 
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We have proved the first sentence in 

THEOREM 3. Jf a > 0, every ideal in G is principal. Let Xd, u be elements in G, 
\ # 0; then dX, wu have a g.c.r.d. 5, which is uniquely determined apart from a unit 
left factor, and 6 = —\ + nu where =, n arein @. If d has no rational prime factor 
and N(A) = + pipe--- pr, where the p’s are rational primes arranged in an 
arbitrary but fixed order, then \ = myr2 --- 7,, where N(xi) = + pi and each x; 
is uniquely determined apart from a unit left factor. 

The proof of the remaining part of this theorem may be made word for word 
as in Latimer’s proof? of his Theorem 4 starting from the point “--- % is a prin- 
cipal ideal {A, uw} = {6} --- ”’, except that the phrase “‘non-singular” is to be 
replaced by “not equal to zero”’. 

Since every ideal is equivalent to {1} = [1, p] for the case where a > 0, it 
follows that every form f of (7) is equivalent to N(x + py) = F(z, y). Sinee F 
obviously represents 2, to show that every such form is universal it is sufficient 
to show that F represents —1 and every odd prime. It may be shown that F 
represents —1 by an argument similar to that used to show that it represents 1. 
It is well known that if p is an odd prime, there are rational integers a, b such 
that a? ++ Bb? —a=O(modp). Ifweset u = a+ bi + j, it may then be shown 
as in the proof of Theorem 3 that the set of all elements &p + nu, where &, 7 
range over G, form an ideal {5}, where N(6) = + p. Hence every such form 
f is universal. 

We have seen that there is a single class of left ideals in G, ifa > 0. Bya 
result due to Brandt” it follows that there is a single class of left ideals for every 
set G, of integral elements in AX. We may then deduce the same results on the 
existence of a g.c.d. and factorization for any set @, in % of integral elements as 
for the G treated here. 


UNIVERSITY OF KENTUCKY. 


® Latimer, loc. cit. 
10 Brandt, [dealtheorie in Quaternionenalgebren, Mathematische Annalen, vol. 99 (1928), 
p. 23. 














CONVERGENCE OF SEQUENCES OF POSITIVE LINEAR FUNCTIONAL 
OPERATIONS 


By R. P. BatLey 


Introduction. L. Fejér has recently called attention to the importance, 
in certain convergence problems of analysis, of a class of functionals to which 
he gives the name positive operations.' By his definition (loc. cit., p. 523), a 
functional U(x) defined over a set of functions {x(t)}, real-valued throughout a 
certain fundamental range a < t S&S b, is said to be positive, provided U(x) = 0 
whenever x(t) = 0 in (a, b). Sequences of operations of this kind often arise 
in the singular integral theory, in interpolation and in the theory of mechanical 
quadratures; in certain particular cases their convergence properties have been 
the object of much investigation. In his classical paper of 1909, Lebesgue gave 
the sequences of positive functionals which occur in the singular integral theory 
a special treatment, emphasizing again and again the simplicity of the reasoning 
involved, and their comparatively wide convergence properties.2, At various 
times, many other writers have pointed out simplifications in a general theory 
which result from the hypothesis that certain sequences of functionals involved 
have the positive property. 

It is our purpose, in the first part of this paper, to apply to the special case 
of the positive operations the very general theory which Hahn,’ Banach‘ and 
others have developed for convergence problems involving sequences of linear 
functionals, with the object in view of establishing a set of theorems from which 
the particular theorems of Fejér, Lebesgue and others in the singular integral 
theory, mechanical quadratures and interpolation will be a matter of direct 
inference. The main discussion is divided into three parts. We take up first 
the question of the convergence of a sequence of positive linear functionals 
{U,(x)} (n = 1, 2, --- ) to the value of the function z(t)(n — ©) at a certain 
fixed point ¢ = + of the fundamental interval. Sequences of this kind are 
familiar in interpolation and in the singular integral theory. In the second 


Received September 26, 1935. The author wishes to express his gratitude to Professor 
J. A. Shohat for many valuable suggestions which have aided in the preparation of this 
paper. 

1L. Fejér, On the infinite sequences arising in the theories of harmonic analysis, interpola- 
tion and mechanical quadratures, Bulletin of the American Mathematical Society, vol. 39 
(1933), pp. 521-534. 

2H. Lebesgue, Sur les intégrales singulitres, Annales de Toulouse, (3), vol. 1(1909), 
pp. 25-117. 

*H. Hahn, Uber Folgen linearer Operationen, Monatshefte fiir Mathematik und Physik, 
vol. 32 (1922), pp. 3-88. 

4S. Banach, Théorie des Opérations Linéaires, Monografje Matematyczne, Warsaw, 
1932, pp. 122-130. 
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part we consider the convergence of the above sequence to a limit functional 
having the absolute continuity property of the Riemann integral and make 
applications to the problem of mechanical quadratures. In the third part we 
discuss an extension of the property of positiveness to more general operations, 
together with certain results on uniform convergence which can be obtained in 
this way. 

The last section is devoted to a proof of the fact that one of the most im- 
portant, although very special, types of positive operation, namely, the formulas 
of mechanical quadratures of Gauss’ type, can have equal Coétes coefficients 
only in the well-known case of the trigonometric polynomials. 


1. Definition and properties of positive linear functional operations. Se- 
quences of positive functionals. Consider the function-space G whose elements 
{x(t)} are the set of real-valued bounded functions of a real variable ¢ defined 
over a closed finite interval’ a < ¢ < b. Two points of @ are considered 
distinct if the corresponding functions differ at least at one point of (a, b). 
With the usual definitions of sum and scalar product, this set constitutes a 
linear or vectorial space. The zero element z = 9 is the point corresponding 
to the identically vanishing function z(t) = 0. The point corresponding to the 
function z(t) = 1 we denote by x = J. The distance (x, y) between two ar- 
bitrary points x and y of G we define to be L.U.B. | z(t) — y(t) |. With this 


astsb 
distance-function, G becomes a metric space. It may be considered normal if 
we define the norm || z || of a point z of G to be the non-negative number (z, 0). 
If x and y belong to G and z(t) = y(t), we write x = y. 


A sequence of points {z,}(m = 1, 2, --- ) of Gis said to converge to the point 
x of G, provided lim || zx, — xo|| = 0. This clearly requires the uniform con- 
no 


vergence of the sequence of functions {z,(t)} to xo(t). 
A functional operation U(x) defined over a vectorial subset EF of G associates 
with each z of Z a real number U(x). This operation is said to be linear, if 


(i) U(ai + ae) = U(r) + Uae) (1, 22 C E), 
(ii) lim U(2,) = U(a), whenever ||2z, — 2|| ~0 ({xn},x20 C EB); 


that is, a linear operation is additive and continuous. Every linear operation is 
homogeneous, i.e., for an arbitrary real constant c and every xz of E, U(cx) = 
cU(zx). Every linear operation is necessarily bownded, i.e., there exists a positive 
number |U |, = L.U.B. | U(2) |, called the norm of the operation U over E, 


zCeé,||z|| <1 
such that | U(x) | < | U |e||2|| for every x of Z. As was stated above, U 
is said to be positive over E, if U(x) =O wheneverz 29. It follows at once, in 
virtue of (i), that here x; = x2 implies U(x1) = U(a:). 


5 This interval will be understood to be fixed throughout the paper unless the contrary is 
explicitly stated. In terminology, definitions and notations we follow S. Banach (loc. cit.). 
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The norm of a positive linear functional frequently has a very simple ex- 
pression, as we show immediately. 

Lemma 1. [If the linear functional operation U(x) is positive over a vectorial 
subset E of G which contains the point x = I, then| U |x = U(J). 

Evidently if || z|| < 1, —IJ < x Ss I, and therefore, since U(z) is positive 


over E, —U(I) Ss U(x) Ss U(J) and | U(x) | S U(I); whence 
[\U\e = L.U.B. | U(z)| = UW). 


zCe,|z\| <1 
We shall need this result later on. 

We now turn to the convergence of sequences of functionals. In general, 
for the convergence of a sequence {U,(x)}(n = 1, 2, --- ) of linear functionals 
over a space E, it is sufficient to know (i) that it converges over some subspace 
H of E which is dense in £, and (ii) that the set of norms {| U, |g} is bounded.* 
However, in the case of positive operations, these two hypotheses are sometimes 
redundant. Under certain conditions, (i) implies (ii), and hence (i) alone is 
sufficient. This point is brought out by 

TuHeoreM 1. Let E be a vectorial subset of G which contains an element of 
positive lower bound in (a, b). If a sequence {U,(x)}(n = 1, 2, --- ) of positive 
linear functionals defined over E converges over a subset H dense in E to a linear’ 
functional U(x) defined over E, then lim U,(x) = U(x) for every x of E. 


It suffices to show that the set of norms { | U, |z}(n = 1, 2, --- ) is bounded. 
Since E contains an element of positive lower bound in (a, b), and H is dense in 
E, then H necessarily contains a positive element bounded away from zero, say 
&(t) [p> 0. The function n(¢) = p~'é(t) now has the property that n(¢) = 1, 
whence —y S zx S 7 for all z of E such that ||z|| < 1. It follows, since the 
operations involved are positive, that —U,(n) S U,(z) S Un(n) and| U,(z)| S 
U,(n) (\|z|| S 1; = 1, 2,---), whence | U,|s S Un(n) (n = 1, 2, --- ). 
This shows that the set of norms {| U, |z} is bounded, since the sequence con- 
verges for z = £, and therefore for z = 7. 

That some restriction of the kind we have made on the subset E is necessary 
for the validity of the theorem can be seen from the following example. The 
functional 





represents the arithmetic mean of the first n partial sums, at the point ¢ = 0, 
of the Fourier series associated with the integrable function x(t), of period 27. 
In particular, 


* Banach, loc. cit., p. 123. 
7 Under very general conditions, the limit functional of a convergent sequence of linear 
functionals is necessarily linear itself. Cf. Banach, loc. cit., p. 122. 
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nt 
1 Qn aad | 
(1) ll) = 5 [ dt = 1 (n = 1,2, ---). 
2nxr 0 : t 
=. 


Suppose now that our fundamental interval (a, b) is (0, 27). Let E be the 
set of continuous periodic functions {x(t)} such that z(0) = 0, and let H denote 
k 


that subset of E which consists of the trigonometric sums > a; cos it + 0, sin it 
t=0 

(k = 0, 1, 2, --- ) which are zero at t = 0. H is dense in E by the Weierstrass 

theorem. Nevertheless, it can be shown that the sequence of positive linear 


functionals 
(2) U,(z) = nto,(z) ae 1, ---) 


converges to the linear functional U(x) = x(0) = 0 over H without converging at 
all points of E. In fact, if x belongs to H, since the ordinary partial sums 
S,(x) of the first 2n + 1 terms of the Fourier series associated with a trigono- 
metric sum of order k are identical with the trigonometric sum itself, for n = k, 
we readily conclude that in this case*® 
lim n'-*o,(r) = 0 
for any e > 0, and in particular for e = 3. This shows that the sequence (2) 
converges over H. However, by (1), 





and therefore the sequence of norms {| U,, |z} is unbounded. Since E is complete 
in this case, we may conclude that there exists a point z of EH — H at which 
the sequence cannot converge to the value z(0) = 0, for it is well known that a 
sequence of linear functionals cannot converge over a complete vectorial space 
unless the norms are bounded in their set. Theorem 1 does not apply here, 
since £ does not contain an element bounded away from zero in (0, 27). 

In what follows we shall find it convenient to refer to certain particular sub- 
sets of G as follows: P = the set of all polynomials of degrees 0, 1, 2, --- , with 

k 
real coefficients; T = the set of all trigonometric sums >> a; cos it + 0; sin it 
1=0 

(k = 0,1, 2,---) with real coefficients; C = the set of all continuous func- 
tions; R = the set of all bounded R-integrable functions; L = the set of all 
bounded L-integrable functions; S = the set of step-functions with a finite 

8 n'-*o, = n-*[So + Si Se a S,-1] = n-* [So + Si + eee + Sia] = o(1), since 
Sk = Sey = +++ = San = 0. 

® Banach, loc. cit., p. 80, Th. 5. 
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number (2 0) of steps; K = the set of all functions of bounded variation. It 
should be remarked that the same metric (that of G) is used throughout. 

If E is an arbitrary subset of G, the symbols #, and E, will be used to denote 
respectively the set of functions {x(t)} of E which are continuous at a certain 
fixed point t = + of (a, b), and those functions of EF which are periodic, of period 
2r. Thesymbol E,, will denote the set of functions of Z having both properties. 

With these notations, the subsets H and E of Theorem 1 may be taken to be 
P and C respectively, for C is evidently a vectorial subset of G containing posi- 
tive elements bounded away from zero, in which the polynomials are dense by 
the Weierstrass theorem. Similarly, we might take H = T and E = C,, or 
H = Sand E = K. 


2. The limit-functional U(x) = x(r). When the convergence of a sequence 
of positive linear functionals {U,(x)} over a subset H dense in a vectorial sub- 
set E of G is known, Theorem 1 will, in general, enable us to draw conclusions 
about the convergence of the sequence at other points of E. However, in the 
particular case where the limit-functional is U(x) = z(r), r being some fixed 
point of (a, b), conclusions can be drawn about the convergence of the sequence 
over certain subsets of G from much weaker hypotheses. Theorem 2, below, 
illustrates this point. We first establish 

Lemma 2. Let u, | denote, respectively, the upper and lower bounds of the 
bounded function x(t) at the point t = r. Then there exist continuous functions 
f(t) and &2(t) such that (i) & S xz S &:, (ii) H(r) = I, (7) = u. 

Let us construct, for example, the function f. We can suppose that u < 
\| x ||; if wu = || 2 ||, evidently &(t) = wis the function required. Let 5 > 0 be 
so chosen that L.U.B. z(t) < || ||, and denote by u, (k = 0,1, 2, --- ) the 


r—8StSr+s 
least upper bound of z(t) in the interval (r — 6/2*, r + 6/2*). Clearly uw =u 2 
U, 2--- >u, =--- ; limu, = u. Denote by Pi, Q; respectively the points 


ko 
whose coérdinates are (r — 6/2**', ux), (r + 6/2**', ux) and by P the point 
whose coérdinates are (7, u). Evidently lim P, = lim Q; = P; hence the 


k—- ke 
polygonal line joining the points (a, || z ||), (r — 4, || 2||), Po, Pi, Pe, --- in 
succession from the left, and the points (b, || z ||), (r + 4, || x |]), Qo, Qi, Qe, --- 
in succession from the right, is continuous throughout (a, 6). It defines a 
continuous function £2(¢) which evidently meets the requirements of Lemma 2. 
The function £,(¢) can be constructed in a similar manner. It should be noted 
that if x(t) is continuous at t = 1, then &(7) = &(r) = 2(r). 

THEeorREM 2. Let E be a vectorial subset of G containing C(C,). If a sequence 
{U,(x)}(n = 1, 2,---) of positive linear functionals defined over E converges 
over a subset H dense in C(C,) to the value x(r) [r a fixed point of (a, b)], then 
lim U,(x) = 2(r) for every x of E,(E,,). 


no 


Consider the first case; let x be any point of Z. Since z(t) is continuous at 
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t = 7, we may conclude (by Lemma 2) that there exist functions £,(t), &(t), 
belonging to C, such that 


(3) &S2zs é, 

(4) f(r) = 2(r) = &2(r). 

Since the operations {U,(zx)} are positive over E, (3) implies that 

(5) U,(&) S U,n(z) S UnlE2) (n = 1,2,---). 


By Theorem 1 both lim U,(é) = &(r) and lim U,(&) = £&(7) exist, and hence 


by (5) &(r) < lim U,(x) S £&(r). This proves our statement, in virtue of (4). 


When C and E, are replaced by C, and E,,, respectively, £ and & must be so 
chosen as to be periodic, of period 27, as well as continuous. Though Lemma 2 
does not affirm the existence of such functions, it is clear that methods analogous 
to those used in its proof will furnish the required construction, if we use the 
fact that x(t) (belonging to E,,) is now periodic, of period 27, as well as con- 
tinuous at t = 

The question naturally arises whether a sequence of positive linear func- 
tionals {U,(x)} satisfying the hypotheses of Theorem 2 will not further con- 
verge to the value 3[z(r + 0) + x(7r — 0)] for those functions {2(t)} of EZ for 
which this functional is defined. This cannot be the case, in general, as one 
sees easily by considering the special sequence 


(6) U,(z) = x(r + 0) (n = 1,2,---). 


The sequence (6) is defined, for instance, over K, and converges to the value 
z(r) over K, by identity, but cannot converge to the value }[z(r + 0) + 2(7 — 0)] 
at any point of K where 2(r + 0) ¥ 2(7r — 0). 

The following theorem gives a sufficient condition for convergence of the 
type described above. 

THEOREM ws 4 E be a vectorial subset of G containing K. If a sequence 
{U,(z)}(n = - ) of positive linear functionals defined over E converges 
over S to the we ei, + 0) + 2(7 — 0)], then lim im U,(z) = 3[x(r + 0) + x(7r — 0)] 


for every x of E which has a discontinuity of the first kind att = 

The proof follows the same lines as that of Theorem 2. Since S is dense in K, 
Theorem 1 assures the convergence of the sequence to the required value over K. 
We now have only to construct, by the methods of Lemma 2, two functions 
fi, & of K such that & < x < & and &(r + 0) = &(7 + 0) = 2(7 + 0), 
£i(7 — 0) = f(r — 0) = 2(r — 0). The convergence of the sequence for the 
element x (of which we assume only that it has at most a discontinuity of the 
first kind at t = 7) follows as in Theorem 2. 

We may mention in passing an obvious extension of the theorems of this 
section which can be proved by means of Lemma 2. 

TuHeoreM 4. Let uz, 1, denote respectively the upper and lower bounds of x(t) 
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at the pointt = r. If a sequence {U,(x)}(n = 1, 2, --- ) of positive linear func- 
tionals defined over a vectorial subset E of G which contains | C converges over a subset 
H dense in C to the value x(r), then 1, < lim U,(x) < lim U,(x) S uz at every 


point of E. 
In Theorem 2 we may set E = L, H = S, (the set of step-functions with a 
finite number of steps which are continuous at t = 7), and 


U,(z) = [ r(t)y(t oe By n) dt, 


where, for all n = 1, 2, --- , g(a, m) is a bounded, non-negative, L-integrable 
function of a. These substitutions are admissible, and our conclusion in 
this special case is that the convergence of the given sequence (a so-called 
“singular” integral) to the value z(r) over S, implies convergence to the value 
z(r) for every bounded L-integrable function z(t) continuous at t = r. This 
result was obtained for the first time by Lebesgue.” 

The choice of the set H of Theorem 2 depends, in general, upon the nature 
of the functionals {U,(z)}. For instance, in discussing the convergence proper- 
ties of Fejér’s integral, we might take H = T rather than H = S, (as in the 
theorem of Lebesgue), for the convergence of the Fejér sequence over T follows 
directly from its definition, while the question of its convergence over S, requires 
further investigation. 

In a similar manner, Theorem 3 and Theorem 4 can be interpreted so as to 
give certain results of Lebesgue and others on the behavior of singular integrals 
for functions discontinuous" at ¢ = r. 

The following is an application of Theorem 2 in a situation where the singular 
integral theory cannot be used directly, i.e., interpolation." Let 


ti 


tie tee 
his tes ts3 
lin lon tan shales ban 


10 Lebesgue, loc. cit., p. 75. The hypothesis of Lebesgue requires the convergence of 
the given sequence of integrals over what is known as the base of Hamel of the set S; (i.e., a 
linearly independent subset of S; upon which every other element of S, is dependent) and 
therefore is equivalent to an assumption of convergence over S, itself. In the same way, 
instead of assuming that a certain sequence of linear functionals converges for all poly- 
nomials, we may assume convergence only over the set 1, ¢, t?, --- ; the two assumptions 
are equivalent, since each is an immediate inference of the other. 

11 Lebesgue, loc. cit., p. 78. See also Hobson, Theory of the Functions of a Real Variable, 
vol. 2(1926), p. 456, where a special case of Theorem 4 is discussed. 

12 H. Hahn, Uber Interpolation, Mathematische Zeitschrift, vol. 1 (1918), pp. 115-43, first 
suggested an attack on the general problem of interpolation from the viewpoint of the 
theory of linear operations; certain of the results of his paper Uber Folgen linearer Opera- 
tionen (loc. cit.) can be applied to the problem of interpolation to continuous functions. 
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be a triangular sequence of real numbers such that a S tin < ton < --+ < tna S 
b(n = 1, 2, --- ), and let {hi(t)} (@ = 1, 2,---,n;n = 1,2, --- ) be a set of 
continuous functions defined on (a, 6), having the property hi, (tin) = dix, where 


5is the Kronecker delta. The expression 


(7) H(t) = D0 x(tin) hin(t) (n = 1,2, ---) 
t=1 

can then be considered a formula of interpolation to the function z(é) in the 
interval (a, b). Let us suppose further that the fundamental functions {h;,(t)} 
are positive’ throughout (a, b). Theorem 2 enables us to conclude at once 
that the convergence of the sequence of positive linear functionals U,(r) = 
H,(r) (n = 1, 2, --- ) to the value z(r) over P or T will imply the convergence 
of the given sequence over G, or G,,. Hence the formula (7) converges, in this 
case, to the value of the function z(t) at every point of continuity. Once more 
we have an illustration of the importance of the very general theorems of this 
section: the convergence properties of the formula (7) follow directly from 
Theorem 2 without any further considerations. 


3. Absolutely continuous limit-functionals. We now pass to considerations 
which have important applications in the theory of mechanical quadratures. 
In this section attention will be confined to the set R of bounded functions 
integrable in the sense of Riemann. We introduce first some notations which 
materially simplify the discussion. Let J denote a finite set of disjoint intervals 
contained in (a, b), and J’ its complement. With each z of R associate an 
element 2x, of R as follows: x,(t) = x(t) or 0 according as t belongs to J or J’. 
With each operation U(x) defined over R associate an operation U’(xz) as 
follows: U7(x) = U(az,). Clearly, if U(x) is a linear functional over R, U%(z) is 
also; if U(x) is positive over R, U’(x) has the same property. Evidently U(r) = 
U?(x) + U*'(x) for every z of R. 

The following is a generalization of theorems due to L. Fejér“ and G. Pélya. 

TuHeoreM 5. Let a(t) be absolutely continuous on (a,b). If a sequence {U,(zx)} 
(n = 1, 2, --- ) of positive linear functionals defined over R converges over P to the 

b 
functional U(x) = / x(t) da(t), then lim U,(x) = U(z) for every x of R. 


a 


13 Interpolation formulas of this type have been studied by L. Fejér, Uber Interpolation, 
Géttinger Nachrichten, vol. (1916), pp. 66-91, and D. Jackson, The Theory of Approxima- 
tion, Colloquium Publications of American Mathematical Society, vol. 11 (1930), pp. 142- 
148. See also in this connection J. Shohat, On interpolation, Annals of Mathematics, 
vol. 34 (1933), pp. 130-146, and G. Szegé, Interpolationspolynome, Mathematische Zeit- 
schrift, vol. 35 (1932), pp. 579-602. 

41. Fejér, Mechanische Quadraturen mit positiven Céiesschen Zahlen, Mathematische 
Zeitschrift, vol. 37 (1933), pp. 287-309. 

5G. Pélya, Uber die Konvergenz von Quadraturverfahren, Mathematische Zeitschrift, 
vol. 37 (1933), pp. 264-286. 

16 The integral is to be taken in the Riemann-Stieltjes sense. It is defined over R, 
since a(t) is absolutely continuous. 
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Under our hypothesis, and by Theorem 1, lim U,(x) = U(z) for every con- 


tinuous function z(t). Hence U(z) is necessarily positive over C, being the 
limit, when z = 0, of a sequence of non-negative numbers. We readily conclude 
that a(t) is monotone non-decreasing,’” and from this it follows that U(x) is 
positive over R. 

Suppose now J is any finite set of disjoint closed intervals. We will show that 
lim UZ(I) = U%(1). Evidently, it will be sufficient to consider the case where J 


consists of a single closed interval (A, u), wherea <4 < yx <b. The proof can 
then be extended to the most general case by the use of the homogeneous- 
additive property of the linear functional. Let z(¢) =1 or 0 according as ¢ 
belongs to J or J’.. Then U,(z) = UZ(I) (n = 1,2, ---). We will show that 
lim U,(zx) exists and has the value U(x) = U’(I). Let 6 > 0 be taken so small 


that a fA —6,A1+6<yw—6 andyw+s6 <b. Define auxiliary func- 
tions y(t) and z(t), belonging to C, as follows: 


0 astsZiA-6 0 astzav 

— A Stu _ A+6sSt5yp-—6 
M-%  sectics “te. a6tKd 
linear elsewhere, linear elsewhere. 


Now, y 2 z =z. Therefore, since the operations {U,(x)} and the operation 
U(z) are positive over R, 


(8) U,(y) 2 U,(x) 2 U,(z) (n = 1,2,---), 
(9) U(y) 2 U(x) 2 Ue). 

From (8), U(y) = lim U,(z) = U(z), for y and z are elements of C, whence 
lim U,(y) = U(y) and lim U,(z) = U(z). Finally, since 


no 


| U(y) — U@)| = 





[ [y(t) — 2(t)] da(t) | S 2V;, 


where V; represents the total variation of a(t) over the intervals (A — 6, A + 6) 
and (u — 4, » + 4), and V; is known to approach zero with 6, we conclude that 
lim U,(z) exists, and by (9) lim U,(x) = U(x). This proves our statement. 


With this, we are in a position to establish the main theorem. Let x now be 
any element of R. Being given e > 0, by the criterion for Riemann integrability, 
it is possible to fit a net D on the interval (a, b) such that the sum of the lengths 
of those intervals of D is < «, in which the fluctuation of z(t) is 2 «. Let &(d) 
be that continuous function which coincides with z(t) at the end points of the 
intervals of D and is linear in the interior of each of these intervals. If we 


17 Tf we assume the contrary, there is no difficulty in constructing a non-negative con- 


b 
tinuous function z(t) such that I x(t) da(t) < 0. 
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denote by J the interval-set, throughout which the fluctuation of x(t) is 2 e«, 
evidently | x(t) — &(t) | < « for all values of ¢ in the intervals J’ complementary 
toJ. Wemay take J to be closed. By our construction m(/J) < «. Now 


| U(z) — U,(z) | =| Un(E — x) + Ue — £) + UE) — UE) | 


Using the fact that | UZ |, = UZ(I) (by Lemma 1), and since |x — | S$ 2 !/z/), 
we conclude in succession, 


| UE — x)| S| UE — 2) | + 1 Ual(E — 2) | 
S2\r|Urle+elU,le = 2 2{! UU) + UD), 
|U@—- | s|U@-)|+|Ule—- rl | 
S2||r\||U" le+e|U le = 2|\ 2) UU) + UU); 
and since £ belonging to C implies lim | U(£) — U,(&) | = 0, evidentiy 


no 


lA 


(10) lim | U(x) — U,(x)| S$ 4\\x|| UU) + 2), 


for we have shown that lim UZ(J) = U“(I). Since U%(I) = [eat represents 


n—20 J 
simply the variation of the absolutely continuous function a(t) over the set J, 
and m(J) < «, it is evident that the quantity on the right hand side of (10) can 
be made arbitrarily small by choosing « sufficiently small, and Theorem 5 is 
established. 

Examination of the proof of Theorem 5 will show that the only property of the 
limit-functional U(x) really needed in the demonstration (in addition to linear- 
ity) is the absolute continuity property which it has in common with the ordinary 
Riemann integral. This property may be characterized as follows: a linear 
functional U(x), defined over R, will be said to be absolutely continuous, provided 
that for every x of R, « > 0 arbitrarily given implies the existence of a positive 
number 6 = 4(e, x) such that if J is any finite set of disjoint intervals contained 
in (a, b), and m(J) < 6, then® | U7(r) | < «. The question arises whether we 

b 


cannot obtain greater generality by replacing x(t) da(t) with a more general 


absolutely continuous limit-functional. The fact is, every linear functional 


U(x) defined over R which possesses the absolute continuity property above described 
b 


is necessarily of the form [ x(t) da(t), where a(t) is an absolutely continuous 


function. This statement may be proved as follows. With the methods 
which Banach (loc. cit., p. 59) has used to establish the well-known theorem of 


18 In effect, we have called U(x) absolutely continuous if, for fixed z, U7(z) is an abso- 
lutely continuous interval-set function (J) in the sense of de La Vallée-Poussin, Inté- 
grales de Lebesgue, Borel Collection, 1916, p. 57. 
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F. Riesz on the general form of linear functionals defined over C-space, there 


is no difficulty in showing that every absolutely continuous linear functional 
b 


U(x) defined over R is of the form | r(t)da(t) (a(t) absolutely continuous) 


over C-space. We have to show that the given expression is the only extension 
of this functional to R-space which has the required property. For this we need 

Lemma 3. If a linear functional U(x) has the absolute continuity property 
over R, | U! |p > 0 uniformly with m(J) (i.e., U(x) has this property uniformly 
over every bounded subset of R). 

We wish to show that « > 0 being given arbitrarily there exists a 6 = 4(e), 
such that if J is any finite set of disjoint intervals, and m(J/) < 4, then | U"(x)| < «€ 
for every x of R with ||z || S 1. Assume the contrary. Then there exists a 
p > 0, such that no matter how small 6 > 0 be taken, there is an z = 2; (of 
norm $1) and aset J = J;, with m(J) < 6, such that U’(x) = p. That is, 


there exists a sequence of elements {z,} (n = 1,2, --- ) of Rof norm S1, anda 
sequence of sets J, (n = 1, 2, --- ) with the property lim m(J,,) = 0, such that 
(11) U’"(x,) = p (n = 1, 2,---). 


Let N be a given positive integer, and let 6, > 0 (k = 1, 2, --- ) be so chosen 
that | U’(z,) | < 1/2* for every J such that m(J) < 6;. This is possible on 
account of the absolute continuity property of U(x). Since m(J,) approaches 


zero with 1/n, it is possible to select a set of indices n = mj, no, --- , mw such 
that the part which J,, (k = 1, 2, --- , N) has in common with the sets which 
follow (namely, Jn,,,, Jn. *-*» ny) is Of measure < 6,,; let us call the 


sequence of sets obtained by removing this part Li, Le, ---,Ly. It follows 
that no two of the sets {Z,} can have a point in common; each L; is a subset of 
J,,, and differs in measure from that of J,, by less than 6,,. Suppose now 7 is 
that element of R which is zero throughout the complement of (L; + Le + --- 
+ Ly), and in the set L;, (k = 1,2, --- , N) coincides with z,,. We have: 


N N N 
U(n) = >> U**(z,,) = SS U’™(2,,) — SS U2 **(z,,), 
k=1 k=1 k=1 


= Np - a oe 
k=1 2* 
in virtue of (11) and the fact that | U’(z,,) | < 1/2"* for every J such that 
m(J) < 6,,(in particular, J = J,, — Lx); since || » || S 1, and N may be chosen 
arbitrarily large, this contradicts the boundedness of the linear functional U(z), 
and the lemma is proved. 


18 F. Riesz, Annales de |’Ecole Normale Supérieure, (3), vol. 31 (1914), pp. 9-14. This 
theorem has been extended to the class of bounded functions with at most discontinuities of 
the first kind by H. S. Kaltenborn, Bulletin of the American Mathematical Society, vol. 
40(1934), pp. 702-708, and by T. H. Hildebrandt, Transactions of the American Mathe- 
matical Society, vol. 36(1934), pp. 868-875, to the class of all bounded measurable functions. 
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With this lemma, our original statement can easily be proved. Let x be any 
element of R. As we have seen in the proof of Theorem 5, there exists an 
element £ of C, and a finite set J of disjoint intervals with m(J) < «¢, such that 
| a(t) — &(t)| < e(tinc(J) = J’), || 2 —€]| S 2)/2|!. Then, since U(¢) = 

b 


g (t)da(t) , 





U(z) - [ x(t) da(t)| = |U(e — ®) - i fx) — el] dato | 





<|U@-8|+ i “fa — 01 date | 


< 2\|2| Url tel le + 2ileil f date + Ve, 


where U(x — §) = U*(x — £&) + U*'(x — £), ete., and where V, denotes the 
total variation of a(t) over (a,b). Since, by Lemma 3, | U’ |, > 0 uniformly with 
m(J), as e— 0, and since a(t) is absolutely continuous, the right member of this 
inequality can be made arbitrarily small by choosing «€ sufficiently small. 
Hence the left member must be zero, and our statement is proved. 

As an application of Theorem 5, consider the formulas of mechanical quadra- 
tures of Gauss’ type 


[ " x()p(t) dt = 3° Hintlts) + Balt) (n= 1,2, ---), 


where p(t) is non-negative and summable in (a, b), determined by the conditions 
(12) R(t) =0 (k = 0,1,2,---,2n — 1; = 1, 2, ---). 


It can be shown that 
Te) | - 
Hi, = J be = as! p(t) dt (¢=1,2,---,n;n =1,2,.--) 
| eo = (t — tin)(t = ton) “re (t = tan), [ ©, &,, p(t) dt = 0 (m = »|, 


whence” H;, > 0 for all i and n. Putting U,(xz) = DS Hinr(tin), U(x) = 
i=1 
t 


p(t)dt, and applying Theorem 5, we conclude 


/ : z(t)da(t), where a(t) 


that the formulas of Gauss’ type converge for all bounded R-integrable functions 
{a(t)}, since convergence over P is assured by (12). The operations {U,(x)} are 
positive, since H;, > 0 (¢ = 1,2,---,n;n = 1,2,---). 


20 J. Shohat, Théorie générale des polynomes orthogonaux de Tchebichef, Mémorial des 
Sciences Mathématiques, LX VI (1934), p. 15. 
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In the paper referred to at the beginning of this section, Fejér showed that 
any mechanical quadratures formulas of the type 


+1 n 

(13) [ x(t) dt = pe H;,,2(tin) —_ R,(z) (n = 1, 2, +. -) 
—j i=1 

based on the Lagrange interpolation formula (i.e., R,(t*) = 0, k = 0,1,2,---, 

n — 1) must converge for all R-integrable functions z(t) if H;, 2 0. Pélya™ 

has also given theorems from which the same conclusion can be derived. This 

result follows at once from Theorem 5. 


4. Non-functional positive operations. Uniform convergence. We have 
shown, in the preceding pages, how the characterization of a certain type 
of linear functional as positive proves useful in discussing the convergence 
properties of sequences of such functionals; we shall now try to indicate how this 
characterization can be extended to include certain types of linear operations 
which are not necessarily functional. Consider, for example, an operation 
U(zx) defined over a vectorial subset E of G which associates with every z of E an 
element y = U(x) of some subset EZ, of G. Eis called the range or contra-domain 
of the operation U(x). We make the following definition: U(x) is positive over 
E, provided x = 8 implies y = 9. It is clear that we have here a direct extension 
of the definition of positiveness given previously for functionals. We have 
simply replaced the contra-domain r of real numbers by another normal vectorial 
space in which the symbol 2 is defined. ; 

The statements made in §1 concerning the definition and properties of linear 
functionals defined over G apply without exception to linear operations with 
domain £ and contra-domain #;. In general, we use the same notations as in 
the previous case, though it should be noted that here the norm of U(x) (as an 
element of G) must be written |! U(z) ||. An analogue to Lemma 1 follows 
at once. 

Lemma 4. If the linear operation U(x), defined over a vectorial subset E of G 
which contains x = I, and with contra-domain lying in G, is positive over E, then 
|U |x = || UW) |]. 

Exactly the same argument as that given in the proof of Lemma 1 will suffice 
to establish Lemma 4, if we note that in G, as well as with real numbers, 
—x Sy S zimplies || y || < || x}. 

Similarly, Theorem 1 has its analogue, which we can state as follows: 

TueoreM 6. Let E be a vectorial subset of G containing an element of positive 
lower bound in (a, b). If a sequence {U,(x)} (n = 1, 2, --- ) of positive linear 
operations defined over E, and with contra-domain lying in G, converges over a 
subset H dense in E to a linear operation U(x) defined over E, with contra-domain 
in G, then™ lim U,(x) = U(x) for every x of E. 


21 Where previously we dealt with sequences of numbers, here it is a question of the 
convergence of a sequence of functions {U,(x)}. This convergence is uniform, in view of 
the metric of G. 
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Again, the proof follows exactly the same lines as that of Theorem 1. We see 
without difficulty that the sequence converges for an element n(t) of F such that 
—n <2 S nforall z of LE with ||z\|| <1. It follows that | U,\e S || U.(n) |! 
(n = 1,2,---). This proves the theorem, for it shows that the set of norms 
{| Un |e} is bounded. From this follows the convergence of the sequence to 
U(z) at all points” of EZ. 

As an application of Theorem 6, consider the integral of Fejér. Let 





(14) U,(z) = y(t) = a fe x(s) {an ini - Vas (an 2, & «--). 


sin 3(s — t) 


The sequence (14) is a sequence of positive operations defined over C,, [(a, b) = 
(0, 2r)], with contra-domain lying in C,, which converges to the operation 
U(x) = x(t) over T; by Theorem 6 the sequence of functions {y,(t)} converges 
uniformly over (0, 27) to the value z(t) for every function x(t) continuous in 
(0, 27) and of period 27. 

We can prove further that the convergence of the Fejér integral to the value 
x(t) is uniform over any subinterval of (0, 27) in which 2(é) is continuous. Re- 
sults of this kind can evidently be obtained for any singular integral or inter- 
polation formula of positive character. 

It is clear that the idea of positiveness in connection with linear operations 
can be extended to include operations defined over the most general abstract 
space of the normal vectorial type, with elements of any nature whatsoever, 
provided only some convenient meaning be assigned to the symbol = in domain 
and contra-domain. Whether such a classification of linear operations will be 
useful will depend to a great extent upon the type of space. The author hopes, 
at some future date, to extend the idea to the Hilbert spaces, with applications 
to the theory of integral equations in view. 


5. Coincidence of the formulas of mechanical quadratures of Gauss’ type 
and of Tchebichef’s type. In §3 we called attention to the convergence prop- 
erties of certain mechanical quadratures formulas of Gauss’ type. In this 
section we consider a special problem connected with such formulas. Let 
y(t) be a function bounded and non-decreasing, with infinitely many points 
of increase, over an interval (a, 6) finite or infinite, and such that all moments 

h 
a, = imdy(t) (n = 0, 1, 2, ---) exist, with a > 0. It is known that there 


exists an infinite sequence of orthogonal Tchebichef polynomials 


®,(t) = i (t — tin) = 0 + Pinl®* + +++ + Dany 


22 Banach, loc. cit., p. 79. 
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with all roots real, distinct, and lying in (a, b), determined by the conditions” 
b 
i] t'd,()dy(t) = 0 (k = 0,1,2,---,n—1;n=1,2,---). In order to avoid 


a 
at 


trivial equivalences, we assume ¥(a) = 0 and im dy(t) [. dy(t) > 0 for 


h > 0. Under these conditions ~(¢) will be called a davianedatie function in 
(a, b). Two characteristic functions will be considered distinct only if they 
differ at a point of continuity. 


6 
Let x(t) be such that [ x(t)dyp(t) exists in the Riemann-Stieltjes sense. Using 


the Lagrange interpolation polynomial which coincides with z(t) at the points 
{tin} (¢ = 1, 2,---,m), we construct the so-called mechanical quadratures 
formula of Gauss’ type 


b 
(15) [ x(t) dy(t) = > Hin 2(tin) + Ra(x), 


®,(t) dy(t) 
Hin ~ [ (t = tin) ), (tin) 


having the property R,(t*) = 0 (k = 0,1, --- ,2m — 1). If (a, b) is finite, it is 
‘ dt 

well known that for y(é) =| sVGoea® 
trigonometric polynomials cos n arc cos [(a + b — 2t)/(a — b)], the Cétes numbers 
H;,, are equal for each n: H;, = H, (i = 1,2,---,n;n = 1, 2,--- ), and the 
Gauss formula (15) is therefore, in this special case, at the same time of Tchebi- 
chef’s type.** The question naturally arises whether the two types of formulas 
coincide in any other case. We propose to show that the two formulas cannot 
coincide in any other case.** 

Our method is to use the hypothesis of coincidence to derive conditions on the 
moments {ax} (k = 0, 1, 2, --- ) of y(t), and by this means to show that y(t) 
is uniquely determined in the class of admissible characteristic functions. In 








, which gives rise to the so-called 


23 For the basic theory used throughout this section, see J. Shohat, Théorie générale des 
polynomes orthogonaux de Tchebichef, loc. cit. 

24 Mechanical quadratures formulas characterized by the property of possessing equal 
Cétes numbers were first discussed by P. Tchebichef, Journal de Mathématiques, vol. 19 
(1874), pp. 19-34. 

25 The above result was obtained by the author late in 1933 and presented to the Amer- 
ican Mathematical Society in November, 1934. I learned recently from Professor Shohat 
that Professor M. Krawtchouk in June 1934 presented to the All Russian Mathematical 


Congress in Leningrad a similar theorem, but only for the special case y(z) = p(x)dz. 


Cf. M. Krawtchouk, Sur une question algébrique dans le probléme des moments, Journal de 
l'Institut Mathématique de |’ Académie des Sciences de |’Ukraine, vol. 2 (1934), pp. 87-92; 
in Ukrainian. 
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fact, since [®,(s) — ®,(t)]/(s — t) is a polynomial of degree n — 1 in t, if the 
formula (15) is of the Gauss-Tchebichef type, we must have” 








'y ®,,(s) —_ ®,,(t) dy(t) a H > ,(s) = ®, (tin) 
. s—t at 8 — lin 
(16) 
= H.,(8) Dy —— = Hai (3). 


Replacing ®,(s), $,(t), ®,(s) in (16) by their explicit expressions in terms of the 
coefficients {pin} of ®,(t) and performing the indicated operations, we conclude 
immediately that for n = 1, 2,--- , 


nay + Pin = 0, 


(17) Naz + NPine + Zona = 0 


NQpn-1 + NPin An—2 + hac hi + (n — 1) Pr-inQ% = 0. 


To these relations we may add 


(18) Qn + PinG@n-1 + +--+ + Panay = 0 (n 21), 
(19) @ngt + Pin@n + -++ + Prnai = 0 (n = 2), 
in view of the orthogonality properties of ®,(¢). The equations (17), together 
with (18), give the coefficients {px.} (k = 1,2, --- , m) in terms of ap, a1, --+ , On; 
since the determinant of the system is clearly different from zero; therefore, in 
virtue of (19), ani: (n = 2) is determined by apo, ai, ---,a@n. That is, the 
moments {ax} (k = 0, 1, 2,---) are completely determined by ap, a, ae, 


through a recurrence formula which is independent of (a, b) and y(#). 
We have only one more step. There is no loss of generality in assuming 
a = 1; by the Schwarz inequality, it follows that 


(20) ai S a. 


Moreover, (20) is precisely the necessary and sufficient condition for the existence 
of real numbers c, d such that a, = 3(c + d), ag = (ec — d)?/8 + (ce + d)?/4. 
Inasmuch as all the moments {a;} are determined by a; and az, it follows that 


the set {a,} is identical with the set of moments associated with the character- 


= t dt 
istic function Y(t) = / ‘van au. in the finite interval (c, d), for the 


moments of this latter function satisfy the same relations as the set {a,}, and 





26 This interesting formula shows that, when coincidence takes place, the numerators 
{2,(t)} of the successive convergents of the associated continued fraction (Shohat, loc. cit., 
p. 12) are simply the derivatives of the polynomials themselves. J. Sherman, Stieltjes 
continued fractions, Transactions of the American Mathematical Society, vol. 35 (1933), 
pp. 64-87, p. 81 has pointed out that this cannot occur with the classical polynomials of 
Jacobi, Laguerre and Hermite except in the trigonometric case. 
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[ adit) = 1, | tad) = He +a), | Pade) = ce — 4/8 + (0 + O/4, 


can easily be verified. But the set of moments {a,} (k = 0, 1, 2, --- ) associated 
with a characteristic function y(t) in a finite interval (a, b) cannot be eesti by a 
different characteristic function in the same or any other interval, finite or infinite.” 
It follows that coincidence cannot take place in the infinite interval, and that in 
the finite interval (a, b) the known solution o d) = (a, b), 





vi) -[ Vitae i 


is unique. 
UNIVERSITY OF PENNSYLVANIA. 


27 As Prof. Shohat pointed out to the author, this fact is an immediate consequence of 
theorems due to Stieltjes and Carleman on the determinateness of the moment problem. 
Cf. J. Shohat, loc. cit., p. 7. 














THE ZEROS OF JACOBI AND RELATED POLYNOMIALS 
By C. EvGcrene Buetu 


Introduction 


1. Definitions. The ultraspherical polynomials of degree n, P°?(cos #), are 
defined as polynomials not vanishing identically for which the differential equa- 
tion 


(1) y’’ + {(n + A)? + ACL — A) sin “fy = 0 


has the solution y = sin’?-P%?(cos 8). It will also be convenient to consider 
the generating function of these polynomials normalized in a proper way, 
namely, 


“a 


(2) (1 — 2w cos # + w?)> = Zz P™ (cos 8)-w". 
n=0 
The Jacobi polynomials of degree n, P'* *’(cos 8), are defined as polynomials 
not vanishing identically for which the differential equation 


” a+p+1\ ij —-o@ i-? od 
(3) y+ i(n + 2 ) + 4 sin? 0/2 + 4 cos? 6/2 tli 
has the solution y = [sin (8/2)]**! [cos (8/2)]8*+!. P\* *(cos 8). 
The Jacobi polynomials reduce to the ultraspherical polynomials if a = 6 = 
\ — 3. The ultraspherical polynomials reduce to the Legendre polynomials if 
= 34. Concerning further properties of these polynomials, we refer! to [5] and 
[8]. 





2. Previous estimates. For \ > — 3 all of the zeros of the ultraspherical 
polynomials are real. Let #; denote the k-th zero in increasing order, 0 < 3; < 7. 
The following estimates for 3; have been given: 


(1) Bruns [2] for Legendre polynomials: 











k—3 k 
(A) sc" << (hk =m 1,9, ---, @). 
(2) A. Markoff [4] and Stieltjes [7] for Legendre polynomials: 
k—}4 k n 
(B) " "<< oi" («=1,2,-.-,[3]), 


Received December 2, 1935. 
1 Numbers in bold face type refer to the bibliography at the end of this paper. 
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(3) Szegé [9] for0 <A <1: 


‘ k+rA-1 k 
- a: iiaeieg y % iy Wa i& ---2m, 
A-1 
k+ —— 
(C2) 2 k i |) 
es wide de yy fs (« =1,2,--,[2]), 














. jn <—* Tar ner o+/) 
(Cs) Jaailanlone - <a+n (i 1, 2, | 2 , 


where 7; denotes the k-th positive zero of the Bessel function of order \ — 3 
and c = 1 — (2/z)?. 

The purpose of this paper is to obtain estimates for the zeros of ultraspherical 
polynomials for some values of \ < 0 and for \ > 1, and to obtain analogous 
estimates for the zeros of Jacobi polynomials. 


3. Preliminary theorems. In the following, we shall consider the ultra- 
spherical and Jacobi polynomials (apart from a factor) as solutions of an ordinary 
second order differential equation of the form y’’ + ¢(x)y = 0. This will be 
compared with another differential equation of the same form, the zeros of whose 
solution will be considered as known. The following theorems due essentially 
to Sturm will form the basis of this comparison.” 

TuHeoreM 1. Let f(x) and F(z) be two continuous functions on a < x S b such 
that f(x) <= F(x) and f(x) # F(x). Consider the ordinary differential equations 
y"’ + f(x)y = 0, Y"’ + F(x)¥ = 0, and let y(x) be a solution of the first and Y (zx) 
be a solution of the second. Suppose that 


(1) y(z) > Oona < zx < band y(b) = 0, 
(2) lim | {y’(x) ¥(x) — y(x)¥'(x)} 20. 


Then either Y(x) = 0 or there exists a point §,a < & < b, such that Y(t) < 0. 
If (2) is replaced by 


(2’) lim {y'(x)¥(x) — y(x)Y’(x)} s 0, 


z—a+0 
then either Y(x) = 0 or there exists a point §,a < — < b, such that Y() > 0. 
If we replace (2) by 


(2’’) lim {y’(x)Y(x) — y(x)Y"(z)} = 0, 


z—at+0 


then either Y(x) = 0 or there exists a point =, a < — < b, such that Y(z) has a 
variation of sign in &. 


2 Cf. Szegé, [9]. 
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If we omit (2) and replace (1) by 
(1’) y(z7) >0O on a<x<b and y(a) = y(b) = 0, 


then either Y(z) = 0 or there exists a point £, a < — < b, such that Y(x) has 
again a variation of sign in & (The ordinary Sturm theorem.) 

THEOREM 2. Consider y"’ + ¢(x)y = 0, where o(x) ts continuous and mon- 
otonically decreasing (increasing) in the strict sense ona < x <b. Suppose that 
a solution y(x) has the zeros a, B, y, --- (at least three and arranged in increasing 
order) on (a,b). ThenB—-a<y—B<---(B—-a>y—B>---). 

Remark. For the inequality 8 — a < y — 8 (8 — a > y — 8B) we need only 
the fact that g(x’) > g(x’’) (e(2’) < o(x’’)) fora < 2’ < Band B <2" < y¥. 





I. Ultraspherical polynomials for \ > 1 


1. Trigonometric comparison. The ultraspherical polynomials can be char- 
acterized by the differential equation 


(1) y’’ + {(n + dr)? + ACL — A) sin *d}y = 0 

with the solution y = sin’ 3- P(cos 3) = #-f(8), where f(8) = ap + aw + ---, 
a ~ 0. We compare this with the differential equation 

(2) u’’ + (n + d)*u = 0 


with the solution u = sin(n + A)d = Jg(9) = P(bo + DW 4+ --- ), where 
bo 0. Since ford > 1,A(1 — A) < 0, because of Sturm’s oscillation theorem 
between two consecutive zeros of y(#) there lies at least one zero of u(?). Hence 





Tv 
(3) di. —_ Opa > n+nX (k = # 2, cet, n). 


This inequality holds for k = 1, d = 0, since 


lim {y’(#)u(e) — y(d)u’(d)} = 0. 


v-+0 


Adding (3) for successive values of k, we obtain 





k 
Now #% + @rsi-e = 7, whence F 
in k+aA-—1 : 
(5) dv; = TtT— Paik < ss a Tv i 
The combination of (4) and (5) gives the estimate for the k-th zero of P‘’(cos 8): ‘ 
. k k+r-1 
(6) n+ Xd rcu< —- (k = 1, 2; ’ n) 


This corresponds to the estimate (C,) given by Szegé for 0 < A < 1 (Introduc- 
tion 2); it is particularly like that of Bruns for the Legendre polynomials. 
It is evident that if \ = 1, 3, = kr/(n 4+ 1). 














ZEROS OF JACOBI AND RELATED POLYNOMIALS 307 


2. The analogue of Szegé’s estimate (C,). From Theorem 2 of the Introduc- 
tion we have 3 — 0 > db. — & > --- > Pinj2}41 — Pfnjq}, Since the coefficient 
of y in (1) is monotonically increasing in (0, 7/2); furthermore, for n even, it 
fulfills the conditions given in the remark to Theorem 2 with @ = #&{»/2)-1, 
B = Bin, Y = Ptnj41- Thus the polygonal line joining the points (k, #8;), 
0 < k S [n/2] + 1, is concave upward and hence takes on its minima at its 
end points. Consider now 8; = 3 — ke: — ¢. The line joining the points 
(k, 8;) is also concave upward. We want to determine c; and cz so that 3; > 0 
at each end point and so that the inequality & = ke; + ce will be “better” 
than (4). 





For k = 0, 8, = % — co = —ce; we put ce = 0. Moreover for n odd, 
, n 1 n+1 
Pingiy/2 = Wasiy2 — . Q= ; ~ 1. 
If we put this equal to zero, cq, = 7/(n + 1). For n even we obtain 
’ 7 n , 7 n 
= ee. ails Oi Bids seman Oe, 
Bi/2 = On/2 n+1°2’ Diins2 l+n/2 (5+ ) 


Adding, we obtain 
3, 2 + Pisa = I nj2 oa Drinse - r= 0. 


Now since the polygonal line joining the points (k, #;) is concave, we cannot 
have O./2 < 0, > Oor Bi = On seas = 0. Hence 3, 2 > 0, Dien 2 <0. 
Thus we have for both n odd and n even 


o, = & — kx/(n +1) 20 (k = 1,2,---, [(n + 1)/2)]), 
the equality holding for n odd and k = (n + 1)/2. The lower estimate 
(7) 3, > kx/(n + 1) 


is better than (4). 
Consider now 


” k+e 
a ea~sa* 





Then 3, — 0/_, =o —&1—7/(n+X). From (3), we have 3), — 8/_, > 0. 
Thus the sequence {#;} is monotonically increasing and takes on its maximum 
at its right end point. We want to determine c so that this maximum will be 
less than or equal to zero. 

For n odd, 8(n41/2 = 2/2 so that 
n+ 1 
= 


-_ n+xX 





” 
Pin+1) ; = 
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If we set this equal to zero, then 2c = \ — 1. For n even, it must be verified 
that 


Now 
Pisnj2 — One = 7 — Wr > r/(n + A), 
whence the statement follows. The upper estimate 


A-1 


<< 
2 n 
(8) 0, < —_— (« = 1, 2, see []) 


thus obtained is better than (5). The combination of (7) and (8) gives the 
estimate 


+335 
k 2 n 
(9) * + r< ov; < “=a T ( = 1, 2, se. [3], 


in which, compared with (6), both the upper and lower estimates are improved. 





3. Bessel comparison. The inequalities 
(10) &’ <sin?d Ss 0 +6, c = 1 — (2/z)?, 0<<vs 1/2 
lead from (1) to the comparison equations 
(11) u’’ + {(n + dA)? + AL — A)P* Ju = 0, 
(12) v’’ + {(m + A)? + ACL — ANe + ACL — A)? Jo = O,7 
which have the solutions 
(11!) w= VO-Jd,{(n + dd} = Ma +ad+---), y=rA—},a #0, 
and 
om * VI-Jy{l(n + A? + ML — Ae} d} = Mbo + id +--+), 
y= —},b #0, 


respectively. Between the differential equations (1), (11) and (12) we have 
relations similar to those in Theorem 1 for 0 < 3 < 2/2. Further, it is easily 
seen that 


beer {u’(d)y(9) — u(d)y’(9)} = 0 


and 


lim {y'(8)v(d) — y(d)o’(9)} = 0. 


d—-+0 








Li. ETT 
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Thus between consecutive zeros of y(#) there is at least one zero of u(#) and 
between consecutive zeros of v(#) there is at least one zero of y(#). Hence, 








denoting the zeros of J,(#) by ji, je, js, --- in increasing order, we have the 
estimate 

Je bee: n+l 
a) 53) <%< pyrene - (i - 3-4] 2 ). 


These inequalities correspond to the estimate (C3) of Szegé (Introduction 2). 


4. An estimate for j,. Now consider k fixed and n = 2k — 1,k = (n + 1)/2 
Then from (13) 
Je v de 
— < “a < ~ 7 
2k-—-1+A 2 V (2k —1+4+r)? + MI — Aje’ 
from which 


_ a” ee 
(« )ry} + @e 140) <* (: 


whence, writing y = A — 3, we have 


< 
=, 14 Te 
(}-34+ yt ay ae 
re 1)\\ 
(14) d= (: “ge 3) . o(@)) 


We thus have 
II. Estimates derived from Stieltjes’ integral representation’ 


1. Stieltjes’ integral representation. Using the generating function of the 
ultraspherical polynomials (see eq. 2, Introduction), Stieltjes derived the follow- 
ing important representation :* 


P (cos 8) = 2a7' sin rA(2 sin B)~* Ret O-**/?! 











) 

Tv, 
Xe 
) e. 


Da 
7 
1 
— + 
4 


k 


(1) ' 
/ rl — r)t—"(1 — kr)" dr, 
0 
where 
(2) k = 3(1 — icot 8). 


3 In a letter to Prof. Szegé (January 7, 1935) Prof. Fejér gave a proof of the estimate 
(C.) of Szegé in the special case of Legendre polynomials (A = 3) based on Stieltjes integral 
representation. The results of this part were derived after I had seen this letter through 
the kindness of Prof. Szegé. Ina letter (February 10, 1935) Fejér derives (C:) generally 
from (1) for0 <A <1. Finally in a paper which should appear in the Monatshefte fiir 
Mathematik und Physik he obtains an upper estimate for 3 which is for 0 < \ < } better 
than that of (C2) as well as better than the upper bound in (5). 

4 [1], II, 122 (in a letter dated December 19, 1890). Cf. also Szegé [8], p. 57. 
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Here the first two factors of the integrand are real and positive; the third factor 
is equal to unity forr = 0. This complex factor may be written for 0 < 3 < 1/2 
in the form (1 — kr) = p(r)>e-®¥™, where p(r) > 0, ¥(r) real and p(0) = 0, 
y(0) = 0. Now 





¥(r) = tan“ (cot Dinan ), 
2-—r 


so if 


(3) A,(8) [ ere cos fn ( a8 )} . 
= r*(1 — rv)" p(r)- tan~’ | cot d- Vs 
B,(9) Jo sin 2—7 


then 
«) P (cos 8) = 2x sin wA(2 sin 8) 
{A,(8) cos [(n + A)F — rd/2] + B,(¥) sin [(n + A)I — wd/2]}. 





2. An estimate for 0 < \ < }. Consider now the following values for 
(n + A) — wA/2: kx, (ke — 1/4), (kr — 2/2), (ka — 32/4). The corresponding 
values of 3 are respectively 
ee. => i 2=e , — 3-2 
me eee aa eae a yee. Mee 
nar.’ n+ , n+ ’ n+X : 
while the corresponding values for }2(2 sin 8) (sin 7A)-! P2?(cos 8) are re- 
spectively 

(—1)FAn(ae), = (—1)*2-8{An(bx) — Brlbx)}, = (—1)**" Ba lex), 


(—1)*12-4{A, (dx) + Ba(di)}. 





ay = 


Now since 0 < \ < }and0 < # < 7/2,0 < A tan“ [cot 3-r/(2 — r)] < 2/4, 
so that A,(8), B,(8), and A,(#) — B,(8) are all positive. Therefore PO? (cos #) 
changes sign between b, and c, and hence must have a zero there. This gives 
the following estimate: 





k — 1 5 r ee 1 = . 
(5) “335° *"* "ga * («=1,2,...,[3]). 
The upper estimate given by Szegé [see Introduction, (C2)] is kx/(n + 1). 
3—-A n+l 
2 ——_—_—_ 





For values of k less than , the estimate above is better. 


1—A 2 


3. An estimate for— 4<X<0. For—3<A<Oand0 <8 < 7/2 


—" <) tan (cot b. . r_) < @, 


4 2—-rf 























ZEROS OF JACOBI AND RELATED POLYNOMIALS 311 


so that A,(#) and A,(#) + B,(#) are positive, while B,(#) is negative. Thus 
P (cos 8) must have a zero between c, and d;. This gives the following 
estimate: 


i 3 — 2d “a 1—A 


4 2 n 
(6) “Gar. " * Saee («=1,2,.-.,[5]). 


Remark. (5) and (6) also hold for n odd and k = (nm + 1)/2, when = is put 
instead of < in the lower estimate in (5) and similarly in the upper estimate in 
(6). 

III. Jacobi polynomials 

1. Trigonometric comparison. For the Jacobi polynomials we have the 

differential equation 


” a+p+i1\ tj —@ tf} es 
‘ +{(n+ 2 ) + Fsin? 0/2 + Fee anf” =° 
with the solution y = (sin #/2)**+! (cos #/2)8+). P'*”) (cos #) = det! .f(8), where 


f(8) = ay + 4d + ---, a #0. We compare this with the differential equa- 
tion 





2 
wt (ng St8ttVy 20 


with the solution u = sin (n + state = 8-g(8), where g(8) = by + 


bd + ---,bo #0. For a, 8 > —1 all of the zeros of P‘**? (cos 8) are real. 
Let them be denoted in increasing order by 3$*""’, 0 < 3% < r;k = 1,2, ---,n. 
We discuss the position of 3% * for a, 8> — }. 

Case A: a < }, 8 < }. In this case 


a+6+1VY, }-e@& i —-# ( et fry 
(1) (n + 2 ) a 4 sin? 3/2 + 4 cos? 3/2 i aad : 








so that between consecutive positive zeros of u(#) there lies at least one zero of 
y(8). Further, 
lim {u’y — y’u} = 0, 


o+0 


sincea + 34> 0. We see, therefore, that at least one zero of P‘*’*? (cos #) lies 


in each interval 
k ot) (b= 1,2,---,n). 
np othr n+ 2tht 


Further, since the length of each of these intervals is less than +/n, we must have 
exactly one zero in each interval. From this the following estimates result: 


k 


T 
a+f6+1 
2 











(2) 3 r<oe"e< 


(k = 1,2,..-,n). 
np StEt! 


n+ 
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The above argument may be carried through for P‘°* (cos #), since 8 + 3 > 0. 
Thus 8°” also satisfies (2). Now® 





(3) oe) 4 oF), = 2, 
so that 
k + a+ . —1 
oe) og — 92), > tT. 
a+B+1 
——— 


< 0, the lower estimate of (2) has been im- 


Since in this case —1 < ote-! 


proved. The combination of this lower estimate and the upper estimate from 
(2) gives the following: 








94 228-' 
a be Qa 
—a ——_ 


For a = B = \ — 3 we obtain Szegé’s estimates (C2) (Introduction 2). 
Remark. If the comparison equation 


2 , on nt ae 
wrt {(ngstetty 43 = Bly a0 


is used, the upper estimate is improved slightly. Thus 











k a+p-—1 
+ 2 r<9ePe k - 
a+B6+1 a+B+1 
(4’) aie sae ee ee 





nity 22s 8 @akg---\e 
4 (n + te de ll. 4 = ) 
Case B: a > 3, B > 3. In this case, the inequality (1) must be reversed. 


Again 
lim (y’u — u’y) = 0, 


v—+0 
since a > 3, so that 


T 


np tht 


gle) > 





5 Cf. for instance [8], p. 4, (3). 











9 oe 
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Applying Sturm’s oscillation theorem and the reversed inequality (1), we have 




















gia?) oe gia:8) > T : 
np otb+l 
2 
Adding these inequalities for successive values of k, we obtain 
ee") > & ; T. 
n oo atet* 
The relation 3") + 98:4), = x gives 
+B-—1 
k+ fst seal 
Of =e - ON <r ttf ; *- 
n + ater: n + atete 
The last two inequalities give an estimate that is analogous to (4), namely: 
k by Ste 
(5) ir sor” < i7 (k = 1,2,---,n). 
n+ tit np tht 


For a = 8 = \ — 3 we obtain I, (6). 
Case C: a? < 3,8 > 3 and Case D: a > 3, 6? < }. For these cases® there 
is a value of 3, say 3’, 0 < 3 < a, for which 
bra $= 8 _ 
sin? 3/2 © cos? 3/2 


Then in the interval (0, 3’) we have 


0. 





oe) _ oe) s —_* 
a+B+1 
aa we 





in cases C and D respectively from the same considerations as above. Adding 
for successive values of k, 


(6) 
(7) 


k 
Tv 
a+f+1 
2 


in cases C and D, respectively, provided that 0 < 8°) < 8’(a, 8). 
In order to obtain estimates for the interval (#’, 7) we use (3). If we inter- 
change a and 8, cases C and D are interchanged and #’ becomes r — #8’; Le., 


gia) < 





n+ 


® We always assume tacitly a, 8 > — 1. 
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I 


0'(8, a) = x — ¥'(a, 8). Consequently 
“2 ——_ 


Tv 
n + «+e+* 





in cases C and D, respectively, provided that 0 < 3? < 8/(8, a). If now 
8’(a, 8) < d's) < 2, we have 0 < 88:4), < 8'(8, a). Therefore 








get) ge — 98) Se n+1—k 
a4 tet! 
2 
or 
a+B-—1 
(8) (a. Bl < wel _— (a8) = , 
(9) o; > +k [v'; m (9’, x)] 


in cases C and D, respectively. Thus the process that gave both upper and 
lower estimates in cases A and B on the interval (0, +) only yields an upper 
estimate in case C and a lower estimate in case D on the intervals (0, 3’) and 
(9’, T). 


2. Bessel comparison. The following representation of the coefficient in the 
equation of the Jacobi polynomials is convenient for obtaining estimates of 
their zeros by means of those of the Bessel functions: 

i—-@ + i —-? a tee =2 4t=e cos 3 
4sin?3/2 ' 4cos*0/2  2sin?d 2 ~~ sin? 3" 
We have 











v? < sin? # < dF? + 6, 0<d<¢<z7, 
where c = sin g — g* and 1 — #/2 < cos # < 1, whence for g = 1/2 


i cos 3 - 
(10) oe b<sa5 <vd?’+c. 
For brevity, we put 
$a -F . F-o ome 
2 sin? 3 2 sin? 3” 





a+sp+1\ 
“teet, 


(11) O(a, B; 8) = (n + 
There are four cases depending on the signs of the expressions } — a? — f?, 
B — a’. 

Case 1:4} — a — 6 > 0,6? — a > 0. Using (10) we have’ 


7 We again assume throughout a, 8 > — 1. 





yee 
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—_—— — 
(np 2tett) _ e241 <n 








2 4 e 
—_ 
<(n ~ stay} + (4 — ate + £—* 
d 

Denoting by A? and B* the terms independent of 3, A > 0, B > 0, we obtain® 
1 > 1 9 
42 i ° .—-c 

A? + — < O(a, 8; 3) < BB+ 7 


Case II: } — o&® — 8 > 0,8 — a <0,a > — }. Asincase I we have 


7 2 of 1 _ of 
— 48 =k e = < O(a, 8; 3) 








2 e 
atetty (} —a — ®)c 27a j-a 
which may be written in the form 
. a 2, 4 -@ 
Cc? + §=* < Oa, 6; 8) < D* — a 


Case III: } — a? — & < 0,8 — a& <0. In this case, the constant terms are 
the same as in case I and appear interchanged. Thus 


—a* 


1 
B 8;8) < A? + 








Dig 
Case IV: } — & — B <0, 8 —a& >0,a> — }. In this case the constant 
terms are the same as in Case II and appear interchanged. Thus 


ns ; ‘ 1_@ 
D 8; 39) < C? + * a 








The comparison equation in ali four cases is of the form 





w+ (a thy wo 4>0 
a ye _— ’ é ’ 


with the solution u = #!.J,(Ad) = d*+!.g(8), where g(8) = a + a0 +---, 


a ~ 0. Further, for all cases, 


lim (y’u — u’'y) = —lim (u’y — y’u) = 0, 


g—+0 v— +0 
since a + 4 > 0. The resulting estimates are then 
Case 1: jx/B < 8'% < j/A; 
Case II: j./D < 8%" < j./C; 
Case III: j,/A < 8 < jx/B; 
Case IV: jx/C < o's"? < ji /D, 


5 We take n so large that A? > 0, B? > 0. 
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where j, denotes the k-th zero of the Bessel function J,(#) of order a and 
k = 1,2, --- , k’, where k’ is such that 


ve << dER, 


and nis large enough so that all the expressions A, B, C, and D are real. 
There is no difficulty in the discussion of the cases in which either of the two 
expressions } — a? — 8°, 6* — a’, is zero. 
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CLASSES OF MAXIMUM NUMBERS ASSOCIATED WITH CERTAIN 
SYMMETRIC EQUATIONS IN n RECIPROCALS. III 


By H. A. Smmmons anv W. E. Biockx 


1. Introduction. By extending considerably the methods used by Sim- 
mons in the first! paper I, and by Stelford and Simmons in the second? paper II, 
we shall obtain results that include as special cases all theorems of I, II (cf. the 
definition of remarkable properties in this section and* Theorems 5, 8, 9 and 12). 
We shall explain in more detail what we do in this paper after we recall from I a 
few definitions that we use here. 

If a solution x = (xm, ---,2,) of any given equation with which we deal is 
obtained by Kellogg’s process‘ of minimizing the variables 2, --- , Z,—; in this 
order, one at a time, we shall denote it by w and call it the Kellogg solution of 
the given equation. For the equations that we consider the Kellogg solution is 
(except in §14) one in positive integers. It always belongs to the general class 
of solutions that we admit, namely, that in which x, --- ,2,-; are positive 
integers and x; S r2 S --- S z,. These solutions include all positive integral 
solutions and are called E-solutions (for extended solutions, beyond those in 
positive integers). Thus, for a very simple example, the Kellogg solution of 
a; + 2,' = 2/7is zx = w = (4, 28) and its E-solutions are (4, 28), (5, 35/3), 
(6, 42/5), and (7,7). From Theorem 2, p. 887, of I, we know that 28 (=we) is 
the largest number that exists in any E-solution of the given equation and that 
28 appears in no E£-solution of this equation except w. Furthermore, if 
P(x, 22) = P(x) is any symmetric polynomial in 2, x2 with no negative coef- 
ficient, and if P(x) is not a mere constant, Theorem 3 of I contains the following 
statement as a very special fact: if s = X is any E-solution of the equation 
xz; + 2,' = 2/7 other than its Kellogg solution w, then P(X) < P(w). 

Where nothing is said to the contrary, we adopt generally the definitions and 
notation of I, II. Thus P(z) stands for a polynomial of the type defined above 
except that P(x) contains n variables instead of 2; and with i = 0 and j equal to 


Received August 12, 1935; presented to the American Mathematical Society, April 19, 
1935. The results of Parts 1, 2,3 of this paper are due to Simmons; those of Part 4, chiefly 
to Block, a student at Northwestern University. 

1 Cf. Trans. Amer. Math. Soc., vol. 34 (1932), pp. 876-907. 

2 Cf. Bulletin Amer. Math. Soc., vol. 40 (1934), pp. 884-894. 

3 A theorem has the same number as the section which contains it. 

‘Concerning Kellogg’s diophantine problem and extensions of it, cf. O. D. Kellogg, 
American Mathematical Monthly, vol. 28 (1921), p. 300; D. R. Curtiss, ibid., vol. 29 (1922), 
pp. 381-387; and Tanz6é Takenouchi, Proceedings of the Physico-Mathematical Society of 
Japan, (3), vol. 3, pp. 78-92. 
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integers, we let >;,;(2) stand for the j-th elementary symmetric function of the 7 
variables 2, --- , 2; with the customary agreement that 


/= 0 when i < j and also whenj < 0, 
Xi, i(2) ; 
|= 1 whenj = 0. 

For brevity, when we have that the Kellogg solution w of a given equation (e) 
contains (in the sense described above) the individual maximum number that 
exists in any #-solution of (e) and maximizes P(X), where X is any E-solution 
of (e), so that P(X) < P(w) if X ¥ w, we shall say that w has the remarkable 
properties relative to (e). The former (latter) of these properties will be referred 
to as the first (second) remarkable property. 

In I and II we established the remarkable properties for the Kellogg solution 
w, of each of the following equations in the z;: 


(1) Dn. (l/r) + AZn, -4i1(1/z) = b/a, a=([(e+1)b-1], 
(1.1) Ln.r(1/z) + En, r4(1/z) + --- + 2n..(1/z) = b/a, 


where 7, s, n are integers such that 1 < r < s S n, Ais any integer = 0, and 
b, ¢ are any positive integers. In II we generalized the results in question for 
certain cases in which a is not of the form (ce + 1) b — 1. The extensions which 
we make here can be understood from inspection of the equations that we treat 


here as we treated (1) and (1.1) in I, II, namely,’ 
(1.2) Dn. (1/xr) + 2a, n(1/z) = b/a (n2r+2), 
Lnr(1/r) + AreiDn.rga(1/r) + AveeZnrge(1/zr) + --- + A,Z,,.(1/2) = b/a 


(1.3) 
(n2s>r+4+1), 


where the \, (¢ = r+ 1, --- , 8) are integers = 0 such that 
Nidie2 — Ande ZO (=r—1,---,s—l)j=r—2,---,t—-1) 
(1.4) 
Ai = Aw = 0, Az 1; 
(1 5) Zn. (1/2) + ArsiZn, r+i(1/2x) + Ar+22n, r42(1/2x) + lish + AsZn,s (1/z) 


+ n,.(1/r) = b/a (n>s+2>n+2), 


where’ the d,; are as in (1.4). 


5 Were we to use 2,,-(1/x) + AZ,,,(1/z) in the place of the left member of (1.2), we 
would need to prove in §4 the following inequality instead of (4.14): 


(Ie)? > (1e**)(1e*) + AC). 


We have not been able to establish this inequality when A is a positive integer restricted 
only by the new equation (1.2) in question. It is for a similar reason that we do not use a 
positive integral parameter \ as a multiplier of 2,,,(1/z) in (1.5). 

® To allow the case n = (s + 1) here would duplicate a case of (1.3). 
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In (1.2), (1.3) and (1.5) a can be as it is defined for (1) for every set of values of 
n, r, s and the d’s that we admit, and for some choices of these numbers extra 
content is given to our theory by allowing a the generality that it has in II 
(ef. Theorem 8). One should note here that there are infinitely many sets of 
positive \’s for which (1.4) holds.?’ For example, with s = (r + 2), (1.4) is 
satisfied if \?,, = A>x2 > 0; with s = (r + 3), (1.4) is true if A2,, = Ars, 
ArutAre2 2 Arss > O, and A244 = ArytAras. It is also to be observed that the 
second subscripts of the 2’s in (1.3) are consecutive and that the only cases 
where we do not require these subscripts to be such are exhibited in (1.2) and 
(1.5). We have tried, without success, to find for our theory some modification 
that would enable us to establish the remarkable properties for the Kellogg 
solution of the equation 


En, 1(1/x) + Zn, 3(1/z) - b/a ’ 


where a is as it is in (1) and nis any integer > 3. The difficulty is that we are 
unable to establish the analog of (2.3) below [or of (4.3) and (4.4) in the cases of 
(1.2) for which r > 1; or of (6.3) and (6.4) in our treatment of (1.3)]. For this 
reason, we have not been able to apply our method with success to any general 
equation of the form® 


Zn,r(1/z) + Zn,e(1/z) = b/a (n>s>r+1). 


New features of our procedure. The nature of our present modification of the 
procedure of I and I is described fairly well in the following four statements. 

(1) In the case r = 1 of (1.2), the individual maximum number and the 
class of maximum numbers are identified in two different processes; the class of 
maximum numbers for this case is identified by transforming the elements of an 
arbitrary E-solution X ~ w into their Kellogg correspondents in the reverse 
order, from X,, to X; (cf. §§2, 3). 

(2) We use inequalities here that require extra detail (beyond that of I, IT) 
relative to the sizes of the X; in an E-solution [ef. (4.12), for example]. 

(3) The inequalities between the partition symbols in §§4, 11 involve extra 
terms [cf. the last term, (1*-"), in (4.14) and the right member of (11.16)] and 
call for a variation of corresponding procedure’ in IT. 

(4) The observations that (7) and (11) are equivalent to (7.1) and (11.1), 
respectively, are new and they are important because (7.1) and (11.1) are 
analogs” here of the key-inequality of I, namely, (46) of I. 

The discussion from §2 to the end of this paper is divided into four parts as fol- 
lows: Part 1 deals with (1.2), the case of (1.5) in which \; = 0 (¢ = r +1, --- ,8), 
§§2 to 4 (inclusive); Part 2 with (1.3), §§5 to 8; Part 3 with the case of (1.5) in 


7 Inequality (1.4) implies that if \,,..., 4 (r S k <s— 1) are positive and if ¥y. = 0, 
then A, = 0 (¢ = k + 1, --- , 8), as can be readily proved. 

8 We deal with a special equation of this form in §13. 

° Cf. p. 888 of IT. 

10 The case r = 1 is admitted in (7.1) and (11.1), whereas it was excluded in (46) of I. 
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which \; > 0 (ef. footnote 8), §§9 to 12; Part 4 with problems that we have 
considered in trying to extend our theory and with an application of this theory 
to the convergence of certain types of series, §§13 to 16. The order in which we 
present our results (of Parts 1, 2, 3) is that in which we have obtained them" 
and is one of increasing difficulty. Part 3 depends upon (and naturally follows) 
Part 2 in the same way that Part 1 here depends upon I [ef. our proof of (4.3) 
below]. 


Part 1. The remarkable properties of the Kellogg solution of (1.2) 


2. The maximum number in any E-solution of a special equation. We 
consider now the case r = 1 of (1.2), 


(2) 2n.1(1/z) + En.n(1/z) = b/a, a = [(c + 1)b — 1). 
The Kellogg solution of (2) is [ef. I, p. 886, equations (23)] z = w, where 
wm=c+l, Woit = AW, --- Wp +1 (p=1,---,n—2), 
(2.1) 
Wn = A(W; -++ Wai + 1). 


We wish to prove now that the w of (2.1) has the first remarkable property 
relative to equation (2). 

Proof. From I we know that if X,...(-1 is any set of nm — 1 positive 
integers such that 2,-1,:(1/X) < b/a, then X,... in» is a set o [ef. 14 of I] and 


(2.2) Z»,1(1/X) Ss Zp, 1(1/w) (p = 1, ss 1). 


Suppose now that X is an E-solution of (2), so that (2.2) holds. Further, 
let X be different from w and suppose that X, 2 w,. We shall reach a con- 
tradiction. Under our hypothesis X contains at least two elements that differ 
from their corresponding elements of w, one element of class A and one of class B 
(ef. p. 891 of I) and, according to equation (31) of I, the inequality sign in 
(2.2) holds for p = (n — 1). Further, since X, = wy, 


(2.3) En.1(1/X) < En,1(1/w) . 
From (2.3) and the fact that X is a solution of (2), it follows that 
(2.4) Xi --- Xn < Wes: Wr. 
However, by solving (2) for x,, we find that 
X, = a(Xy--+ Xa + 1)[bX1 --- Xa — aSn-1, n-1(X)]',, 
so that 
(2.5) X, S a(X--- Xnu+ 1). 


Since w, = a(wi --- Wa-1 + 1), (2.4) and (2.5) are inconsistent with our hypoth- 
esis that X, 2w,. Therefore X, < wy. 


1 In the beginning of this work our main object was to overcome the difficulty that is 
explained in §22 of I. 
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3. The class of maximum numbers relative to (2). We wish to prove next 
that the w of (2.1) has the second remarkable property relative to (2). 

Proof. We consider the elements of any E-solution X # w in the order X,, 
Xn, --+,X:. Since X # w, X contains at least one element of each of the 
classes A and B, and we let X,,, X,_ be the first elements in X,..., of class 
A, B, respectively. Our transformation from X to X’ is an analog of (33) of 
I (cf. the transformation defined below by setting f = X in (3)); then if X’ ¥ w, 
we repeat the transformation to pass from X’ to X’’; etc. Using the notation 
of p. 898 of I, we define our transformation from f, ...1 to f, ..., to be ¢; or ts: 





(i) f, =So(p #10), fo, = we, 


Z Za, (5) +2, (5) om. (?) + 2s, {°) 


(t) f, =fo(p #0), fe= we, 


23) +243) (3) +24) 


according as t; defines fi, to be not greater than wy or greater than wy, respectively. 


We shall show that every transformation which we employ in passing from 
X to w is such that” 


(3.1) Sofa < Sede» (fo,)' + (fo)' < (fo, + (f's)', 


where ¢t is any positive integer; the second remarkable property will then follow 
from Lemma 3 of I. That (3.1) holds for the transformation from X to X’ 
follows from Lemma la of I and the fact that X is an E-solution of (2) different 
from w so that X, = X,¢ < wz (ef. §2). We shall be assured of the validity of 
(3.1) generally if we can show that in any intermediate set f,...1 = X“ the 
first element which differs from its correspondent in w is f,, of class B®. Sup- 
pose this were not the case. Then we should have 












(3.2) fpo= Xp (P=1,---,4-D, fa > var fo= ve (P= Aa+1,---,n). 






Now (3.2) implies that fi... @1) # wi... @-1. Hence from (3.2) and (31) of 
I, it follows that 






(3.3) Yo—1,1(1/f) < Lo-11(1/w). 
Then (3.2) and (3.3) imply the inequalities 
(3.4) Xo,.1(1/f) < 2o,,1(1/w), 





(3.5) 





2n1(1/f) < 2n,1(1/w). 





” That Sofa ¥ So, fg follows from the second footnote on p. 892 of I. 
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Hence we could replace fo, by we, in (3.4) and (3.5) and obtain true inequalities. 
Indeed we could make the substitution f,, = w,,, with f, = tS, (p # A, 18, 
where ,@6 = n), and select f, > w, in such a way that the new set f’ would 
satisfy (2). This set f’ would be an E-solution of (2), and the inequality f, > w, 
would contradict the theorem proved in §2, namely, that w, has the first remark- 
able property relative to (1.2). 


4. The cases r > 1 of (1.2). We wish to establish the remarkable properties 
for the Kellogg solution w of (1.2) in the cases r = 2,---,n — 2; the case 
r = n — 1 of (1.2) was treated in I. The Kellogg solution zx = w of (1.2) is 
defined by the following equations 


wp = 1 (p= 1,---,r— 1), wyp=ec+l, 


(4) 
Wri = Ap pri(w) +1 (p=r,---,n—2), wa = alZn-1,n--(w) + 1). 
We define the left member of (1.2) to be ¢,(1/z), and write generally 

(4.1) ¢p(1/zx) = 2», -(1/z) + Zp, n(1/z) (rs ps n), 


so that ¢,(1/z) = 2,,,-(1/z) forr S p < n. 
From (4.1) it is clear that ¢,(1/x) may be written in the form 


en(1/r) = Zn,r(1/z) + x, '¥(1/z), 
¥(1/z) = 2-1, r-1(1/z) + Ln—1, n—1(1/2). 


From (4.2) one observes that in order to establish the first remarkable property 
for w, it suffices to show that if X is any E-solution of (1.2) except its Kellogg 
solution w , then the following relations are true [ef. (30) and (32) of I]: 


(4.3) Ln—-1,r(1/X) 3S Ln-1,r(1/w), 
(4.4) ¥(1/X) < ¥(1/w). 


The class of maximum numbers in which we are interested (ef. §1) will also be 
identified if we can show that every transformation that we make in passing 
from X to w (by one or more transformations) accords with (3.1) (ef. Lemma 3 
of I). We shall prove this to be the case after we make two more definitions. 

Definition of transformation. We now consider the elements of our solution 
X # w, and more generally of f [cf. (3)] in the order from the first to the last: 
f=h.... If f = X@ contains at least one element of each of the classes 
A@™, B@, the transformation from f to f’ is defined by ts or t,, 


(ts) Sp =f, (P #19), fo, = We, — en(1/f’) = onl /f), 
(ts) Z, = } a (p # fn, 19), fis = We, ¢n(1/f’) = ¢n(1/f), 


according as ¢; defines fee to be not greater than w,, or greater than w,,, respec- 
tively. 
Definition of set o. Let \ be a fixed positive integer such that r S \ S n, 


(4.2) 


(4.5) 
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where r and n are as defined above for (1.2). We shall call 2;..., a set o [relative 
to the Kellogg solution w of (1.2)], if and only if y,(1/r) < ¢,(1/w) for every 
positive integer p such that r < p S X. 

Proof of (4.3). In I we showed that if X,...,,r S p S n — 1, is any set 
of p positive integers such that 


Z)(1/X) <b/a, a =([(e + 1)b — 1, 


then" 
(4.6) Zp. r(1/X) S Zp,-(1/w), 


where w is the Kellogg solution of (1) of I. Hence (4.6) holds in particular 
when X is any £-solution of (1.2) and w is the Kellogg solution of (1.2), the first 
n — 1 elements w; of the two w’s in question being identical [cf. (26) of I and 
(4) here]. Relation (4.3) is the case p = n — 1 of (4.6). 

Remark. Since (4.6) (a relation from I) happens to hold here, it is not neces- 
sary for us to make an induction here parallel to the main induction of I. This 
explains why we use n rather than an induction integer k, r < k S n — 1, as 
the subscript of ¢ in (4.5). 

The completion of the proof of the remarkable properties for w will not be 
difficult after we establish the following lemma, which resembles certain parts of 
Lemma 9 of I. 

Lemma 4. Let f (= fi...,) stand for our E-solution X (#w) or any inter- 
mediate set of X under transformation (4.5). Then application of (4.5) to f is 
such that (3.1) and the following relations hold: 


(4.7) Zpr(1/f’) S Zp, -(1/w) (r>1;p=r,---,.0-—1), 
(4.8) zy. (1/f’) S 2p, -(1//) (r>1;p = 10,---,n— 1). 


If we prove the case f = X of Lemma 4, the other cases will follow as did 
the corresponding further cases in I (cf. Lemma 9). 

Proof that (3.1) holds when f = X. Using all cases p = r, --- ,n — 1 of (4.6), 
we observe that f = X is a set o, so that 6, = q: < 18 = q. Since X is an E- 
solution, fp, S fj. Consequently, (3.1) holds (cf. Lemma la of I). 

Proof that (4.7) holds when f = X. By (4.5), f: = f, = w, for all positive 
integral values of ¢ from 1 to ,@ — 1 except t = (1, and w, S f A < fs,. Hence 
(4.7) holds when f = X. 

Proof that (4.8) holds when f = X. This inequality is the present analog (cf. 
footnote 10) of (37) of I. Since (37) and (46) of I are equivalent, in order to 


13 It is to be noted that the hypothesis in I that X is an E-solution of equation (1) of I 
was not used (in full) in proving (4.6). This hypothesis was used only when the value n 
was assigned to p. 
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prove the case f = X of (4.8) it suffices to establish the analog here of (46) of I. 
This analog is readily found to be" 
(4.9) Dinar 2 pore S (Zane + Vaa,es)Zpua,r-19 
where® p = 19g, ---,%—1. Our method of proving (4.9) is to show first that this 
relation is true for p = n — 1, and then to prove that if (4.9) is true for p = k, 
where k is any admissible value of p except its smallest value, 1q, then (4.9) is also 
valid for p = k — 1. We have not been able to prove directly the inequality 
which one obtains by putting p = n — 1 in (4.9). However, by reducing the 
terms of each 2, , of this relation to a common denominator and then can- 
celling the denominators, we obtain the following equivalent of the case p = 
n — 1 of (4.9), which we shall establish: 

, 


(4. 10) Zz n-2, ncn wands = [= a n-r(X) + 1} a. n—r-2(X)- 


Employing the identities” 


2 -2,(X = Zn-3,.(X) + ) & ews ¢ (({=n—r—I1,n-—- nr), 
we find that (4.10) reduces to 
(4.11) es oes 2 Dhow: n—r(X) + 1]Z,_3, ened 


The symbols in (4.11) are all positive except for r = 2; inthis case 2;,_, ,_,(X) =0 
(ef. the definition of 2; ;(x) in §1 and of 2; (2) in footnote 14) and (4.11) is 
equivalent to 


(4.12) Xi, see Xin. = Z.-00-cX), 


where X;,, ---, X;,, are the n — 3 elements of X which are under consider- 
ation, and 7; < i2 < --- < i,_3. Since X is by hypothesis an E-solution of 
(1.2), the right member of (4.12) does not exceed (n — 3)X;,, --- X;,,. From 
this fact and the inequalities X, = 1, X; = 2,¢ = 2, --- , n (which are obviously 
true when r = 2) one readily finds that in order to establish (4.12) it suffices 
to prove that 


(4.13) 2-4+>n—-—3 (n = 4). 


Here the equality sign holds for n = 4, 5, and the inequality sign holds for 
n> 5. Hence (4.13), the case r = 2 of (4.10), is true. 


4 Here, as in the sequel, 24g = 2.,8(1/X) is the 8-th elementary symmetric function 
of all of the reciprocals 1/Xi, --- , 1/Xa42 except 1/X, and 1/X.. 

1 Since ws = 1 (i = 1,---,r—1),19 27. Ifig = n, X, is the only element of class 
B in X, and every transformation that we make in passing from X to w will accord with 
(3.1), so that we shall not need to consider (4.9). In other words, no E-solution X in which 


Xi... ~-1) is non-transformable fails to accord with the theorem that w has the remark- 
able properties. In the present proof, then, we suppose X; .. . (n-1) transformable so that 
19 s n—-1. 


16 By the hypothesis that 1g < n — 1, every E,_2,s(X) under consideration contains X q. 
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When r > 2 in (4.10), we let (u*) = =/_,,(X“). Hence to prove (4.11), it 
suffices to show that, with s = n — r — 1 2 1 [ef. the hypothesis n = r + 2 
in (1.2)] 


(4.14) (1°)? S (1e+#)(1-*) + (1°). 
By use of (47) and (48) of I, with s; = ss = n — r — 1 = 8 21, we find that 
(4.15) (19)? — (1e*#)(1e-!) = [(2*) + (2°11) +--+]. 


Under present hypotheses (j‘), 7 = 1, 2, is a function of n — 3 = s + 1 variables 
X;. Hence (2*) and (2*'1*) are both positive and 


(2 1”) > (i), 
while no omitted term in the right member of (4.15) is negative. Consequently 
(4.14) holds in the sense of >. 
Final step of the proof of (4.9). We have proved (4.9) forp = n — 1. If 
the set of numbers 1, --- ,m — 1 contains only one number, our argument is 


complete; otherwise, we assume that (4.9) is true for p = k > iq (and < n; 
cf. footnote 14) so that 


(4.16) Die 2[2e-a-al! 2 [Znane + ZaeeadZr2-ab? (r > 1). 
With (4.16) holding, we wish to prove that if R is the right member of (4.16), 
then (in our induction from k to k — 1) 

(4.17) Zi-sr2[2es,-al7 2 R. 


To establish (4.17), it suffices to show that the left member of this relation is 
not less than that of (4.16), or that 


(4.18) Pr err > eee per (k > gq 27). 


Since in the case .q = r, (X; --- X/)7 < (X,--- X,)— S (wi --- w,)- here 
as in I [ef. equation (39) of I], we may, and do, assume that if . = 7, then 
k r+ 2and that if 1 >7r,then k 2.¢+1. Therefore k is never less than 
r + 2, and each member of (4.18) is positive. To establish (4.18) we use in it 
the identities 


Zier @ Zina t+ XeZi-w ¢=r—i1,r—9), 


and obtain the following equivalent of (4.18) 


Bianca?” B Bin-08s-0--8 (k >r + 2); 


or, in our partition notation, with es = (1%), 
(4.19) (1*)? = (1*")(1**) , s = (r — 2) 20, (k — 3) = (s+ 1). 


Relation (4.19) is obviously true when s = 0, and it is true by (4.14) when s > 0. 
Hence (4.9), the case f = X of (4.8), is true. 
_ 2 om (n-1) Contains at least one element of each of the classes A’, B’, 
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application of (4.5) with f = X’ will yield this case of (4.8) (cf. the proof of the 
case f = X’ of (56) of I], ete., until we arrive at aset X such that X{').. ;,-1) 
does not contain an element of both the classes A“, B®, and | i i ar con- 
tains an element of each of the classes A“~, B“-, while (4.8) holds for f = X, 
X’, --. , X¢ and, therefore, for f’ = X. Hence (4.8) is true. 

Completion of the proof of the remarkable properties for w. Relations (4.7), 
(4.8), and (4.3) imply that every set f which we consider is a set ¢. Conse- 
quently, from the last paragraph above, 


(4.20) XS > w,; G = i,.-.,0 = I). 


We desire to prove that the sign > holds in (4.20) for at least one of the values 
of 7. Suppose that the equality sign holds in (4.20) for every admissible value 
of 7. Then, since X® satisfies the case r > 1 of (1.2), X® = w. We shall 
reach a contradiction. Here, as in (29) of I, 


(4.21) 2,,-(1/X) S (bX, --- X, — 1)/(aXi1--- Xp) (p= q,---,n— 1). 


With p = n — 1, relations (4.21), (4.8) and our hypothesis that X® = w now 
imply that 


(4.22) Zn—1e(1/w) S Ln, -(1/X) . 


Consequently it follows from (4.21), (4.22), and the fact that (28) of I holds 
here that 


(4.23) W1--- Wer S Xi--- Xan. 


However, every transformation that we made in passing from X (in which 
Xi... (n—1) Was supposed to be transformable) to X® = w affected exactly two 
of the first n — 1 elements of one of the sets X, X’, --- , X“ and increased 
the product of these two, so that w; --- Waa > Xi --- X,-1, which contradicts 
(4.23). Hence the sign > holds in (4.20) for at least one of the specified 
values of 7. 

That w has the remarkable properties now follows readily. Since X 
satisfies the case r > 1 of (1.2), and (4.20) holds in the sense just described, 


(4.24) X) < w,. 


By the definition of t, X, = XS‘). Therefore X, < w, and w has the first re- 
markable property. Further, from (4.20) and (4.24) it follows (ef. footnote 15) 
that all applications of (4.5) which our procedure employs in passing from X“ 
to w accord with (3.1). Hence from Lemma 3 of I it follows that w has the 
second remarkable property. 

Proof of (4.4). The first remarkable property for w [cf. the first sentence 
below (4.2)] has just been established without proving that (4.4) is valid.” 


17 Most of §20 of I could be replaced by argument of the type that is used in the two 
paragraphs just before the proof of (4.4). 
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That (4.4) is true may be proved as follows. In every set that we consider 
Xi... @-» is transformable, and by (4.5) and (4.8) every transformation that 
we use in passing from X to X is such that 


en(1/f’) = ell /f),  Ln+a-(1/f’) S 2-1-(1/f), 


the first f admitted here being X. Hence from (4.2) it follows that in each 
such transformation 


(4.25) v(1/f’) 2 vQ//) . 


Then every further application of (4.5) that is needed to carry X® (which 
was shown to be different from w in the second paragraph above) into w is 
such that 


fe<Ies fi =f (p=1,---,4—1,441,---,n-—1), 


so that here (4.25) holds in the sense of >. Hence ¥(1/X) < y(1/w). 
The statements of the results of Part 1 are obtained by setting \, = 0 
(t =r+1,---,8) in Theorem 9. 


Part 2. The maximum number and the class of maximum numbers that we 
associate with equation (1.3) 


5. Statement of results analogous to those in I. We consider equation 
(1.3) as conditioned by (1.4). The Kellogg solution w of this equation is z = w, 
where [ef. (23) of I] 


wp = 1(p=1,---,r—1), we=c+l, 
Wp+1 = A[Zp, pryi(w) + AvyiZp, p-r(W) + Av422p, p—ra(w) 
(5) + -++ + MZp pew] +1 (p=r,---,n— 2), 
Wn = A[Zn—t,n-r(W) + ArgaZna, n-ra(W) + Arp2Zn—t, n—r2(W) 
+ +++ + AZn-1,n-2(w)] 


TueoreM 5. The solution w just defined has the remarkable properties relative 
to the equation (1.3) that we consider [including the case , = 0 (t = r+ 1, ---, 8) 
of I}. 


6. Method of procedure. In proving Theorem 5, we use the methods 
of I, and therefore of Part 2 of II. We define the left member of (1.3) to be 
¢n(1/x) and write ; 


¢p(1/x) = Zp,e(1/r) + AvtaZp.r41(1/Z) + Avy2%p, r42(1/2) 
i... ee (p=T,---,n). 


(6) 
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Then an equivalent of (1.3) is 
(6.1) ¢n—-1(1/2) + (1/2,)¥(1/z) = b/a ’ 


where 
W(L/r) = Syn, ra(1/X) + AvgrZn—1, (1/2) + Avg 2Bn—1, r41(1/2) 
fees + AsDn—t, e—1(1/z) ° 


To establish the first remarkable property for w, it suffices to prove that if X 
is any E-solution of our equation (1.3) except its Kellogg solution, w, then" [ef. 
(4.3) and (4.4)] 


(6.3) gn—i(1/X) S gna(l/w) , 
(6.4) ¥(1/X) < ¥(1/w). 


Further, if in proving these relations, we apply (as our method directs us to do) 
one or more transformations of the type defined in (6.5) below which satisfy 
(3.1), and only such transformations, we shall establish the second remarkable 
property for w (cf. Lemma 3 of I). 

To avoid redundancy and yet point the way to the conclusion of the re- 
markable properties for w, we next present a rough parallel (somewhat amplified 
in places) of Part lof II. In this parallel we omit Lemmas 7 and 8 of II, which 
need not be changed here. 

The analogs here of (26a), (30a), (32a) of II are (5), (6.3), (6.4), respectively; 
and the present analogs of (28a), (29a) of II are these respective relations 
themselves except that now ¢,(1/z) is defined by (6). 

Sets, 7. In the definition of set ¢ in §4, merely replace (1.2) and the ¢,(1/z) 
of (4.1) by (1.3) and the ¢,(1/x) of (6), respectively, to obtain the present 
definition of set ¢. Set r is defined here as it was on page 890 of I. 

The new transformation. Using the notation of Lemma 4 above, and also 
of Lemma 9 of I, we suppose that® fi...4, r << k S n, where f = X, con- 
tains at least one element of each of the classes A@™, B@. Then our trans- 
formation from f; .... tof, ... , is ts or bs, 


(6 5) (ts) f, ae fo (p cal fA, 19, [s Pp s k) ’ fe. = We,; gx(1/f") = ex(1/f) ’ 
, (ts) r of = So (p ~ hi, 18, ls Pp Ss k) ’ fis = We, ox(1/f’) = ox(1/f) , 


according as ts; defines fie to be not greater than wy or greater than wy, 
respectively. 


(6.2) 


18 A proof which in our procedure is essentially equivalent to, and slightly shorter than, 
our proof of (6.3) and (6.4) consists in showing that in the notation of §4, (4.20) and (4.24) 
hold here. In §4 we gave the analogs of both of the proofs in question. Were we to give 
here all details of our treatment of (1.3), we would not feel content to conclude our argu- 
ment without observing its obvious implication that (6.4) is true. 

19 Just as we used the relations r < k < nin Lemma 9 of I, we may employ them here. 
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Lemma 6. (Analog of Lemma 9 of 1).” (i) If X1..., is a set r, Xi...% ts 
transformable." (ii) If fi.... is a set + or a transformable set o for which 
r<k Ss nand if tis a positive integer, application of (6.5) to fi... yields a 
set f; ... , such that (3.1) holds, and 


(6.6) er(1/f’) = ¢p(1/w) (p=r,---,10—1), 
(6.7) ¢p(1/f’) Ss ¢p(1/f) (p = 19, a »k _ 1) ’ 
(6.8) ex(1/f’) = ex(I/f) . 


Partial proof of Lemma 6. The proof of (i) is the same as that of the cor- 
responding case in Lemma 6 of I. Concerning (ii), the case f = X of (3.1) 
follows from the relation X,, S X,,, which is obviously true, and Lemma la 
of I; the case f = X of (6.6) follows from the facts that . 2 rand X; 2 w; for 
i= 1,---,1q¢ — 1 [ef. the proof of (35) of I]; (6.8) is true by (6.5). 

In §7 we establish the case f = X of (6.7). Then the rest of the proof of 
Lemma 6 can be made by argument of a type that was used on pages 899 and 
900 of I. 


7. The analog of (46) of I. How to complete the proof of Theorem 5. 
From the case f = X of (6.7) and (6.8), one can derive (7) below just as we ob- 
tained (46) of I from the relations (37) and (38) of I: 


[Dior + Avg Zier + Avge Zaorge toes + AZ el 
[z~-2 r-2 + Aus Z ee, — + AusZ 5-2, + +++ A252 sal 

P [Z.-. —3T a er + a ae +--+ so 
[Zara + Mgt Z par + Avge Zporg $v + AZ poi 


(k>p2q2r,s>Pr). 


(7) 


We shall presently show that (7) is equivalent to” 
(7.1) DAaAne — Aye ZeosZ p25 — Zr2,52)-2,4] 2 0, 


where the summation extends over? = r —1,---,s —l1;j =r—2,---,%*-—1 
and A,-1 = Assi = 0, A, = 1. This equivalence may be proved as follows. Cer- 
tainly (7) is equivalent to 


, , , , 
v s,s , 
Pip Aj42(Z ao, se 2 p—o,j — Ve-e,j2p-a,s) 2 O, 


* Lemma 6 and transformation (6.5) include the lemma and transformation, respec- 
tively, which would be used in a precise parallel here of Part 1 of II. 

21In Lemma 6, X:...% must be transformable for at least one value of k such that 
r <k <n; for, as was stated in footnote 15, no E-solution X in which X; ... m1) is non- 
transformable needs to be considered. 

22 The sign = would hold here if n = 4, p = 2, k = 3, m = 1,19 = 2, and g(1/X) = 
Yoi(1/X) + Ba2(1/X) + B4s(1/X). This example shows that the sign < does not always 
hold in (37a) and (46a) of II. In both of these relations < should be replaced by S. 
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where the summation extends over all 7, j = r — 2,---,s — 1. Further, the 
sum of all terms that one obtains here by setting i = 7 is zero. Consequently, 
consideration of the two cases i > j and 7 < j now leads to the conclusion that 
(7.1) is equivalent to (7). 

Proof of (7.1). On account of (1.4), we only need to prove that 


, nail ont a 
(7.2) 2 x-2,42 pe, j =" 2-2, 52 p-2,i = 0. 


There are two cases to be considered: (1) where Lemma 8a of II applies to 
(7.2) with u = k — 2,v = p — 2,7 = i, t = j; (2) where this lemma does not 
apply to (7.2). 

(1) Here (7.2) holds in the sense of >, and (7.1) is true. 

(2) Since k > p and i > j by hypothesis, we conveniently subdivide this 
case into two other cases, as follows: (a) when j < 0, as it is if 7 = (r — 2) and 
r = 1; (8) wheni > (p — 2). In case (a) [(8)] each term [the last term] in the 
left member of (7.2) is zero. Therefore (7.2) holds in case (2). 

How to complete the proof of theorem 5. With Lemma 6 holding (cf. the last 
sentence of §6), we can make the induction for (6.3) just as the similar induction 
for (30) of I was made in I (ef. in particular the beginning of §14 and the last 
two paragraphs of both §16 and §18, of I). Then argument of §4 leads to the 
analogs here of (4.20) and (4.24) and thus establishes the first remarkable prop- 
erty for w. Next we observe that throughout the proof of Lemma 6 the relation 
{ef. (58) of I] 


for SI 


is used, so that each transformation that we employ in this proof accords with 
(3.1). Consequently, the second remarkable property for w follows. 


8. Our most general result relative to equations of the form (1.3). The 
methods by which one proves Theorems 2a, 3a, 4a, 5a of II suffice to prove the 
following 

TueoreM 8. Let ¢,(1/zx) be as it is in (6). Suppose that a, b, and yw, where 
r Sus n —1, are given positive integers, with a and b, b S a, relatively prime, 
and that there exists a set of n numbers w = (wi, --- , Wn) with the following 
properties: 

(1) it is an E-solution of the equation 

gx(1/z) = b/a, L1S&r<sSn,_ Dsasin (1.4); 


(2) ¢p(1/w) = (bw: --- wp — 1)/(aw --- wp) 


for every positive integral value of p for which » S p S (n — 1) and for no positive 
integral value of p less than p; 

(3) if x = X, where X ¥ w, is an E-solution of the equation in (1), then for 
every positive integral value of p such that r S p S uw for which X,...p ¥ Wr... p 
the relation ¢,(1/x) S ¢,(1/w) holds. 
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Then w is the Kellogg solution of the equation in (1) and w has the remark- 
able properties relative to this equation. 

The following corollary shows that Theorem 8 has content for a case in which 
oo 

Corouiary 8. The Kellogg solution of the equation 


Dni(1/x) + 2 Zn,2(1/z) + 32,,3(1/2) = 5/11 (n > 4) 
isx = w, where w, = 3, we = 14, and 
Wp+i = 11[Zy, p(w) + 2Z,y, pa(w) + 3Z,,p-2(w)]} + 1 (p = 2,---,n — 2), 
We = UZ n-a, na(w) + QZr-r, n-2(w) + 3Zn-1, n-a(w)]. 
Here the » of Theorem 8 has the value 2 > r = 1. 
Part 3. The maximum number and the class of maximum numbers that we 
associate with equation (1.5) 


9. Statement of results analogous to those in Part 1. We consider equation 
(1.5) as conditioned by (1.4). The Kellogg solution w of this equation is 


w; (¢ = 1,--- ,n — 1) as in (5), 
(9) v, = a[>,—1, n-r(w) + Ar41Dn—1, n—r—1(W) + Ar¢22n—1, n—r—2(W) 
+ ose AsZn—1, n—2(W) + 1}, 


where the }’s satisfy (1.4). We give below the main step in the proof of the 
case ); > 0 (¢ = r + 1, --- , 8) of the following 

TuHEeoreM 9. The solution w in (9) has the remarkable properties relative to its 
associated equation™ (1.5). 


10. Proof of the case \, > 0 of Theorem 9. We use the methods of Part 1, 
§4. We define the left member of our equation (1.5) to be the new function 
¢n(1/x), and write 


¢p(1/x) = 2». (1/2) + Ar Zp, r41(1/z) + Ar+2 Dp, r42(1/2) 
4 os+ + NeZpc(h/2) + Zp cll/s) (p=r,---,M), 


so that the present ¢,(1/z) is formally the same as that of (6) except for p = n. 
An equivalent of (1.5) is 


(10.1) ¢n—1(1/x) + (1/2n)¥(1/z) = b/a, 
where 
W(1/x) = Lua, r—a(1/2) + Aver Ena, 2(1/2) + Avge Zn, r41(1/2) 
ove $ AsZnat, a(1/Z) + Zana, na(1/z). 


(10) 


23 The case \, = 0 of Theorem 9 was disposed of in Part 1. The only case that we need 
to consider here is that in which , > 0 (ef. footnote 7). 
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Now we assume X to be any E-solution of our equation (1.5) except its Kellogg 
solution, w. In order to establish the first remarkable property for w here, 
it suffices to prove that under our present hypotheses the inequalities [ef. (4.3), 
(4.4)] 


(10.2) gn-1(1/X) S gna(l/w), 
(10.3) ¥(1/X) < y(1/w) 


hold, so that X, < w, [ef. (10.1)]. If in establishing (10.2) and (10.3) we trans- 
form X into w by one or more transformations, everyone of which accords 
with (3.1), we shall prove for w the second remarkable property, and thus obtain 
the case \, > Oof Theorem 9. After we define the new transformation and set o, 
we shall give (in §11) the proof of the analog here of (4.9). All further details 
in the proof of the case > 0 of Theorem 9 will then be obvious from the first 
three paragraphs below (4.19). 

Definition of the transformation. This is formally the same as the definition 
of the transformation of §4, the symbols ¢,, w, X, f here relating to the case 
dh, > 0 of (1.5) rather than (1.2). 

Definition of set ¢. This is also formally the same as the definition of set o 
in §4 except that here the present case of (1.5) and the ¢, of (10) replace (1.2) 
and the ¢, of (4.1), respectively, there. 


11. Analog of (4.9). This relation is readily found to be** 


, ae af »/ 
iiiitiout + Ant 2n-8,¢ + A422 n—2, 41 + Pits + A, Zn—2, 0-1! 


a@ att oT ’ 
[2 ~2,r-2 + A, 412 p—e, r—l + A422 p—2,r + sola + A, 2 p—2, 2 


(11) 
, af oF , , 
> [Z-2. r—2 + A412 a2, r—l + A422 a-8,r + a + A, 2 n~2, 0-2 + Zn-2, — 
7 asi ail , 
[Z,~2 r—l + A412 p—e.r + A422 ps, r+l + a + A, Z paz, 2-1) 


(p = 19, ---,n —1;¥% > 0). 


It can be proved™ that (11) is equivalent to (11.1) in the same way that we 
showed (7) to be equivalent to (7.1): 


TrcraAsea — ArsaAseaMSe-acZo-ay — Zo~asZp-00 


(11.1) xt , 
> “~n—-2, A + A412 p—2,r + A422 po, p41 + wre + A, 2 p-2, 11) 


where the summation is to be extended over 7 = r — 1,---,8 —1ljj=r-— 2, 

--,@ — 1, and Av = Agr = 0,4, = 1. By (1.4) the first parenthesis under 
the summation sign is not negative for any admissible pair of values of 7 and j. 
We prove that (11.1) is true in two steps. First we show that (11.1) holds for 


*4 It is interesting to note that ifr = land; = 0 (¢ = r +1, --- , s), (11) does not hold. 
This is the case which we gave special treatment in Part 1. It is also to be noted that the 
sign >, rather than 2, is used in (11). 
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p =n-—1. Then we prove that if (11) holds for p = k, where k is any ad- 
missible value of p except its smallest®® value gq, (11) is true also for p = k — 1. 
First step. We treated the case s = r = 1 of (1.5) in §§2,3; the cases = r > 1, 
in §4. Under our present hypothesis that \, > 0 (¢ = r + 1, --- ,s) we need 
now to dispose of the following cases of (1.5): (i) s > r = 1; (ii)s > r > 1. 
This we now proceed to do. 
(i) Thecases >r=1. If s = 2,3, m,(3 < m Sn — 2), andp = n—1, 


(11.1) reduces to (11.2), (11.3), (11.4), respectively: 
(11.2) NE(Za-o1 — Vasa) > Zara na(l + AeZa-s1), 


[0X8 — AMSa-as — Vans) + AAc(Sa-as — Za-as) 
a a a a b> a) Fo? ee SS 


(11.3) 


[A + AAlZ-e0-3 — Zonnans) + AnAZa-a0-22anas 
(0 = Sueded + ->> + Bacal ans ~ Saeed 
> OO + hi as + Se tere + Ss a 


where A is the sum of those terms of the left member of (11.1) that are obtained 
by assigning to i values > r — 2 and < m — 1 = (s — 1). 

Proof of (11.2). The left member of (11.2) equals \3}X;' and a = 
(PX,)—!, where P = X,, --- X;,, [ef. the notation of (4.12)]. After reducing 
all terms of (11.2) to a common denominator, we find that this inequality 
is equivalent to 


(11.5) P*n8 > P + AeZ..2 0-5). 
If n = 4, the smallest admissible value of n when s = 2, (11.5) reduces to 
X73 > X,, + &- 


Since r = 1, every element X, of X exceeds unity, and A, 2 1; hence the last 
displayed relation, and therefore the case n = 4 of (11.5), is true. Suppose 
n> 4. Since d¢ is a positive integer and X; S Xz S --- S X,, we shall es- 
tablish (11.5) if we prove that P > [1 + (n — 3)/X;,,] or, indeed, that 2” > 
n—1. This inequality is true for n > 3 and therefore for the cases under con- 
sideration. Consequently (11.5) and its equivalent (11.2) hold. 

Proof of (11.3). We shall obtain the desired result by establishing the 
inequalities 


(11.6) Ziae — Banas > Zoned! + Be-aa> 
(11.7) Be iosTena,1 — Se-asBa-n0 > Be-n0-c%e-ns- 


25 As in the corresponding proof of §4, we suppose k > r+ 2. The case thus omitted is 
easily handled as was explained just below (4.18). 
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Proof of (11.6). Using the identity 2/_.. = 213.2 + X7'Di_,,, in (11.6), 
and then reducing all terms of the resulting relation to a common denominator, 
we find that (11.6) is equivalent to 


(11.8) Pan ct) > P + Booch). 


Since n = s + 2 = 5and every element in P has a value > 1, each factor in the 
left member of (11.8) exceeds 2. Consequently (11.8) holds. 
Proof of (11.7). It is equivalent to 


SD) Bic ARs AD) — Boca AX sc. At) > Ba.2cc%). 


If n = 5, which is the smallest admissible value of n since n 2 s + 2, (11.9) is 
equivalent to 


(Xi, + Xi, + Xi)(Xi, + Xe) — (AX Xi, + XiXi, + XX) > 1, 
which is obviously true (with X;; > 1). If n > 5, we use the identities 
Enn2u(X) = Zp. u(X) + X,Zpua(X) (w= n—4,n—- 3), 
and find that (11.9) is equivalent to 
(11.10) [ys na(X)P — 2a, na ME ns, n8(X) > Eps, ns(X) - 


Introducing the partition notation (a‘) = =/_, ,(X*), we may write (11.10) in 
the form 


(11.11) (i-y — G{i"4 > Gr4. 


By use of (47) and (48) of I, we find that the left member of (11.11) is not less 
than 


(2"-*) + (2"*1*) , 


which exceeds (1"-*), since the number of variables under consideration is 
n — 3. Hence (11.7) holds. 


Proof of (11.4). Here n = m+ 2 26. Hence we shall obtain the desired 
result by establishing the inequalities 


/ , 


(11.12) Dinsent — Zase-t > Za-a0-2(1 + Z,-3,1)> 
(11.13) a oe _ a > a - ee (2<t<m). 


Proof of (11.12). Reducing all terms of this relation to a common denomi- 
nator and using in the result the identity 


a ee = ae 4 + pe a Fy ’ 


we find that (11.12) is equivalent to 
(11.14) Zi -aa-m—i(X) > 1+ P12, (KX), 














MAXIMUM NUMBERS ASSOCIATED WITH SYMMETRIC EQUATIONS 335 


where P is as in the last paragraph above. No term in the left member of 
(11.14) is smaller than X,, --- X;,_,,_. and no term in Dis, n-a(X) is larger 
than X;,--- Xj,_,. Hence (11.14) is true if 

n—m— 1 


( Rae x, ore es 
(11.15) 


(Xi > 1ljn >m+2 26). 
The left member of (11.15) is not less than 2(n — 3) and the right member is 
not greater than (nm — 1)/2, and so (11.15) holds. 

Proof of (11.13). Reducing the terms of (11.13) to a common denominator 
and employing in the result certain identities analogous to those just above 
(11.10), we find that (11.13) is equivalent to 

ae a n—t-1(4) —_ oe! ee ) > ee ’ 
or, in partition symbols, to 

(1*-"-")(1*-*-!) os, (1*-*)(1*-*-*) > (1*-*-*) 
(2<timn =>=m+2 26). 


(11.16) 


We let s, = n — t — 1, ss = n — m — 1 and use the following relation which 
can be obtained from (47) and (48) of I: 


(11.17) (1%) — (a"*)() & Qa) 4 (Qt), 


In order to establish (11.16), it suffices to prove that the right member of (11.17) 
exceeds (1"~'). By hypothesis, the number of variables under consideration 
is (n — 3) > 8s. Hence neither of the terms in the right member of (11.17) 
vanishes. Consequently, in the cases where s. = 8, 8s: = 8 — 1, and 8s, = 1, 
this right member is readily seen to exceed (1*~'). Suppose, then, that 1 < s2 
< s, — 1; we shall prove that 


(11.18) (2"1°-") > (1). 


Consider the terms of each of the symbols of (11.18) which involve only the 
variables of some particular set of s; variables (of the n — 3). The numbers of 


such terms in the left and right members of (11.18) are ne and 8, respectively. 
2 


Since 1 < s2 < 3, — 1, (*) > s,, and every one of the s, terms in question of 
2 


(1"~) is less than the minimum term in question of (2"1"°~"). By considering 
in this way every set of s; variables from the n — 2 [and thus counting certain 
terms of (1"~') more than once, since n — 3 > s,], we find that (11.18), and 
therefore (11.17), holds. . 

(ii) s >r > 1. We find it desirable to treat this case in the following parts 
(with p = n — 1 in each part): (1) r = 2, s = 3; (2) r = 2, s = m > 3; (3) 
s>r>2. 
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(1) r=2,s=3. In this case \, = A: = 1, and with p = n — 1 inequality 
(11.1) is 


’ , aq! , 
[2-21 rn-3,1 + A3(Zn~2,2 ae 2 n-3,2) + A3 (Zao, 22 n-g,1 


(11.19) 


, , , , , 
— 2 n-2,12 0-3,2)] > Z n-2, 2-22 a-3,1 + AsZn—3,2)- 


To prove (11.19), it suffices to show that (11.7) holds here and that 


(11.20) Baas — 3.00 > BnceePaas- 


The proof of (11.7) given above applies here. Inequality (11.20) is readily 
found to be equivalent to P = X;,--- X;,, > 1. This is true sincen —3 2r 
= 2andP = X,.--- X, 22. 

(2) r=2,s=m>3. Thecase p = n — 1 of (11.1) now reduces to 


, , , , 2 , mm 
[2-21 — Zno31 + A3(Z n-2,2 ~~ = 3,2) + (A3 - A) (Zn-222 0-3, 


: ; 5! a af , 
(11.21) naar na) + +++ + AeXm(Z nama — Dn—smar) + AwAs(Z n—2,m—t> n—3,1 
: r 4 2 , hd , , 
= Sask.) > -°* + Me enntke-tnt — Sn-t-dbatn-l 
, , , , P 
> 2 n-2,n-2(2 n—3,1 + As= n—3,2 oe NZa-s,3 a eee + Am 2 n—3,m—1)° 


Since AmAs 2 Ay (¢ = 2, --- , 8) and Az = 1, in order to establish (11.21) it 
suffices to show that (11.20) and (11.13) hold here, and the proofs of these 
relations are the same here as they were above. 

(3) s>r>2. Thecase p = n — 1 of (11.1) reduces to 


[A + WAZi ass Banas _ ee + ee ee 
(11.22) — i +--+ + | a a a a 
> ee a + ew + oo en + =, + er 5 


where A is the sum of those terms of (11.1) that are obtained here by assigning 
to i values > r — 2and <s—1. That the sum of the last s — r + 1 terms of 
the left member of (11.22) exceeds the right member of this inequality can be 
shown by comparing the coefficients of \,,,; and \, in the left and right mem- 
bers, respectively, of (11.22), and observing that (11.13) holds with m = s and 
t=r,---,swhenr > 2. 

Second step. Let An-2,r1, Bp-2,r-2, An-2.r-2, Bp-2,,-1 be the first, second, 
third, fourth bracket, respectively, in (11). Since we know that (11) is true for 
p = n — 1, we know that 


(11.23) By-2,r-2(Br-2,r-1)7? > An—2,r-2(An—2,r-1) 


is true fork = n — 1. Assuming (11.23) to be true for any value of k except 
the smallest value of p that we admit, namely p = iq if 1g > rand p = q+ 1 
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if iq = r, we prove that the relation which results when k is replaced by k — 1 
in (11.23) is also true. In order to do this, it suffices to prove that 


By-s,r-2(Be-s,ra)! 2 Bre, --2(Be-2.-), 


which is true since the case p = k — 1 of (7.1) is a true inequality. 


12. Our most general result relative to equations of the form (1.5). The 
methods referred to just before Theorem 8 suffice to prove the following 

TueoreM 12. This is obtained from Theorem 8 by merely substituting in 
Theorem 8 the ¢,.(1/x) of (10) for the ¢,(1/x) of (6). 

Corotiary 12. The Kellogg solution of the equation 


Zn,(1/z) + 2Zn,2(1/%) + 3En,0(1/2) + Zn,n(1/z) = 5/11 (n> 4), 


is x = w, where w, differs from the w; of Corollary 8 only fori = n. Here w, 
exceeds the w,, of Corollary 8 by 11. 


Part 4. Further results of several different types 


13. Application of the theory to a special equation different in form from 
both (1.3) and (1.5). In §1, just before describing the new features of the 
present paper, we indicated the nature of the requirement of consecutive second 
subscripts of the >’s in equations (1.3) and (1.5). That one may be able to 
carry through the procedure for an equation in which n is small and the second 
subscripts of the =’s employed are not as we have required them to be generally 
is shown by the following example. 

Consider the equation 


(13) Zs,2(1/x) + Zsa(1/x) = b/a, a = [(c + 1)b — 1). 


The Kellogg solution w of (13) can be obtained by properly specializing equation 
(5). 

We indicate our procedure for (13) as follows. Let X be an E-solution of (13) 
different from w. Suppose that X,... 4 contains at least one element of class A 
and at least one of class B. Suppose also, for the present, that gq, = 1, 1g = 2. 
Then let X be transformed into X’ in such a way that 


Xi = (Xi-0) 2m, X; = we, X; = X; (¢ = 3, 4, 5), 


while both X and X’ satisfy (13). We wish to prove the analog here of the 
case f = X of (4.8): 


(13.1) 2Zp,2(1/X’) + Zp, 4(1/X’) S Zp, 2(1/X) + Zp, a(1/X) (p = 2,3,4). 
According to the definition of the transformation, the case p = 2 of (13.1) is 


(Xj we) = (X;X,)" S (XX), 
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whose validity follows from the definition of our transformation and Lemma la 
of I. For p = 3, 4 the analogs in question of (4.8) are readily found to be 


(XsXiX5 + Xs + Xe + Xs)(XsXe + XXs + XiXs + 1) S Xs, 
(XsXaX5 + Xs + Xa + X5)(XsXq + XsX5 + XiXs + 1) 
< (X3X4 + 1)(X; + ag, 


respectively, and both of these relations are readily found to be true, since X is an 
E-solution of (13). Had we taken (q, 1g) to be any other pair of the numbers 
1, 2, 3, 4 with 9g: < 1q, similar results would have been obtained. It now follows 
that the Kellogg solution w of (13) has the remarkable properties. 


14. A new generalization of Kellogg’s problem.” Let a, m be any posi- 
tive integers and let n be an integer > 1. Then the equation which Kellogg 
considered is the case m = a = 1 of the following equation: 


(14) D(e+4t- +5)-3- 
The Kellogg solution of (14) is z = w, where w, = a + 1, 

Wp = wT --- wp +1 (p=1,---,n—2), 
and w, is the positive number (greater” than w,_:) which satisfies the equation 
(wrt* — wt) (wt — 1) = alu --- waa). 

We state without proof that the methods employed in I for the equation 
(14.1) Ln,1(1/z) = b/a, a =[(e+ 1)b— 1], 


suffice to show here that our solution w of (14) has the remarkable properties. 

Remark. If we attempt to find the Kellogg solution, W, of the equation 
L = b/a, where L is the left member of (14) and a is as it is defined in (14.1), we 
do not find that 


p 
1 l 1 _ (Wi --- W,)™—1 
D(Ftmt i +2)- a(W, >>> Wy) 


i=1 








for p = 1 unless m = 1. Consequently we can not apply our usual method of 
attack in this case. We have found no method that suffices here. 


15. An application of a result in Part 2 to convergence of a type of series. 
From the proof that every EZ-solution of (1.3) as conditioned by (1.4) is a set ¢ 


2 Cf. O. D. Kellogg, loc. cit., footnote 4. 
27 We leave the proof of this inequality to the reader. It may be established by sub- 
stitution of w’, where w, = w; (i = 1, --- , nm — 1) and wa = wy: in (14). 
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[ef. (6.3) and the definition of a set o in §6], we observe the following fact. 
Among all infinite series with m-th term equal to 


r+m—-1 


Us = (Zr40-1) pe AjZr4+m—2, j-1(1/2) ’ ,=1, i 


oP 


Ill 
—) 


(i>~s), 


where the z’s are positive integers such that U; + U2 + --- + U, < (b/a), 
a = [(c + 1)b — 1], for every positive integer p, there is no series which con- 
verges to (b/a) more rapidly than does the one that is obtained by letting n 
increase indefinitely in (1.3) and then taking x; = w; (i = 1, --- ,m — 1), where 
w is the Kellogg solution (5) of (1.3) when the )’s in these equations satisfy (1.4). 


16. The Kellogg solution of the cyclo-symmetric analog of (1.3). Let 
n, r, s be integers such that n = s > r 2 1, and let C; (1/z) stand® for the j-th 
elementary cyclo-symmetric function of the n reciprocals 1/2, --- , 1/z,. We 
consider the equation 


a6) CAA/r) + AraiCrai(1/x) + AvgeCry2(1/x) + --- + 2,C,(1/x) = b/a, 
a=([(e+1)b-1), 


in which the )’s are integers 2 0. The Kellogg solution of (16) is z = w, where 


(wp = 1 (p=1,---,r—1); 
wW=c+1; 
P 
(16.1) { Wp = >) AAA +++ Wpj + 1 (p=r+1,---,n—1); 
j=r 
a 7 
ae = b> b> AAMOWi41 ++ - Wa-1-j4i » 





j=ri=1 


and A, = 1,4; = 0(s <j <n), we--: Wen = 1. 

The solution w defined by (16.1) has certain properties which for the case 
4; = 0(¢ = r+ 1, --- , 8) were discussed in the article referred to in footnote 28. 
We have not been able to prove or disprove that this solution has the remarkable 
properties relative to equation (16). 


NORTHWESTERN UNIVERSITY. 
28 The function C,(1/z) was the left member of the equation considered by Simmons in 


an article On a cyclo-symmetric equation, American Mathematical Monthly, vol. 36 (1929), 
pp. 148-155. 











\-DEFINABILITY AND RECURSIVENESS 
By S. C. KiLEENE 


1. Introduction. In Kleene [2]! a theory of the definition of functions of 
positive integers by certain formal means is developed in connection with the 
study of a system of formal logic.2 The system of formal logic is shown in 
Kleene-Rosser [1] to be inconsistent; however, the theory of formal definition 
remains of interest, both for its use in a new system of formal logic proposed by 
Church in [3], and for its connection with questions of constructibility and 
decidability in number theory.* Hence it seems desirable to bring together the 
essentials of the theory, and to develop them from a somewhat new point of view, 
in which the emphasis is on the connection with the recursive functions. In 
this presentation, no knowledge of systems of formal logic is presupposed, but 
use will be made of a few results of the intuitive theory of recursive functions.‘ 

It is found convenient here to treat the functions as functions of natural 
numbers, rather than of positive integers. This change can be regarded as a 
change merely in the notation. 

The theory deals with a class of formulas composed of the symbols {, }, (, ), 
, [, | and other symbols f, x, p, --- called variables or proper symbols, where 
f, x, p, «++ is a given infinite list. 

A formula is called properly-formed if it is obtainable from proper symbols by 
zero or more successive operations of combining M and N to form {M} (N) or 
\x[M], where x is any proper symbol. An occurrence of a proper symbol x in 
a formula F is called bound or free according as it is or is not an occurrence in a 
properly-formed part of the form \x[M]. By a free (bound) symbol of F is 
meant a proper symbol which occurs in F as a free (bound) symbol. A formula 
shall be well-formed, if it is properly-formed, and if, for each properly-formed 
part of the form \x[M], where x is a proper symbol, x is a free symbol of M. 

Heavy-typed letters will henceforth represent undetermined well-formed 
formulas under the convention that each set of symbols standing apart in which 
a heavy-typed letter occurs shall stand for a well-formed formula.’ As abbre- 


Received July 1, 1935; presented to the American Mathematical Society, September 13, 
1935. 

1 The numbers in brackets refer to the bibliography at the end. 

2 Use is made, directly or indirectly, of Church [1]-(2], Kleene [1], Rosser [1], Curry 
{1]-[3], Schénfinkel [1]. 

3 See Kleene [2] p. 232, Church [4], and Church-Kleene [1]. 

‘In writing this paper, I have profited from discussion of the subject with Dr. J. B. 
Rosser, and I also thank him for assistance with the manuscript. 

5 A detailed analysis of the structure of well-formed formulas, and of the implications 
of this convention, is given in Kleene [1] §§2, 3. The term ‘‘proper symbol’ was intro- 
duced in place of ‘‘variable’’ in order to save the latter for use in another meaning in con- 
nection with the formal logics under consideration. 
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viations, we shall write {F}(A;,---, A,) or F(A, --- , An) instead of 
{ +--+ {F}(Ai) --- }(A,), and Ax; --- x,-M instead of Ax,[ --- Ax, [M] --- ]. 
Sik, ...x,M | shall denote the result of substituting A; for each of the occurrences 
(if any) of x;in M (¢ = 1, --- , ). From time to time we assign individual 
symbols to stand as abbreviations for particular formulas, indicating this by an 
arrow —, as 


[— Mf -f ’ J— Afryz ‘f(z, Sf, y)) * 


We introduce an equivalence relation A conv B, or A is convertible into B, be- 
tween well-formed formulas, which is defined to be the relation of least domain 
which is (1) reflexive, (2) symmetric, and (3) transitive, and has further the 
properties (4) if A conv B, then {C} (A) conv {C}(B), {A}(C) conv {B}(C), 
and \x[A] conv Ax[B], (5) if the proper symbol y does not occur in A, \x[A] conv 
S}Ax[A]|, and (6) if x and the free symbols of N are not bound symbols of M, 
{Ax[M]}(N) conv SyM]}.¢ 

If {F}(N) is interpreted as representing the value of F (considered as a func- 
tion) for N as argument, and \x[M] as representing the function which M is of 
x, then the equivalence relation A conv B corresponds to a relation of equality 
in meaning.’ The analysis of the relation A conv B given in Church-Rosser [1] 
can be regarded as furnishing a demonstration of the consistency of the system 
under these interpretations: A formula A which has no part of the form 
{Ax[M]} (IN) is said to be in normal form, and to be a normal form of any formula 
A convertible into it. According to Theorem 1, Corollary 2, if A has a normal 
form, the latter is unique to within the choice of symbols used in it as bound 
variables.’ 

Evidently, a demonstration that A conv B is given by passing from A to B by 
successive substitutions (on individual parts of a formula not immediately 
following A) of (a) C for D (or inversely), where D conv C is known, and (b) 

NM| for {Ax-M}(N) (or inversely), changing bound variables when necessary 
to avoid confounding variables that should be distinct or to reach a desired 
formula. 

The substitution Sy,.:n%_M| for {Ax; --- x,-M}(Ni, --- , N,) is equivalent to 
a series of the substitutions (b). Indeed, from the interpretations of {F}(N) 
and Ax|M], it follows that the expression which we abbreviate to F(N,, --- , N,) 
represents the value of F (considered as a function of n variables) for the set of 


6 (1) and the clause ‘‘{A}(C) conv {B}(C)’’ of (4) are redundant. The present definition 
is equivalent to the former one, according to which A conv B whenever B is derivable from 
A by certain rules I-III, the derivation being called a conversion (cf. both Church [1] and 
Kleene [1]). 

7A relation, rather than the relation, since, for example, it can be maintained that 
Afz-f(x) and \f-f have the same meaning. 

8 The notion of the normal form of a formula under conversion was originally introduced 
by Church in lectures at Princeton in the fall of 1931. 
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arguments N,, --- , N,; and the expression which we abbreviate Ax; - -- x,-M 
represents the function which M is of x, --- , X,.° 

We have specified a class of formulas (the well-formed formulas) and an 
equivalence relation between formulas of this class (the relation of intercon- 
vertibility). We bring the natural numbers into relation with this subject- 
matter by selecting a progression of well-formed formulas 


'ax-f(z), Mx-f(f(z)), Mx -f(F(f(x))), --- 


to “represent” or ‘“‘be identified with” the natural numbers in our symbolism. 
This is recognized in the notation by assigning 


0 — Afx-f(z), 1 > Afz-f(f(x)), 2 — dfe-f(f(f(z))), --- - 


It may now happen, for a non-negative integral function L(m, --- , 2,) of 
natural numbers, that there are well-formed formulas L which automatically 
represent the function L(x, --- , Zn), on the basis of our equivalence relation 
and our interpretation of F(N;, --- , N,). That is, there may be formulas L 
such that, whenever x:, --- , X, represent natural numbers 2, --- , Zn, respec- 
tively, L(x, - -- , Xn) is convertible into the formula which represents the natural 
number L(m, --- ,2%n). In this case, we shall say that L(x, --- , 2,) is 
(formally) defined or d-defined by L. 

Thus a problem arises: what functions L(x, --- , 2.) are \-definable? We 
have at once that the successor function is \-definable, since 


{Apfx-flo(f, x))} (Afx-f(--- n+1 times --- f(x) ---)) conv Afx-f(--- n+2 
times --- f(z) ---) (n= 0,1,2,---). 
Accordingly let 
S — refx -flolf, x)). 


The identity function of a natural number is also )-definable, since the formula 
which we have called J has the property J(x) conv x. 

The problem has arisen from the point of view in which interconvertible 
formulas are regarded as equivalent. Hence we should consider whether the 
representations involved are unambiguous from this point of view. Let us call 
a representation of a class of mathematical entities by well-formed formulas 
well-founded if interconvertible formulas cannot represent different entities of the 
class. It follows from the above-mentioned consistency proof (Church-Rosser 
{1]) that the given representation of natural numbers by well-formed formulas 
is well-founded; this in turn implies that such representation of non-negative 
integral functions of natural numbers as \-definition yields is well-founded.” 


® This device for expressing functions of several variables in terms of functions for one 
variable goes back to Schénfinkel [1]. 

10 Non-interconvertible formulas may represent the same entity of a given class, and a 
formula may represent entities of different classes, e.g., the formulas abbreviated J and 0 
both represent the identity function of a natural number, while the latter also represents 
the natural number 0. 
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The problem is a special case of the larger problem: what functional relation- 
ships among well-formed expressions can be expressed by well-formed formulas? 

We shall say, generally, that a function LZ which associates well-formed 
formulas with finite ordered sets of well-formed formulas is (formally) defined or 
d-defined by L if for each finite ordered set Ai, --- , An, for which L is defined, 
L(Ai, --- , An,) is convertible into the value of the function L for the set 
A,, --- , A,, of arguments; and we shall understand, by the \-definition of a 
function of which the arguments or the values are other mathematical entities, 
the A-definition of a function which corresponds under the representation of the 
mathematical entities by well-formed formulas (in case a representation has 
been specified) ."' 

In this paper we restrict ourselves (except incidentally) to the case of the 
larger problem in which the independent variables are fixed in number, and 
range over the natural numbers. The subcase of the problem in which the 
values are also natural numbers (i.e., the problem first proposed) we treat in 
§§2, 3 by proving that all recursive functions, in a wide sense of the term recur- 
sive, due to Herbrand and Gédel, are \-definable ; and conversely, all \-definable 
functions of the type in question are recursive. In §$§4, 5 it is shown that, using 
the term recursive in an extended sense, these results can be generalized (under 
additional hypotheses) to the case in which the values are any well-formed 
expressions. The extended sense of the term recursive is obtained by assigning 
numbers to the values, by the Gédel method, and requiring that they be a re- 
cursive function of the arguments in the first sense. 

The formulas Afx-f(x), \fx-f(f(x)), --- were originally coérdinated with the 
positive integers 1, 2, --- (Church [2], p. 863). That is a suitable course to 
follow in developing number-theory (Kleene [2]). In this paper, for technical 
reasons, we are using instead the correspondence established above between 
those formulas and the natural numbers 0,1, --- . Because of this, the concept 
of \-definability of a function is altered. But, for the interpretation of the 
final results, one can easily go back to the original notion of \-definability. 
Since the “natural numbers’, “0”, “1”, --- enter into our definitions of - 
definable function and recursive function in the réle of a progression, it is only 
necessary to rename them “‘positive integers”’, “‘1’’, “‘2’’, - - - in those definitions. 
Or one may use the following relation: A positive integral function $(y:, --- , Yn) 
[well-formed function (yj, --- , yn)] of positive integers yi, --- , Yn is \-definable 
in the original sense if and only if the function ¢(z, + 1,---,2,+1)-—1 
[&(2, +1, --- , 2, +1)] of natural numbers 2, --- , 2, is A-definable in the 
present sense.” 


11 The A-definition of a sequence Ao, Ai, --- shall mean the d-definition of the function 
which A; is of 7, and the \-enumeration of a class shall mean the A-definition of an enumera- 
tion (with or without repetitions) of the members of the class. 

12 Under the Church representation of the positive integers, the formula of n denotes the 
operation of applying the n-th power of a function to an argument, and exceedingly simple 
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2. Recursive non-negative integral functions. In the \-notation, the defini- 
tion of a function by substitution is immediate: 


(1) {Ax, a? x,-G(Ai(x, 2 Sill Xn), sietiay, H,,.(x:, te Xn))} (Xi, ee X,,) conv 
G(Hi,(X:, re X,.), 7 Oe H,,.(X, Pees ae X,,)).¥ 


When an italic letter denotes a number, the same letter in heavy type shall 
denote the corresponding formula. Our remark that S \-defines the successor 
function of a natural number can now be written thus: 


(2) Ifx + 1 = z, S(x) convz (x = 0,1, --- ). 

In view of the form of x (x = 0,1, --- ): 

(3) x(F, A) conv F( --- 2+ 1 times --- F(A) ---) (x = 0,1,---). 
(4) x(1) conv I (x = 0,1,---). (I(A) conv A.) 


By use of (4): 


(5) {At-t(7, 0)} (x) conv 0 (x = 0,1,---). 
(6) {Au tee t,-&(7, eee, t, (I, t;) tee )} (x, te » ted conv Xx; (x1, coe 5 2y = 
0, 1, --- >. 


\-definitions of addition, multiplication, and exponentiation, due to Rosser (Kleene [2] 
pp. 160-164), are possible. 

If that representation is extended by adding \fz-2z(f) to represent 0, the resulting formal 
definition of functions of natural numbers is equivalent to the one of this paper in re- 
spect to the results we have summarized. 

If the Church representation is extended by the natural method of using the class of 
properly-formed formulas instead of the class of well-formed formulas, modifying suitably 
the relation conv, and letting \fz-z represent 0, simplifications are afforded in the proofs 
of many theorems, but unfortunately difficulties are introduced in the formal logics in 
which this theory is used. Rosser has shown that the formal definition (A-K-definition) 
under this program is equivalent to \-definition, when the range of the independent varia- 
bles is the set of natural numbers, and all the values have the same free symbols. For 
functions over all well-formed formulas, \-K-definition is not equivalent to \-definition, 
but we conjecture that the equivalence holds for many other significant ranges of the 
independent variables (such as functions of natural numbers, functions of functions of 
natural numbers, ---, with values in the same range, and ordinal numbers represented 
by well-formed formulas as in Church-Kleene [1]), and fails only for very heterogeneous 
ranges. 

The formal definition which is obtained from that of this paper by using the [\-5-]con- 
version of Church [3] is likewise equivalent to \-definition, when the range of the independ- 
ent variables is the set of natural numbers and the values do not contain 6. 

13 Here we assume that x,, --- , x, do not occur in G, H;, --- , H, as free symbols; and, 
in general, when a heavy-typed letter represents occurrences of a proper symbol in a for- 
mula, we suppose the only occurrences of the symbol in the formula to be those appearing 
explicitly, unless the contrary is implied by the original convention concerning heavy-type. 
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lor the moment we abbreviate |Aporfgha-p(f, o(g, r(h, a)))} (x, y, z) to [x, y, zl, 
\Nef-e(f, I, 1)}(K) to Xi, thof-o(7, f, 1} (K) to Xe, [Aof-p(I, I, f)}(K) to Xs. 


(7) [x, y, zl conv x, [x, y, Z]2 conv y, [x, y, z]s conv z (x, y, z = 0,1, ---). 
If § — Ap-[p2, ps, S(os)] and 3 — [0, 0, 0], then, using (2) and (7): 


(8) HQ) conv [0, 0, 1], F(KQ)) conv [0, 1, 2], FF FB))) conv [1, 2, 3], 
B(F(F(FQ)))) conv [2, 3, 4], ---. 


Hence, letting P — dp- e(§, 3):, and using (3), (7) and (8): 
(9) Ifx =z+ 1, P(x) convz(x = 1,2,--- ). P(0) conv 0. 


Now let + — dAuv-r(P, S(u)), and abbreviate {+} (x, y) to [x] + [y] (omitting 
brackets when no ambiguity results). By (3) and (9): 


(10) If x2 yand x — y = z, x+y conv Z; if x S y,x~y conv0 (z,y = 0,1, ---). 
Let min — Ary-y + [y > 2]. 
(11) If x S y, min (x, y) conv min (y, x) conv x (x, y = 0,1, --- ). 


We call a formula constructed out of J, J and proper symbols by zero or more 
successive operations of passing from F and A to {F}(A) a combination, and the 
individual occurrences in it of J, J and proper symbols which enter in the course 
of this construction its terms. Let T — J(I, I), so that T conv Afx-x(f). The 
reader may verify that 


(12) J(T, A, F) conv dx-F(x, A), J(T, A, J(I, F)) conv dx-F(A(x)), J(T, T, JU, 
I(T, T, I(T, A, J(T, F, J))))) conv dx-F(x, A(x)). 


If C is the proper symbol x, J is a combination convertible into Ax-C. If C isa 
combination of the form F(A) and has x as a free symbol, then x is a free symbol 
either (a) of F but not of A, (b) of A but not of F, or (ec) both of Fand of A. In 
ease (a), if F° is a combination convertible into Ax-F, then by (12) J(T, A, F°) 
is a combination convertible into Ax-F°(x, A) and hence into \x-F(A) or Ax-C, 
and similarly in cases (b) and (ce). Thus, by induction on the number of terms 
of C: 


(13) Lf x is a free symbol of the combination C, there is a combination C° such that 
C° cond dx-C. 


A proper symbol is a combination; if F’ and A’ are combinations convertible 
into F and A, respectively, F’ (A’) is a combination convertible into F(A); if R 
has x as a free symbol, and R’ is a combination convertible into R, then R’ has 
x as a free symbol (interconvertible formulas have the same free symbols), and 
by (13) there is a combination R” convertible into \x-R’ and hence into Ax-R. 
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Thus, by an induction corresponding to the process of construction of a well- 
formed formula: 


(14) Given A, there is a combination A’ such that A’ conv A.“ 


Let H — do-o(I, I, I, 1). Given a formula A having no free symbols, there is 
by (14) a combination A’ convertible into A. A’ has no free symbols, and hence 
its terms are I’s and J’s. Let S’,[p) A’| denote the result of replacing each term 
T of A’ by y(T), and let C > dy-S’ty) A’|.. Then C(J) conv 8’)fp) A’| conv A’ 
conv A; and C(#/) conv S’acr) A’| conv 87 A’| conv I. 

(15) If A has no free symbols, there is a formula C such that C(I) conv A and 
C(H) conv TI. 


Let B_, > Apryz-p(z, z, y), Bo > Apryz- p(y, x, z), Bi > Apxyz-p(z, y, z). 
(16) B,(B_,) conv B_,, B_1(B1) conv Bo, B_,(B_1(B:)) conv B,. 


We now adopt the notation —1—> dfz-x(f). Given formulas A_;, Ao, Ai having 
no free symbols, there are by (15) formulas C; such that C,(J) conv A; and 
C,(H) conv I (¢ = —1,0,1). Then An-n(B_4, Bi, \abe-b(a(H, c(H))), Co, Ci, C1) 
has the properties of F in the following: 

(17) If A-s, Ao, Ai have no free symbols, there is a formula F such that F(i) conv 
A; (¢ = —1,0, 1). 


If % — Ap-p(0, 1), then, using (3) and the relations 0(1) conv 1 and 1(0) conv 0: 
(18) (nm) conv 1 (n = 0,1, --- ). A(—1) conv 0. 
If F has no free symbols, there is by (17) a formula B such that B(—1) conv 


B(0) conv J and B(1) conv Abz-F(z, A\p-b(A(p), b, p)). Then Ap-B(A(p), B, p) 
has the properties of L in the following: 


(19) If F has no free symbols, there is a formula L such that L(x) conv F(x, L) 
(x = 0,1, --- ) and L(—1) conv I. 


Given formulas G and H having no free symbols, choose K by (17) so that 
K(0) conv Ayf-y(f(—1), G) and K(1) conv dAyfxe2 --- t,-H(P(y), f(P(y), t2, --- , 
Xn), L2, +++, Ln), and let F > drAy-K(min(y, 1), y). Then the L given by (19) 
for this F satisfies the following: 


(20) If G and H have no free symbols, there is a formula L such that 
L(O, x2, --- , Xn) conv G(X, --- , Xn) and L(S(y), X2, --- , Xn) conv 
H(y, L(y, x2, ‘iia » Xn), Xe, attic » Xn) (y, 22, coe kn = 0, 1, shin: }. 


Choose K by (17) so that K(0) conv Afyr-r(y, f(—1), y) and K(1) conv 
Myr -f(r(S(y)), Sy), r), let F — Ax-K(min(z, 1)), and choose L by (19) for this 
F. Then L(x, y, r) conv L(r(S(y)), S(y), r) (2 = 1, 2, --- ) and, if r(y) conv 


‘4 This theorem derives from Rosser [1], and the present proof of it from Church [3]. 
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z, where z is a natural number, L(0, y, r) conv y. Hence, letting e, — 
Ara, --+ Ln-L(r(ai, --- , Ln, 0), 0, r(ai, +--+ , Zn)): 


(21) If r \-defines a non-negative integral function p(x, --- , 2a, y) of natural 
numbers such that (m1, --- , tn)(Ey)[p(m, --- , tn, y) = O], then e,(r) A-defines 
eyle(x, 79% 5 iy y) = 0). 


According to Kleene [3] IV every function of natural numbers recursive in the 
general Herbrand-Gédel sense (see [3] Def. 2a or Def. 2b) is expressible in the 
form ¥(eylp(z1, --- , Zn, y) = 0]), where ¥(y) and p(x, --- , tn, y) are primitive 
recursive ([3] Def. 1) and (x, --- , 2n)(Ey)[e(a, --- ,2n, y) = 0]. In view of 
(1), (2), (5), (6) and (20), every primitive recursive function is \-definable; 
and therefore, from (21), every general recursive function is \-definable. 


(22) Every non-negative integral function of natural numbers which is recursive in 
the Herbrand-Gédel sense is \-definable. 


(19) constitutes a schema for circular definition. Given any set of conditions 
of dependence of an entity L(x) on the variable natural number z and on L 
itself, if the set can be expressed in the A-notation by a formula F, a formula L 
satisfying the conditions in terms of the equivalence relation A conv B can be 
found.” To do this it need not be known that the conditions actually deter- 
mine a function L(x). Further analysis of this situation (Kleene [2] §18) shows 
that to each problem of a large class, which includes many famous unsolved 
problems (such as the Fermat problem and the 4-color problem), there is a 
formula P such that whether P has a normal form is an equivalent problem. 


3. \-definable non-negative integral functions. We now set up a represen- 
tation of the well-formed formulas by natural numbers, by the Gédel method. 
The symbols which occur in well-formed formulas we number thus: 


A...1;{, 6 [... 11; }, ), ] ... 18; the t-th proper symbol ... piss 


(p; = the i-th prime number), and we order numbers to formulas (considered as 
finite sequences of symbols), finite sequences of formulas, ete., on the basis of 


the correspondence 7, 2, --- , 2, to p;' p>’ --- py" between finite sequences of 
numbers and individual numbers (m, me, --- ,% > 0). Using the methods 
1 Read (21, ---, tn) “for all z:, ---, z,’’, (Ey) ‘‘there is a y’’, ey[R(y)] “‘the least y such 


that R(y) (Oif there is no such y)’’. 

16 A formula which \-defines a non-negative integral function of natural numbers has no 
free symbols. 

17(17) can be used in the selection of F, if cases are distinguished in the form of the 
dependence of L(z) onzand ZL. (4) and the clause L(— 1) conv J of (19) can be used when- 
ever under a given case L(x) is independent of either or both of rand L. These devices are 
illustrated in the proofs of (20), (21) and (24). 

18 The distinction in well-formed formulas of three species of parentheses is unessential, 
since the species of each parenthesis can be determined from its situation. 
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and notations of Gédel [1] pp. 179-182," and starting from 1—5, 7-10 of his list 
(p. 182) and 6, 11-18 of Kleene [3], we define the additional primitive recursive 
functions and relations 19-42: 


19. Z(0) = 2-3%7-5"-7-11%- 134.174. 19". 23.29" .31'9. 378.418. 43%, 


11 + 4n 

Z(n + 1) = Su Z(n) PE 5 + 

Z(0), Z(1), --- are the numbers corresponding to the formulas 0, 1, --- . 
20. Num(zr) = (En)[n S$ tx & & = Z(n)]. 

x corresponds to one of the formulas 0, 1, --- . 
21. Z-(x) = en[fn Sx & zx = Z(n)). 

If x corresponds to Afz-f( --- n + 1 times --- f(x) --- ), Z-“(x) is n. 
22. PS(x) = Prim(x) & x > 138. 

x is a proper symbol. 


23. PFR(z) = (n){0 < n Ss U(x) > (Ev)[v Ss x & PS(v) EnGle = 
R(v)] V (Ep, 0 < p,q <n & {n Gl x = E(p Gl x)*E(q Gl zx) 
V{n Glxz = R(1)«[p Gl z)*E(q Gl xz) & (Ev)[v s x & PS(v) & 
pGlx = R(v)]}}]} & U(x) > 0. 
x is a sequence of formulas of which each is either a proper symbol or is 
compounded out of the preceding ones by the operations { } ( ) and A []. 


24. PF(x) = (En){n S (Pr{l(z)*])*"@* & PFR(n) & x = [l(n)] Gl n}. 
x is a properly-formed formula.” 
25. v Geb n, x = PS(v) & PF(x) & (Ea, b, cla, bh cS et & & = 
a*R(1)*R(v)*E(b)*c & PF(b) & Ua) < n S Ua) + l(b) + 4]. 
The proper symbol v is bound at the n-th point of the properly-formed formula z. 
26. v Frn, x = PS(v) & PF(xz) & v = nGle&n Ss U(x) & v Geb n, z. 


The proper symbol v occurs as a free symbol at the n-th point of the properly- 
formed formula z. 


27. v Geb x = (En)[n S U(x) & v = nGlz& v Geb 2, ZI. 
The proper symbol v occurs in the properly-formed formula x as a bound symbol. 
28. v Fr xz = (En)[n Ss U(x) & v Fr n, 2). 
The proper symbol v occurs in the properly-formed formula z as a free symbol. 
29. WF(x) = PF(z) & (n)[n < I(x) & (n+ NGlz =1-— (Ep,gq,r){p,94,7r S 
z& x = p+R(1)+R[(n + 2)Gl zl*E(q)er & Up) = n & PF(q) 


& [(n + 2) Gl a] Fr q}). 
z is a well-formed formula. 


19 The possibility of defining number-theoretic functions by means of recursion was 
expounded in Skolem [1]. In that paper Skolem also showed that restricted existence and 
restricted generality (the restriction by an upper bound) can be expressed by recursive 
functions. 

20 Cf. Gédel [1] p. 183, footnote 35. 
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30. z Imry = WF(z) & {x = y V (Ep, 9,7, 8, 0[p,9q,8,tSrk&r Ss yk&r= 
p+R(1)*R(q)*E(s)#t & PS(q) & WF(s) & PS(r) & r Oces & 
y = p*R(1)*R(r)*E(S(s, g, R(r)))*t] V (Ep, 9g, 7, 8, O 
[p, 9,7, 8, S 2 & & = p*E(R(1)*R(q)*E(r))*E(s)+t & PS(q) & 
WF(r) & WF(s) & q Gebr & (u)[u S$ s& uFrs —u Gebr] & 
= p+S(r, q; s)*t]}. 
zIme y = zImryVy Imrz. 
x Imr y (x Imc y) corresponds to the relation obtained from A conv B by 
omitting (2) and (3) (omitting (3)) in the definition of the latter. 


31. EC(x,m) = 0(R(x), m) for the 6(z, m) given by Kleene [3] I when ¢(n, z, y) = 
elzon+2& {(xImen &2z = n) V (x Imen &2z = 2)}). 
EC(a, 0), EC(«, 1), --- is an enumeration (with repetitions) of the numbers 
y convertible into x (if x is well-formed). 








Now let L be a given non-negative integral function of n natural numbers, and 
L a formula which )\-defines L, i.e., a formula such that, for each set 2, --- , Ln 
of natural numbers, L(x:, --- , x.) conv Afz-f( --- m+ 1 times --- f(x) --- ) 
when m = L(x, --- , 2) and (by Church-Rosser [1], Thm. 1, Cor. 2) only 
then. If 1 denotes the correspondent of L under our representation of well- 
formed formulas by natural numbers, 


A(ai, +--+ ,%n) = E(--- E()*E(Z(x1)) --- )*E(Z(z,)) 


is the correspondent of L(x, --- , x.) (8, 10, 19). Hence, if z., ...., denotes the 
correspondent of the formula Afz-f( --- m + 1 times --- f(z) --- ), there are 
y’s such that EC(A(m, --- , x), y) = Z2,...z, (31). For those and only those 
y’s Num (EC(A(a, --- , 2), y)) holds (20). Hence 


(x, «++, tn) (Ey) Num (EC(A(m, --- , 2), y)) 
and 
Z-(EC(A (a1, «++ , tn), ey[Num (EC(A(n, --- , tn), y))I)) = Z-(es, ...2,) = 
L(x1, «++ , Zn) (21). 
Using Kleene [3]V, the expression on the left is seen to be recursive. Thus: 
(23’) Every d-definable non-negative integral function of natural numbers is 


recursive in the Herbrand-Gédel sense.”4 


4. Recursive well-formed functions. Let L be a function of a fixed 
number n of natural numbers 2, --- , Z,, of which the values L,, ... ., are well- 
formed formulas. Let A(x, --- , 2.) be the function which corresponds to L 
under our representation of formulas by numbers, i.e., the function which the 
correspondent of Lz, ...2, iS ui 21, +--+ ,2%n. Wecail L recursive if A(x, --- , 2) 


*1 This result was first announced by Church. 
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is recursive in the Herbrand-Gédel sense. This definition agrees with the 
former one, when the values of LZ are formulas representing natural numbers, in 
view of the recursiveness of Z and Z~' (19, 21). 

In order that L be )-definable, it is necessary that all the values Lz, ... z, 
have the same set 2:1, --- , Zm Of free symbols. If L is recursive, the function L’ 
whose values are the expressions AZ; - - - Zm-Lz,...z, (which contain no free sym- 
bols) is recursive, since 


\’(a1, ee Zn) - R(1)*R(s)+E( aids R(1)*R(sm)*E (A(x, pra tn)) oo ) ’ 


where 8, --- , 8m are the numbers corresponding to the symbols 2, --- , Zm, 
respectively. Moreover, if L’ is \-defined by L’, then L is \-defined by 


AX, --- Xn-L’ (xi, --- , Xn, Zt, -** » Zm) - 


These remarks reduce the problem of this section (to prove (25)) to the special 
case in which the values of L contain no free symbols. 

In the following 7 and j denote the numbers corresponding to the formulas I 
and J, respectively: 


32. CR(z) = (n) {0 <n Slt) nGlze=ivnGlre=jv(Ep,gl0<p,q<n 
&nGlz = E(p Gl z)*E(q Gl z)|} & U(x) > 0. 
x is a sequence of formulas of which each is either J or J or is compounded out 
of the preceding ones by the operation { } ( ). 


33. Comb (xz) = (En){n < (Pril(x)*])="@ & CR(n) & x = [I(n)] Gin}. 
z is a combination. 


34. C(x) = EC(zx, ey{[WF(x) & Comb (EC(z, y))] V (WF(z) & y = 0}}).” 
If x is well-formed, C(x) is a combination convertible into z. 


35. D(x) = (Ep, Dip, q = x& x = E(p)*E(q) & WF(p) & WF(q)). 
zx corresponds to a formula of the form {P} (Q). 


36. Mi(z) = eplp S r& WF(p) & (Eg)lq S r& xt = E(p)+E(Q)II. 

M.(z) = eqlg S x & WF(Q) & (Ep)lp < c& x = E(p)*E(q))). 

If x corresponds to the formula {P} (Q), M(x) and M(x) correspond to P 
and Q, respectively. 


37. I(z) =z =i. 
x corresponds to the formula /. 


By the )-definition of a relation we mean the d-definition of the representing 
function of the relation (i.e., the function which is 0 or 1 according as the relation 
holds or not). Since a recursive relation is one of which the representing func- 
tion is recursive, recursive relations among natural numbers, as well as recursive 
functions, are \-definable (by (22)). 

Accordingly, let C, D, Mi, Me, I be formulas which A-define C, D, Mi, M2, J, 


22 By (14) and Kleene [3] V, this function is recursive, which is sufficient for our purpose. 
Actually, it is primitive recursive, by Gédel [1] IV, since a primitive recursive bound for y 
is given implicitly by the proofs of (14) and the property of EC(z, m). 


——  —————— 
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respectively. Using (17), choose a formula 2 such that 2(0) conv Axf-x(f(—1)), 
MN(1) conv Azf-x(f(—1), J), and a formula & such that R(O) conv Axf-f(Mi(z), 
J(M2(x))), R(1) conv Azf-N(A(z), z, f); and let B — dArf-K(D(z), z, f). By (19), 
there is a formula @ such that G(x) conv B(x, G) and G(—1) conv J. Then 
@(y) conv I if y corresponds to J, G(y) conv J if y corresponds to J, and G(y) 
conv @(Mi(y), G(M.(y))) if y corresponds to a formula of the form {P} (Q). 
Hence, if y corresponds to a combination Y whose terms are J’s and J’s, @(y) 
conv Y. If zx corresponds to a formula X having no free symbols, C(x) corre- 
sponds to a combination Y of J’s and J’s convertible into X. Hence, letting 
G — rx-G(C(z)): 


(24) If the number x corresponds to a formula X having no free symbols, G(x) 
conv X. 


Now, if the function L is recursive, and if the values L,,...., contain no free 
symbols, there is a formula 1 which \-defines A(x, --- , 2.) (by (22)), and then 
Aa, ++ Xn-G(I(x1, --- , n)) A-defines L. Passing to the general case by the 
means we have indicated: 


(25) If the function L of n natural numbers having well-formed formulas as values 
is recursive (i.e., if the corresponding numerical function is recursive in the Her- 
brand-Gédel sense), and if all the values have the same free symbols, then L is 
d-definable. 


We are now in a position to infer the \-definability of various sequences of 
well-formed formulas from the theory of recursive functions. We give several 
examples, each accompanied by a definition displaying the recursiveness of the 
corresponding numerical function A(z).% a, f, --- stand for the numbers 
corresponding to A, F, --- , respectively. 


(26) The sequence Ao, --- , Ars, F(O), F(1), - - - is \-definable (if Ao, --- , Ara, F 
have the same free symbols). 


Ma) = el(x =Ok&y=m)V---Veer=k-lLlk&y=auv(rzkk&y 
= E(f)*E(Z(x~k))]. 


(27) The sequence Ao, --- , Ax-s, F(O, Ao, --- , Axa), F(1, Ai, --- , Ax), --- , where 
A; denotes the (i + 1)-th member, is \-definable (if Ao, --- , Ax have the same free 
symbols, and the free symbols of F are free symbols of Ao). 


d(0) = &,-**; Ak _- 1) = Qk-1, A(k oo x) 
= E(--- E(E(f)*E(Z(z)))*E(A(a)) --- )*E (Q(x + [k — 1))). 


(28) The set of formulas derivable from A(0), A(1), --- by zero or more successive 
operations of passing from M and N to R(0, M,N), R(1, M,N), - - - is \-enumerable 
(if the free symbols of R are free symbols of A). 


23 Here are used known recursive functions and relations, the methods of Gédel [1], 
Kleene [3] V, direct recursive definition by equations. 
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A(x) = 0(R(E(a)*E(Z(0))), x), where @(z, m) is chosen by Kleene [3] I taking 


¢(n, z, y) = a {n)2& ge B(z(#e+e(2([5])))2@ sen} 
V {n +1\2&2= Baek 2(|" + ‘)))} |. 


38. EW(0) = z (the number corresponding to /). 
EW(x+1) = eylEW(2) <y S Z(z) & WF(y) & (pp Sy > p Fr yl}. 
EW(0), EW(1), --- is an enumeration of the well-formed formulas with no 
free symbols. 


(29) The class of well-formed formulas (having a given set of free symbols) is 
d-enumerable. 





For the case of no free symbols, A(x) is the function EW (zx) which precedes; if L 
h-enumerates the class for this case, Ax-L(x, z:, --- , Zm) \-enumerates it when 
the set of free symbols is 2;, --+ , Zm. 


39. NF(x) = WF(z) & (p, 9, 7, 8, p, 9, 7, 8,¢ S 2 & WF(q) & WF(r) & WF(s) > 
x # pxE(R(1)*q*H(r))*H(s) +t}. 


x is a well-formed formula in normal form. 


40. ENF (zx) (defined in the same nanner as EW (x) replacing WF (zx) by NF(z)). 
ENF(0), ENF(1), --- is an enumeration of the well-formed formulas with 
no free symbols in normal form. 


41. EN(x) = EC(ENF(1 Gl Dy(z)), 2 Gl Dy(z)). 
EN(0), EN(1), --- is an enumeration of the well-formed formulas with no 
free symbols which have normal forms. 


(30) The class of well-formed formulas (having a given set of free symbols) which 
have normal forms is \-enumerable.™ 


This follows from 41 (or 40) in the same manner as (29) from 38. 


5. \-definable well-formed functions. In the extension of the notion of 
recursiveness to functions L of which the values are any well-formed formulas, 
the point of view in which interconvertible formulas are regarded as equivalent 
iscompromised. Every well-formed formula \-defines 2° functions L of n natural 
numbers, each corresponding to a different numerical function A(x, --- , Xn). 
Since the power of the class of recursive numerical functions is No, not all func- 
tions L \-definable by a given L are recursive. In order to prove a theorem like 
(23’), there must be added to the hypothesis of \-definability a condition on the 
form of the values L,,...:, of L which selects from the formulas in which 
L(x, --- , Xn) is convertible that one which is Z,,...,,. A condition of this sort 


24 This theorem is due to Church. 
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which can be used here to replace the condition of representing a natural number 
is that of being in normal form, supplemented by a convention which removes 
the ambiguity in the normal form of a formula: a formula shall be in principal 
normal form if it is in normal form and the symbol following the n-th occurrence 
of \ is the n-th proper symbol (in the given list) which is not a free symbol of 
the formula. 


42. PNF(x) = NF(z) & (p, 9, 7) {p, ¢,7 S x & x = p*R(1)*R(q)*r — 


q = es[s S ex & PS(s) & s Frxz& s Geb pl}. 
x is a well-formed formula in principal normal form. 





If all the values L,,...., are in principal normal form, and A(x, - -- , Z,) is chosen 
as in the proof of (23’), we find that A(m, --- , 2x) = EC(A(m, --- , Zn), 
ey[PNF(EC(A(a, --- , 22), y))]), which is recursive in the Herbrand-Gédel 
sense, since (%, --- , 2n)(Ey)PNF(EC(A(n, --- , Zn), y)). 


(31’) Every d-definable function of n natural numbers of which the values are well- 
formed formulas in principal normal form is recursive (i.e., the corresponding nu- 
merical function is recursive in the Herbrand-Gédel sense). 
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SOME MORE THEOREMS CONCERNING FOURIER SERIES AND 
FOURIER POWER SERIES 


By G. H. Harpy anp J. E. LitrLewoop 


1. Introduction 


1.1. The principal theorem in this paper is Theorem 10: if u(@) is periodic, 
with period 27, and integrable, s,(x) is the partial sum of the Fourier series of 
u(@), for @ = 2, s(x) is arbitrary, 


(1.1.1) o(x, 0) = z{u(e + 0) + u(x — 6) — 2s(z)}, 
and 

(1.1.2) k2p>1, 

then 


IIA 





ed R\ Wk = " l/p 
(1.1.3) (> | sn(z) = Ht) K(p, k) ([ Lote a)" 


This theorem is, in a sense, a theorem of ‘strong summability’.! It is known 
that, if 


(1.1.4) [ | G(x, t) |? dt = o(8), 
0 

for some p > 1, then 

(1.1.5) > | sa(2) — s(2) | = on), 


for every positive k. In Theorem 10 both hypothesis and conclusion are 
stronger. In fact (1.1.4) is equivalent to 





(1.1.6) [ wa OF a o@d. 
and (1.1.5) to 

2n 
(1.1.7) > lee = ae) _ oa); 


and (1.1.6) and (1.1.7) are plainly consequences of the convergence of the 
integral and the series in (1.1.3). 


Received November 3, 1935. 
1 See Hardy and Littlewood (2, 4, 8), and Zygmund (15), 237-241. The bold-faced num- 
bers refer to the list of references at the end of the paper. 
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There are other points of difference between Theorem 10 and the theorems 
about strong summability. Thus (1.1.4) says the less the smaller p, and 
(1.1.5) says the more the larger k, in each case because of Hélder’s inequality. 
There are no such obvious relations of inclusion between different cases of 
Theorem 10. The convergence of 


(1.1.8) > naz 


for given positive a, and r, does not imply its convergence? for any other r; 
and the integral and series in (1.1.3), for different pairs of values of p and k, 
are similarly independent, so that no case of the theorem implies any other 
case in any trivial manner. 

Finally, (1.1.4) is satisfied for almost all z, if u(@) is L”, while the integral 
in (1.1.3) may diverge for almost all z. 

1.2. In §§2-3 we prove some ‘pure inequalities’ which we require later; the 
theorem essential for our applications is Theorem 3. In §2 we deduce this 
theorem from a very general theorem (Theorem 1) which we have proved 
elsewhere ;* but, in view of the length and difficulty of the proof of Theorem 1, 
we add a direct proof of Theorem 3 in §3. 

In §4 we prove our main theorem for a special class of functions, those which 
are boundary functions of analytic functions f(re) regular for r < 1. The 
Fourier series of such functions are ‘Fourier power series’.* In this case we 
can (as in general we cannot) include the value p = 1. 

In §§5-6 we complete the proof of Theorem 10, for general u(@). In §§7-9 
we give a more direct proof of an analogous theorem for Fourier cosine trans- 
forms; and we conclude, in §10, with a few miscellaneous comments. 


2. Inequalities 


2.1. Our argument depends upon a number of special cases of a very general 
inequality® which we proved in 7 and restate here. 
We suppose that 


O<psq, r>0, y=a+6-1, 
1 1 1 1 1 1 1 
wai SE, wtotes, atades, 
- a*q a ats’ 0 Pp r 
a =0, bb = 0, a, 20, b, 20, 


Cn = ab, + Abn + eee + Ando, 
? Thus (1.1.8) is convergent, witha, = (logn + 1)~¢,ifandonlyifr>1/a. Onthe other 
hand, if . 
a, = n® (n = 2”), a, = 0 (n # 2™), 
then (1.1.8) is convergent if and only if r < 1/8. 
3 Hardy and Littlewood (7), Theorem 1. 


‘ For fuller explanations see Hardy and Littlewood (5). 
5 Hardy and Littlewood (7), Theorem 1. 
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and 
Ar = >) n-(na,)?, Be = >> n-“(n8d,)2, Cr = >> n“(nre,)’. 
1 1 1 


We allow infinite values of p, q, or r; if, for example, p = , then A is to be 
interpreted as 


(2.1.1) lim lim (> n(n a)” = max (n“a,). 


NS po 1 


The fundamental inequality is 


(I: 2.1.2) C < KAB, 
where 
(2.1.3) K= K(p, q, 7, a, B). 


We say that a set of conditions is necessary and sufficient for (I) if, when the 
conditions are satisfied, (I) is true for some K of the type (2.1.3) and all a,, b,, 
and, when they are not satisfied, (I) is false for every such K and some a,, b,. 
In stating the theorem we distinguish between ‘ordinary’ and ‘exceptional’ 
cases. A case is ordinary when p, q, r are finite and p ¥ 1, and otherwise 
exceptional. 
THEOREM 1. (1) Jt is necessary for the truth of (1) that 


(2.1.4) p21. 


(2) [Ordinary cases.] It is necessary and sufficient for the truth of (I), in an 
ordinary case, that (2.1.4) should be satisfied (so that p > 1) and also one of the 
four (mutually exclusive) alternative conditions 


(2a; 2.1.5) tTSrsp, a<1l, B<1; 
(2b; 2.1.6) p<rsq, a<l, BS By; 
(2c; 2.1.7) r21, q<r, a Sa, B S Bo; 


T 


(2d; 2.1.8) 7 <1, q<rsj , 
— 7 





ala, BS Bo. 


(3) [Exceptional cases.| The only case® in which p = © and (I) is true is 
the case 


p=q=r=Tt= 2, a<l, B<1. 


When p is finite, all exceptional cases except four are normal, in that the con- 
ditions appropriate to them can be derived from those catalogued under (2) by 
substitution of the special values of the parameters p, q, r and interpretation of 
B and C, if necessary, in accordance with the convention (2.1.1). 


6 In fact, all cases with p = © are normal (in the sense defined below). 
7 The four cases (2a)—(2d) are mutally exclusive even when exceptional values of p, q, r 
are allowed, so that the definition of ‘normal’ is unambiguous. 
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The four abnormal cases are® 


piper p=qd=r=1. In this case the conditions area <= 1, 8 S 1 (instead of 
< 1,8 <1). 

@ p=1<r<gq. In this case the conditions are a < 1, 8B < Bo (instead of 
< 1,8 S fo). 

(8c) p=1<r=q. In this case the conditions area S 1, B S Bo (instead of 


(3d) > >1,7>1,r= ~. In this case the conditions are a < a, 8B < Bo (in- 
stead of a S a, B S Bo). 

It may be observed that ag < lifr >qand®{<1lifr>p. Thusa<1l 
and 8 < 1 in all of the cases (2a)—(2d). 


The case g = 


2.2. We now specialize the theorem by supposing that g = ~ (so that r = p). 
In this case 


B = max (n*b,) , 


and B < « means b, = O(n~*); and there is no real loss of generality in sup- 
posing that 


=n? = nr', 
say. 
We shall specialize a little further by supposing that 
a<l, B<1l, w=1-86>0. 
Then 
(2.2.1) C.= S (n- 2)" a, = a 


O0<s<n 


is effectively the Riesz or Cesaro sum,’ of order w, formed from the series >> an; 
and (I) becomes 


(I: 2.2.2) (> n-(nr a)*)"" SK (D> n“(n« a,)”)"? , 
with 
(2.2.3) K = K(p, r, a, w). 


Making these specializations in Theorem 1, and rearranging the results in 
a more convenient manner, we obtain 
THEOREM 2. Suppose that 


lsp<o, a<l, o>Q0. 


8 The order in which these cases are catalogued is not the same as in 7; and there is no 
parallelism between them and (2a)-(2d). 

® In fact C, is Riesz’s mean, with integral n, multiplied by a factor I'(w). 

Riesz admits non-integral values of n, and this is important for some of the more delicate 
properties of his sums; but the difference is not significant here. 
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Then it is necessary and sufficient for the truth of (I), with a K of type (2.2.3), 
that one or other of the sets of conditions 


1 p 
.2.4 ts. 
2:34) emo deities, Or", 
1 
(2.2.5) w=-, psr<o, 
p 
: 1 
(2.2.6) w>-, psrsnx 
P 


should be satisfied; except that the last < in (2.2.4) must be changed into <, and 
the last < in (2.2.5) into S, when p = 1. 
Finally, if we specialize still further by supposing 


w=a>Qd0, 7¥=0, 
(I,) takes the form 


(In: 2.2.7) (> na)" Ss KC) ner a?) 
with 
(2.2.8) K = K(p, r, a); 


and we obtain 
THeoreM 3. I[f 


lsp<o, 0O<ac<l, 


then it is necessary and sufficient for the truth of (I:) that one or other of the sets 
of conditions 





1 p 1 
- < < —__- == } 
“<< parar—a (and + < - +. when p 1); 
(2.2.9) a=c, psr<; 
sy - psrso 
p’ - 


should be satisfied. 


3. Direct proof of Theorem 3 


3.1. We have deduced Theorem 3 from Theorem 1, whose proof occupies 
some thirty pages of 7; and the deduction by specialization, though straight- 
forward, requires a good deal of attention. We therefore add a direct proof of 
Theorem 3, or rather of its positive clauses, which give sufficient conditions for 
the truth of (I). 


Ee 
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We suppose first that 





(3.1.1) l<p<r<om, 
and that 
(3.1.2) r<—? 
1 — ap 
ifa <1/p. We write” 
1 1 
(3.1.3) B= pa—l, v=(2-})p, 
p rr 
(3.1.4) S? = } i nb a? . T? = } > n-(a\2)* . 
so that the inequality to be proved is 
(3.1.5) Ts KS. 
We can choose p and @ so that 
(3.1.6) -—-l<p-—7<--—a, 
ll 
(3.1.7) Sane alas, 
P r 
Then 


2 i 
(3.1.8)a = >> s7 a; - s°-7(n — sete. s-e(n — s)-" al SU” 
as<n 


1 
Vv? Ww’, 


where p’ is defined as usual by 


1 1 
e"s- gre? 
and 
(3.1.9) U= > sta? <8, 
s<n 
(3.1.10) VY= Zz gle—y)P’ (n ‘ai gs) (t+e—-)p’ . 
s<n 
(3.1.11) W = D> s(n — 8)" a”. 
s<n 


It follows from (3.1.6) and (3.1.7) that 


(oe-—y)p’>-1, (+a-1)p’>-1, 


10 There will be no disadvantage in using 8 and y in senses different from those of §2. 
1 Observe that this choice of « presupposes (3.1.2.). 
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and so 

(3.1.12) V Ss Kn'o-*tete-ve'th 
with 

(3.1.13) 2 K= K(p, rT, &, Pp, a) - 


Combining (3.1.9) and (3.1.12) with (3.1.8), we obtain 


(3.1.14) n—(a'®)" < K S-? n'W , 

where 

(3.1.15) tar(o—vteta-144)-1. 
Hence 


T = Dn (a) < K St? So nt DY s-r(n — 8) a? 
n n s<n 


(3.1.16) 
= K S-? >) sta? >> n(n — 8)-"* 


n>s 


But or < 1, by (3.1.7), and 
t—er=e(o-yt+a-1)-1<-1, 


by (3.1.6), so that 
(3.1.17) : n'(n — 8)-* < K st. 


n>s 


with a K of type (3.1.8). Finally 
t—or—pr+1= —w +or—= =, 
by (3.1.15) and (3.1.3); so that (3.1.16) and (3.1.17) give 
T<KS-°> sa? =KS'. 


Here K = K(p, r, a, p, «), and K = K(p, r, a) when suitable values, satisfying 
(3.1.6) and (3.1.7), are given to p and o. 

3.2. Suppose next that 

1 


i-@ 


We choose p and ¢ as in §3.1, so that 


1 
Sie Va leet een -: 
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and 
s°-1(n — g)tte-! < ne-rtete-l | 
Thus (3.1.8) may be replaced by 


a® < ne-yteta—l [Jilr’ Wile : 


where U is defined as before (with p = 1), and r’ like p’. The proof then 
proceeds as in §3.1. 
There remain the marginal cases, 
1 p 1 
r=P?p; ~s, “£22 =; to . >1, ar>-, ree, 
vials Pp Il—ap’ ” Pp 
The first of these, like the case (3.2.1) treated above, may be disposed of by 
an appropriate simplification of the main argument. But in this case we 
can go further, and find the best possible K. We therefore postpone this case, 
and treat it, in Theorem 4 below, as a separate theorem. 
3.3. When 








1 Pp 
(3.3.1) p>l, saliead eek er 


the proof lies a little deeper.’ It is impossible to choose ¢ so as to satisfy 
(3.1.7), and we must appeal to a theorem which Pélya® and we have proved 
elsewhere. 
We write 
b, az n2—-lp an . 


so that S*? = >> b?. Then 


1 1 1 
a) = > (n— s)187 “b, S nP * DY (n — 8)1b, = nP “0. 
s<n s<n 


Hence 


1 
TI09 = Dw (ae))e> SW) er = K(D bg) He? = KS=, 


by the theorem referred to. 
3.4. There remains the case 


(3.4.1) p>, a>s, r= o, 

The theorem then asserts that 

(3.4.2) a = >> (n — s)™a, < KS. 
s<n 


12 We are in case V of Theorem 1: see 7, §§7-8. 
13 Hardy, Littlewood and Pélya (9, Theorem 5). 
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Now 
ons Sy 
> (n — sea, = Ds” Pa,-8? “(n — 8) 
(3.4.3) s<n s<n . 
< s( > s(n — ayrren 
s<n 
Since 
p’ 


ow ay - t= w= —1+p(l-—a)>-1, 
, , 1 
ehhh Bate, 
P 
and the sum of these indices is —1, the second factor on the right hand side 
of (3.4.3) is bounded; and this proves (3.4.2). 


3.5. When r = p, we can prove the more precise result which follows. 
TueoreM 4. If p 2 1,0 < @ < 1, then 


(3.5.1) : n-(a\*)? < (x cosec am)? > neP-lq?. 


The constant factor is the best possible. 
(1) If p > 1, we take 





(3.5.2) tnt, ye net, 
p p 


These values satisfy (3.1.6) and (3.1.7).4 Then 
a“ < (2 sP'(n — aieeiaal sad | Zz s?(n — s)-“rar = Vir’ wie, 


s<n s<n 

Here 

V=>os*(n-—s) 1 < | x-*(n — x)* dx = m cosec ar. 

s<n 0 

Hence 

T? = > n“(a\)? < (x cosec ar)?! Y n- D> s-#?(n — s)-*P a? 

n s<n 
(3.5.3) 
= (x cosec ar)?! >> s-»?a? D> n(n — 8)-”. 
3 n>s 
But 
n(n — s)-? = Di n(n — 8s)" < z(2 — s)*"dzr 

(3.5.4) x n>s ° 


= m coset amr-s*! 


14 y is now 0. 
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and 
(3.5.5) —ppt+a—1l= —(p— la+a-—1=ap-—1. 
Hence (3.5.1) follows from (3.5.3). 
If p = 1, r = 1, the equations (3.5.2) reduce to p = 0, ¢ = 1 — a, and the 
conditions (3.1.6) and (3.1.7) are not satisfied. But in this case 


> mta® = Yo nt YS (n — 8) a, = Da, dS m-X(n — 8) 


8 n>s 
< x cosec ar D> s*'a,, 
& 


and (3.5.1) is still correct. 

We have now completed the proof of the main clause of Theorem 4 (and so 
that of Theorem 3). To prove the constant in Theorem 4 the best possible, 
we take a, = n~*~, where 6 is small and positive. Then 


al?) = z (n - g)e-1g-e8 w T(a) ha = - 8) -3, 





and it follows, by an argument of a familiar type,“ that the constant cannot 
be less than 





: l(a) T(1 — a — 8) 
im ( rd — 6) 


The Riesz mean of a, is not a‘*) but a‘*’/I'(a). If a‘*’ were actually the Riesz 
mean, the constant would be (I'(1 — a))?. 

3.6. All these theorems have naturally their analogues for integrals. In 
particular we require 

Tueorem 5. Suppose that p, r, and a satisfy the conditions of Theorem 3; 
that f (x) = 0; and that 


P 
) = (x cosec ar)”. 


(3.6.1) fula) = gh i * @— yfly) dy 


is the Liouville integral of f(x) of order a, with origin). Then 
(3.6.2) ( i] ” A(f,(2))* az)" < K( | rri(f(a))r de)”. 


When r = p we can take K = I(1 — a), and this is then the best possible value 
of K. 

The proof is the same except for trivial simplifications. 

Finally, taking 


p>, a = 1/p; p> 1, a = 1/p’, 


in Theorems 3 and 5, we obtain two theorems which are particularly important 
for our applications. 


15 See Hardy, Littlewood and Pélya (10), 232. 
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THeoreM 6. Jf 
p> Il, psr<®, 
and a, = 0, f(x) = 0, then 
(3.6.3) (> na /p)yr\ ue < K(>a?)'”, 


(3.6.4) ( | Hf, (2))" az) <K ( | ” (f(2))? az)” 


THeoreM 7. Jf 


b 


1< p< 32, psr 


or 


or 

p > 2, psr 
and a, = 0, f(x) = 0, then 
3.65) (S wilat!/ ys KS way, 


(3.6.6) ( i] eM fuype())" ax)" <K ( .. x?-2( f(x)? ax)”. 


In each theorem K = K(p, r). 


lA 
8 


4. Theorems on power series 


4.1. We suppose now that 


(4.1.1) f(2) = Den” 

0 
is an analytic function regular for r = | z| < 1, that p = 1, and that 
(4.1.2) | f(z) |? |1 —2z|?"' do = | f(re®) |? | 1 — re®|»-! do 


is bounded for r < 1. 

We may always suppose, if we please, that co = 0, since the theorems which 
we prove under this restriction may be extended to the general case by con- 
sidering z f(z) instead of f(z). Series in which n occurs as a denominator are 
extended over the range 1 to ~. 

If p = 1, f(z) belongs to the (complex) class L, and all the standard relations 
hold between the function, its boundary values, and its coefficients. In 
particular, 
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leo st [sem | ao 


and 





len] — A a i8) | 
b> n s A [i160 ao, 


where A is a constant." Hence 


> lat Wk ' tae 

Recta A < 16 

( n ) s A [ isen ao 
forlsks ~. 


The situation is not quite so simple when p > 1, since f(z) does not usually 
belong to L. If however 0 < \ < 1, then” 


(p—1)a — (2-2) 
fisraom fiseia— ele tae a 


' Ly — (p—1)a p—r 
s(f isiei1—ziraa)(f 11-21 = do) 


and (p — 1) A < p — A, so that the second factor is bounced. Hence f(z) 
belongs to the class L*, and has a boundary function 


F(0) = fie*) 


of LD. Also, since (1 — z)"/”’ f(z) is a power series of the class L”, and 
(1 — e*)"»’ F(@) is its boundary function, we have 


| | F(@) |? | 1 — e® |» do = lim / | f(z) P| 1 —2|\>*do< @. 
On the other hand, if 
[iP@ via errae < «, 
then (1 — e’)'/”’F(6) is L?, and is the boundary function of a function 


(1 — z)'/»’g(z) of the complex class L”; and g(z) must be f(z), since F(@) is the 
boundary function of f(z). Hence 


fise \>| 1 — z |?" dé 
is bounded. Finally, since the ratio 
[iro \>|@|>-"do : [iro >| 1 — e® | >" do 


16 Hardy and Littlewood (3), 208. 
17 The range of @ is always supposed to be (— 7, 7). 
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lies between positive bounds depending only on p, our condition on f(z) is 
equivalent to the condition 


(4.1.3) | | F() |? | 0 | do < aw, 


4.2. THeoreM 8. Jf 
lspsk<e 
and the integral (4.1.2) is bounded, or F(@) satisfies (4.1.3), then 


(4.2.1) (> Le)" = K( [ | F(6) |? | 6 |> ao), 


with K = K(p, k). 

(1) We have already disposed of the case p = 1; in this case we may include 
the value k = «. 

(2) We suppose then that p > 1. We shall use (besides the theorems of §3) 
three known theorems concerning Fourier series, expressed by the inequalities 


(4.2.2) (S| us ?’)'" < (t [ “Tap ao)” (l<ps2), 
(4.2.3) Z| nirtiuales Kf |hlrao <ps2, 
(4.2.4) El ules Kf |hlrlolreao (p = 2). 


In these theorems, of which the first is due to Hausdorff® and the second and 
third to ourselves,” 


oc 
2. Un end 
—2 


is the complex Fourier series of a function h(@) for which the integral on the 
right hand side is finite. 

We must distinguish the cases p = 2 and p 2 2. 

(3) Suppose first that 1 < p < 2. It is plain from Hélder’s inequality that, 
if (4.2.1) is true for k = k; and for k = ke (and the same p), it is true for 
ki Sk Ske. It is therefore sufficient to prove it (a) when 


4.2.5 a a a 
(4.2.5) P "pee 
and (b) when” 

(4.2.6) k2p’. 


18 Hausdorff (11); Zygmund (16), 189-192, 200-202. 

19 Hardy and Littlewood (3), Theorems 5 and 3; Zygmund (16), 202-215. 

20 The two ranges overlap or abut when p = 3, and then no appeal to Hélder’s inequality 
is necessary. 
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The function 
¢(z) = ; nlp 2" 


is regular for r < 1, z ¥ 1, and has no zeros, except z = 0, in or on the unit 
circle." Near? z = 1 


‘hd mw qa? 


1 — 2)’ 
and the ratio 
| (6%) | | 0 [1 
lies between positive bounds K(p). 
If 
= .— J) ” 
ge) = D) bar =, fle) = 4) 9), 


then 
Cn, = } (n — s)-/? b, = pit/>" , 


s<n 


in the notation of §2.2. The function g(z) is regular and belongs to L”, and 
has a boundary function g(e”) = G(é@); and the ratio 


| G(@) | = | @|¥" | FC) | 


lies (for almost all @) between positive bounds K(p). 
We now distinguish cases (a) and (b). In case (a) we use Theorem 7 and 
the second of the three theorems quoted in (2), viz. (4.2.3). These give 


(> Lely" i (> _ si ° < KD nP-? | b, |) Ue 
q l/p /p 
< k( | |G\ras) s K( [ Fr \o\-ae) 


In case (b) we use Theorem 6 (with p’ in place of p) and the first of the 
theorems of (2), viz. (4.2.2). We thus obtain 


Oi Niel Ot ci 
< K( | 1G ao)” < K( | | F |r| o|rd0)"” 


*1 This is a case of ‘Kakeya’s Theorem’. 
22 In fact 


12) ()” 
Pp z 


is regular forz = 1. See for example Lindeléf (12), 138. 
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Thus the theorem is proved in cases (a) and (b), and so whenever p S 2. 
(4) When p = 2 we use 
¥(z) -_ } nap’ gn 
instead of ¢(z). If 
f=w, g= Lobe", 
then 
C= pit /P). 
The function g(z) has a boundary function g(e*) = G(@), and the ratio 
| G(6) | =| 6 |” | F(@) | 


lies between positive bounds K(p). Hence, using now Theorem 6 and the 
third of the theorems of (2), viz. (4.2.4), we obtain 


(D sty" (DUP B) 5 coi ne 


thus completing the proof of Theorem 8. 

The result is not true, for any p > 1, when k = «. It would imply that 
b‘'/ is bounded for any g(z) = >. 6, 2" of the class L”, and it is not difficult 
to construct an example to the contrary. 

4.3. THEeorEM 9. Suppose that F(@) is the boundary function of an analytic 
function f(z) = > enz” of the class L; that 

S(t) = Dc, er ; 
0 
thatk = p = 1; and that 
||" | F(@ + @) — s(x) 
is, for a given x and s(x), integrable in 0. Then 


> | a(x) — s(x) |*\¥* = K([ | F(a + 6) — s(x) aw)” 
; nie, 8 [o| 


1 ° 








with K = K(p, k). 
We may suppose z = 0. We have then only to write 
f(z) — s(0) = h@), 
h(z) 
g(z) = = > (s,(0) — s(0)) 2”, 


i—g 





and to apply Theorem 8 to g(z). 
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5. Extension to general Fourier series 


5.1. It is natural to expect that, when p > 1, there will be a theorem for 
general Fourier series corresponding to Theorem 9. 

Let us suppose that p > 1; that u(@) is a periodic function of @ of the class 
L?; that 


u(@) ~ >> c, e"* 


(or 
u(@) ~ a9 + p (a, cos nO + b, sin né)) 
is the Fourier series of u(@); that 


s,(x) = > OT 


-s 


(or 
8nx(t) = 4.40 + >> (a, cos vx + b, sin vx)), 


and that $(z, @) is defined as in (1.1.1). We shall prove 
THeoreM 10. Ifk = p> 1and| 6 |- | $(2, 0) |? is integrable in 0, for a given 
x and s(x), then 


(5.1.1) (> | sn(z) = s(x) ‘y" < «(| |e ag)", 


with K = K(p, k). 
5.2. We may make the usual formal simplifications, supposing x = 0, 
s(x) = 0, and u(@) real and even, so that 


u(0) ~ fa) + >> a, cos n6, 
o(z, 6) = u(@) ’ 





Sn = 8n(t) = 8,(0) = $a0 + ps a, . 


We have then to prove that 


bad \ve wt > We 
(5.2.1) (> =") = K(/ Ol an)”. 


The function u has a conjugate v, odd and of L”, defined by 


(5.2.2) v0) = — 2 |" cot 3 @ — 0) ule) ag. 


The associated harmonic function vanishes at the origin. 
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Let us assume for a moment that we have proved that 
* | v(0) |? * | u(@) |? 
2.3 das K —— di 
(5.2.3) [ 9 49 = K(p) / 5 48 


whenever the integral on the right is finite. Then F(@) = u(@) + 7v(@) is the 
boundary function of an analytic function f(z) = )> c,2” satisfying the condi- 
tions of Theorem 9, and s, = @ + + --- + ¢,. Hence 


k\1L/k F Pp Ip Pp l/p 
(> Ly" s x( [0 ao) K(f OL a)”, 
n |6| | 0| 
with K = K(p, k). 
The proof of Theorem 10 is thus reduced to the proof of (5.2.3). 





lA 











6. Theorems on conjugate functions 


6.1. It is well known” that a function U(x) of L?(—«, «), where p > 1, 
possesses a conjugate V(x), defined for almost all 2 by 
UW) gy 


oy—2 


(6.1.1) Pad « = | 
Sa 

which also belongs to L?(—«, «). The integral is a Lebesgue integral at 
infinity and a principal value at y = z. 

When U(z) is even (a hypothesis essential in the sequel), we may also write 





. 1[" 2@& . 
(6.1.2) V(x) = - | zoel dy . 


This integral exists under wider conditions than that in (6.1.1). Suppose, for 
example, that U(x) is L” in every finite positive interval, and that z*U, where 


ee Peo 
Pp 


is L?9(0, ©). Then the integral converges as a principal value (for almost 
all x) across y = | x | ; and, since 


' U | > 1/p - > ' L/p’ 
| P dy < (| (y| U ray) (/ y (2+a) p ay) 


@+ ap’ >(1-1)p = 1, 








and 


it converges absolutely at infinity. We may therefore define V(x) by (6.1.2). 
TuHeEoreEM ll. Jf 


(6.1.3) —-l-—-<a<—s=l1-.-, 
p 


23 M. Riesz (13); Zygmund (15), 147-149. 
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x*U (x) is L?(0, ©), and V(x) is defined by (6.1.2), then x*V(x) is L?(0, &), and 
(6.1.4) i] (a*|V |)? dx < K | (x*|U |)? dz, 


with K = K(p, a). 
We denote by V*(z) the conjugate of the even function | z |* U(x). Then 
V*(z) is L? and 
| |\V*\»dx< K | (a*|U |)? dz. 
0 0 
It is therefore sufficient to prove that 
(6.1.5) | |V*—a2a*V\|pdrs x | (a*|U |)? dz. 
0 0 


Now, when z > 0, 


ve avn? | My2yrUW) dy, 
0 


where 





Me, « 22-2. 
'Y, 2) 7 ypae 


This function has a fixed sign, viz. that of a, and is homogeneous of degree —1; 
y* —_ 1 yen dy < a) 


and 
| M y,1 ly Vp dy = i] 
I - )| o lyY-l1 


when a satisfies (6.1.3). Hence‘ (6.1.5) is true under the conditions of the 
theorem. 
In particular, when a = —1/p, we obtain 


(6.1.5) | VP as Kp) | UP ae. 
0 x 0 x 


6.2. THeoreM 12. If p > 1, u(@) is periodic and even, 0 | u(@) |” is in- 
tegrable, and v(@) ts defined by (5.2.2), then 


(6.2.1) [ ere < Kw) [ ha P 


We have 








v(6) = — =f (cot 3 (6 — 0) — cot ; (@+ 0) ue) do 


(6.2.2) : ; 
= t/ _ snd u(@) do. 


T cos @ — cos 8 


24 Hardy, Littlewood, and Pélya (10), Theorem 319. 
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If we write 
x = tan 20, y= tan3¢, u(o) = U(y), v() = V(z), 
then (6.2.2) becomes (6.1.2). Also 


° | a ie |p ® | > - 
I Lu) ip = [U@) I” ar, | OE a « | VO) ae, 
o sing Jo z o siné 0 x 


and therefore, by Theorem 11, 


i "12 P ao < K(p) i " RF we. 











sin 6 


This implies (6.2.1). 

In proving this theorem, we have completed the proof of Theorem 10. 

6.3. It is to be observed that the truth of Theorems 11 and 12 depends 
essentially on the hypothesis that U(x) and u(@) are even. It is not true, 
without this restriction, that the integrability of @-' | u(@) |? involves that of 
6! | v(@) |". Suppose for example that 


sin n0 cos n8 
uO) = 2) aap? 0) = - 2) ee 


where 8 is positive. Then u(@) behaves like a multiple of | log @ |~* for small 
positive @, and | @ |~' | u |” is integrable if (and only if) Bp > 1. On the other 
hand v(@) behaves like a multiple of | log @ | '~%, if 8 < 1, and v(@) — v(0) be- 
haves in this way if 8 > 1; and neither | @ |-' | » |? nor | @ |-* | v(@) — v(0) |? 
is integrable unless Bp > p + 1. 

We can show, by an argument like that of §6.1, but based upon the formula 
(6.1.1) instead of upon (6.1.2), that the conclusion of Theorem 11 is true, for 
general U(x), when —1/p < a < 1/p’; but the value —1/p of a, the critical 
value for our purpose, is excluded. 


7. Fourier transforms 


7.1. Our proof of Theorem 10 is comparatively simple (granted the inequali- 
ties of §§2-3) but very indirect, and it is natural to ask for a proof independent 
of the theory of analytic functions. For the sake of variety we give here not 
this proof but the proof of the analogous theorem for Fourier cosine transforms. 
To simplify the formulae, we suppose throughout that k = p. 

We use the notion of a ‘limit in mean’ or ‘strong limit’, with index p, of 
a function s,(x). This, if it exists, is a function s(x) such that 


x 
lim | sa(x) — s(x) |?>dx = 0 
ao J0 


for every positive and finite X. We write, after Wiener, 


s(x) = Li.m. s4(x). 


an 
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A limit in mean is, apart from null sets, unique. 
THEOREM 13. Suppose that p > 1, 





(7.1.1) [ er dx < ~, 
and 
(7.1.2) a(2) = | w0) = ut in 


Then sa(x) has a limit in mean s(x) when a > ~, and 
(7.1.3) [ er dx < K(p) [ fe ? os 


7.2. There is a simple proof, which we have not succeeded in generalizing, 
in the case p = 2. 
Suppose first that f(z) = Oforz >c. Then 


e(z) = [ “70 == ie | ” f(t) me @ = ds) 


for a > c, so that s(x) is the limit of s.(x) in the ordinary sense. Also, 


[Ss 6) 4 r= ff = [° f) sin xt a f° flu ) = ru A 


-[ [ SOsf(u) ne sin zt sin ru oR 
0 J0 tu 0 2 
The inner integral is 


é ci - a 
4 tao Et We as 3 | 1 — cos |t—ulz 
0 0 

















(7.2.1) 











2 x = 
1 ey 1 — cos w 
== ——— dw, 
2 |t—ulé w 
and is positive and less than 
og St 
lt— ul 


Hence, if we write g(t) = t'f(t), so that g(t) is L*, we have 


E (. 2 ea we 
(7.2.2) (2) a | M(t, u)g(t)g(u) dt du, 
where 
(7.2.3) Nt@«— ‘+* 





7 id a 
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Finally, since M is homogeneous of degree —1, and 
. oe t+ 
= t tidt= [ - ——— dt = - x? 
m | M(t, 1)t-'d J, j 64 ge <a, 
(7.2.2) implies* 


(7.2.4) i — dx =m [ (g(t))? dt = m [ = dt . 


Passing to the general case, we observe that 








s(x) — s(x) = * 40) == dt, 


and so, after (7.2.4), 
[ (s(x) — sa(z))? 5 -— [ (f(t)? dt, 


x t 








which tends to 0 when a and b tend to infinity. A fortiori 
x 
/ (s(x) — sa(x))? dx — 0 
0 


if0 < X < «. It now follows in the usual manner that s(x) exists for almost 
all x, and that 


[7 CO ae = tim 


x aoe J0 





(02)? ae < m i] ~ SOY a 
x 7 . t , 


We observe here, in order to avoid repetition, that the last stage of the argu- 
ment would run quite similarly for general p. When we have proved the ana- 
logue of (7.2.4), with general p and f = 0 for t > c, the rest of the theorem 
will follow. 

8. Lemmas for the proof of Theorem 13 


8.1. Lemma a. If f(z) 29, p > 1,7r > 1, 


file) = [1 ‘. wien [ @a, 


then 
(8.1.1) / x(fi(x))?»dx = K / x*(xf(x))? dz, 
(8.1.2) [  g-(falx))” dx < K [  g-2(f(x))? de , 


with K = K (p, r), whenever the integrals on the right are finite. 
These are known theorems.%* The cases we require are r = p andr = 2. 
2 Hardy, Littlewood, and Pélya (10), Theorem 319. 


2% For (8.1.1) see Hardy, Littlewood, and Pélya (10), Theorem 330. The second in- 
equality is not stated explicitly in the book, but will be found in Hardy (1). 
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Lemma 6. If 1 < p S 2, and f(x) is L?(0, ~), then 
= cos , . cos 
F(x) = i fo wa at dt = Lim. i fo lis at dt 
exists, for almost all x, as a limit in mean with index p’, and 
(8.1.3) [oti F@ rae s K@) |” \s@) rar, 
0 0 
Lemma y. If p = 2, and x?!» f(x) belongs to L?(0, ©), then 
“3 cos ; ° cos 
F(z) = / fi on zt dt = Lim. / fo a xt dt 


exists, for almost all x, as a limit in mean with index p, and 


(8.1.4) F | F(x) |? dx < K(p) [ xP | f(x) |p dz. 
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For these two theorems see Hardy and Littlewood (3), Theorems 13 and 14. 


8.2. Lemma 6. Let 


(8.2.1) v(x) = x'/P [ (1 — u)—/?’ cos xu du. 


Then the result of Lemma B remains true when (zt) is substituted for cos xt or sin zt. 
It is easily verified by standard methods” that ¥(x) is regular for 0 < x < ~, 


that 
v(x) — pr? 


for small positive z, and that 


w= 1()oo(e-g) +4) 


for large positive z. Hence 


(8.2.2) y(x) = C cos (2 _ 3) + R(x), 
where 
(823) |R(z)|<KO@<zS)), |Riz)| <2 @>, 


and C = C(p), K = K(p). 


27 The simplest method for finding an asymptotic expansion for ¥(z) is to apply Cauchy’s 


Theorem to 


fo — u)-V' etm du 


and the rectangle (0, 1,1 +7%0,7i0). 
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It is enough to prove the result on the hypothesis that f(r) = 0 for z > e, 
when 


F(x) = / S(t) Y(at) dt 
0 
exists, for all z, as a Lebesgue integral; the proof may then be completed as 


at the end of §7.2. 
Now in this case” 


oe l/z C4 
F(x) = c | f(®) cos (x rs z) dt + / f(t) R(at) dt + i f(t) R(at) dt 
0 0 lz 
= F\(x) + F.(x) + F(x) , 


say; and it is sufficient to show that F,, F2, and F; satisfy inequalities of the 
type (8.1.3). This is true of F;, by Lemma gs. Next 


iFeisk |’ isola = K5(%), 


say; and so 


/ x’? | F, |\pdrs x / xP («(4)) dx 
0 0 zx 


=K '§ ax? (filx))? dx 


IIA 


K[ \f(x) |? dz, 


by (8.1.1), with r = p._ Finally 


|F;|s : | fi) dt = K (1) , 
S Fe /o t # x 


c[=(4’))« 


= K [ (fo(x))"? dx = K | | f(x) |p dz, 


say, and 


xe 
/ x? | Fs; \P dx 
0 


by (8.1.2), with r = 2. 
8.3. LemmMAe. Let 


IIA 


1 
x(x) = g!/?’ / (1 — u)-"? cos ru du. 
0 


Then the result of Lemma y remains true when x(at) is substituted for cos xt or 
sin zt. 


28 Our argument is suggested by one used by Titchmarsh (14) for a different purpose. 
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Here 
, ss T 
x(x) = C cos (: —, 5) + R(x), 
2p 
where R(x) again satisfies (8.2.3). Arguing as before, we obtain 
F(x) = Fi(x) + Folx) + F;(x) , 
where 
/ F(x) \° dx S K (p) [ x? | f(x) |? dx 
0 0 


and 


| F(z) | < K4(), | F(z) | < 1%) 


zx 


We have now 


| "| Fir) \?dr SK [ (1(2))’ dx 
) 0 x 


= K [ x (fi(z))? dx = K [ xP? | f(x) |? dx 


by (8.1.1), with r = 2; and 


in F(x) "de SK [ zs? (1(2)Y dx 


= K | x? (fo(x))? dx S K | xP | f(x) |? dz, 
0 0 
by (8.1.2) with r = p. The result follows as before. 


9. Proof of Theorem 13 
9.1. (1) Suppose that 1 < p S 2, that 2~/*f(x) is L”, and, in the first in- 
stance, that f(z) = Oforz >. 
Let 
w(e) = | evs v(et at, 
0 
where y is defined as in §8.2. Then 


r (x — y)“/?w(y) dy = [ (2 — y)-” ay [ t-/P f(t) Y(yt) dt 


- [ t-1/P f(t) at |” (x — y)-VPyp(yt) dy 











378 G. H. HARDY AND J. E. LITTLEWOOD 
(by absolute convergence). The inner integral is 


z 1 
[ (x — y)~/? (yt) ay [ (1 — u)-"/?’ cos ytu du 
0 0 


z y 
= we [ (x - vray | (y — v)-*’ cos tu dv 
0 0 


sin xt | 


= tl/p [ cos tv av | (x — y)-"?(y — v)-"?’ dy = x cosec aed =a 
0 v 
so that 
eo . t z 
s(x) = / f == dt = K(p) I (x — y)~"? w(y) dy 
/0 0 


is, apart from a factor K, the (1/p’)-th integral of w(x). 
It follows from Theorem 7 and Lemma 6 that 


[ te) ae s K | 2r\w(z) |r de < K[ OF an. 


This is the result of Theorem 13, when f(z) is 0 for large x; and the full result 
follows as in §7.2. 
(2) If p = 2 we write 


it | EU f(t)x(2t) dt, 


where x is defined as in §8.3. We again suppose, in the first instance, that 
f(x) = Ofor large z. Then s(x) is substantially the (1/p)-th integral of w(z), 
and 


[ Le) I? ae <K [ | w(x) |»>dx SK [ xP? (ee 'y dx 
0 x 0 0 gilp 
= K{ ISG)? ae 
0 x 


by Theorem 6 and Lemmae. The proof is then completed as before. 


10. Concluding remarks 


10.1. We conclude with a few miscellaneous comments. 
(1) The result of Theorem 10 becomes false for p = 1. 
It is plain that > n—|s,| < © implies }> n“|a,| < © and so 


>) lee deel < we, 
n 
Also 


i | 1 ; 
$n — 5 On = | 5 cot 5t SO sin nt dt 
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is the Fourier sine coefficient of the function $ cot $¢ f(t). Hence, if the result 
were true for p = 1, it would also be true that, if g(t) is odd and integrable, 
and 


g(t) ~ > b, sin nt, 





then 
n 
But 
g(t) =2>) Toe Sin nt = h(t — u) — hit +), 
where 





cos nt 
A(t) = > logn’ 
is integrable, for any u; while 
> | sin nu | 
n log n 


is generally divergent. 
Thus 


[lac « 
0 t 


implies s, — 0 (by Dini’s convergence criterion), but not the convergence of 
> n-|s,|. When p > 1 the situation is reversed: 


(10.1.1) [ser aes 


implies the convergence of > n-'| s, |”, but does not imply s, > 0. For 
(10.1.1) is satisfied whenever 


jt) = 0( (tog t)'), 


and this is not a sufficient condition for convergence of the Fourier series.” 
10.2. (2) It is instructive to contrast our results with the much simpler 
results for the Cesaro mean oa, of the Fourier series. 
If (10.1.1) is satisfied then, 4 fortiori, 


i * | f(u) |? du = o(t) 


29 See Hardy and Littlewood (6), 47; Zygmund (16), 31, 174. 
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and this is, for p = 1, a sufficient condition that ¢, — 0. Also (10.1.1) implies 


(10.2.1) Do < @. 


When p > 1, this is a corollary of Theorem 10; but it is true for p 2 1, and 
may be proved much more simply. For 


° sin? 1/n r | 
(10.2.2) |o,|< a| Lf(o) | sa dt < An | | f(t) | dt + 4 f FO at, 
0 0 1 


nt? a 


where the A are constants; and (10.2.1) is an easy deduction. 
Consider, for example, the first term 


An i | f(t) | dt = Anf, () 


on the right of (10.2.2). The contribution of this to (10.2.1) does not exceed 


Dead [mG(Nusnl GOs 


= K | a? f?(x) dz. 
ec 
We now require the inequality 
| x?" (f,(x))? dx = K | x (f(x))? dz, 
0 0 


which is a case of (8.1.1) when p > 1 and may be verified independently when 
p=il. 

The second term in (10.2.2) may be disposed of similarly. 

10.3. (3) Theorem 10 has a ‘transform,’ viz. 

TuHeoreM 14. If p > 1 and 


Xn? |bal? < &, 
then there is an odd function g(x) whose Fourier series is 


> bn sin nz, 


7 k Wk l/p 
( [ eer iz) < K(p, k) [z mn |ba/?) 


0 


and 


fork = p. 

This may be proved independently, or (when k = p) deduced from Theorem 
13; and there is a simple proof similar to that of §7.2 when k = p = 2. 

The corresponding theorem for cosine series is false. If b; = 0, and 
b, = (n log n)-' forn > 1,k = p = 2, then 
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> nb, = > aca n)? a 


but 








cos nt sin nu . du 1 
t) = —— - 
fi) n log n ~|" > log n u~| u log (1/u) log log t 
for small positive t. 


(4) Theorem 13 is equivalent to 
THeorEM 15. The bilinear integral form 


[ [S824 ceva aay 


is bounded in space |p, p’|: i.e 
x I/p xz 1/p’ 
Lr I giegteatroen de ay] = Keo( J Lat irae) ("1409 a) 
/0 0 


for all a(x), b(y), X, Y. 
The form 


sin mn 
On bn 
mi?’ nlp nilp 


is not bounded in [p, p’], since (e. g.) 


p> |sin mn |?" _ 
oor wns 


for every n.* 
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SUR QUELQUES DEFINITIONS POSSIBLES DE L’INTEGRALE 
DE STIELTJES 


Par Maurice FRECHET 


Introduction. La définition la plus utile de l’intégrale de Stieltjes 
F(x)dv(x) (ot V est le domaine d’intégration) est celle qui concerne le cas 


Vv 
ot F(z) est continu, v(x) 4 variation bornée et ot V est un segment fini. MM. 
F. Riesz et Lebesgue ont étendu cette définition, en la modifiant, au cas od f(x) 
n’est pas continue. 

En Calcul des Probabilités, deux extensions plus modestes paraissent seule- 
ment désirables. Celle concernant F(x), od l’on suppose F(x) monotone et qui 
trouve son application dans la détermination de la fonction des probabilités 
totales de la somme de deux variables indépendantes, forme l’objet de la Pre- 
miére Partie de ce mémoire. Les résultats obtenus prolongent en les complé- 
tant—en particulier, en adjoignant 4 leurs conditions suffisantes des conditions 
nécessaires—certains résultats antérieurs de MM. Lebesgue, Steffensen et 
Glivenko. 

La seconde extension, concernant le cas ot F(x) est supposé continu mais ot 
V est illimité, est le sujet traité dans la Seconde Partie de ce mémoire. Elle se 
trouve utile dans la détermination de la valeur et des propriétés de la valeur 
moyenne d’une fonction continue f(X) d’une variable aléatoire X. 


Les définitions de [ f(x) dC(z). 


Premiére partie. Cas ot f(x) est monotone 


Extension de la définition de l’intégrale de Stieltjes. Quand f(x) et C(x) 
sont deux fonctions définies sur un intervalle fini (a, 8), la premiére continue, 
l’autre monotone, on démontre que la somme 


o = SSG) (Cle) — Ce.) 
tend vers une limite déterminée J quand, en prenant arbitrairement a = 2 S 
& SaaS & S--- S tou S & S 2, = B, la plus grande 6 des différences 
8 
xr; — 2X; tend vers zéro. Et on pose J = S(x)dC(z). On a étendu cette 


définition 4 des cas beaucoup plus généraux. II nous sera seulment utile ici de 
considérer le cas ot f, continue ou non, est monotone comme C(x). Nous ver- 
rons quelle restriction il y a 4 faire pour que |’extension soit possible. 


Received March 2, 1936. 
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Décomposition de l’intégrale. Pour simplifier |’étude, on peut utiliser la 
propriété suivante: si f(z) est monotone, f(x) est la somme de sa “fonction des 
sauts” S(x) et d’une fonction continue g(r). On a alors ¢ = o, + 7, ol m, T 
sont formées comme go, mais 4 partir de g et S au lieu de f. Dans ce qui suit, 
nous pourrons envisager le cas général d’un segment V d’intégration, fini ou 
non, soit (a, 8), (a, +2), (—~*, B), (—», +) pourvu que nous supposions 
les fonctions f et C bornées—ce qui a lieu nécessairement (si on les suppose finies 
en chaque point) quand le segment V est fini. Quand V est infini, la suite des 2; 
sera illimitée dans un sens ou dans les deux. En appelant 1, L les bornes de C; 
h, H celles de f, la fonction fi(z) = f(x) — ha évidemment les mémes sauts. On 
posera alors dans tous les cas 


(1) S@= Dit Ds 
avec 

(2) s; = f(r) —f(% — 9), 
(3) s; = f(r +0) — f(r), 


T1, T2, --- étant les points de discontinuité de f ou encore de f;. Il est clair que 
0 s S(x) Ss H — het que S(z) est non-décroissant comme f(z). Sif est borné, 
g lest done aussi. Quand ¢ est continu et borné et C monotone et borné, la 
somme o; = 2 ¢(§)[C(x;:) — C(xi-1)] est ume somme d’un nombre fini de 
termes ou une série absolument convergente. Et la méthode classique montre 
que o; a une limite unique quand 6 — 0: 


limo, = J = [oo dC(x) . 


Pour établir l’existence de la limite de a, il suffit done d’établir l’existence de 
K = limr, avee r = » S(&) (C(x) — C(ai-a)] . 


b 
S’il en est ainsi, on pourra encore poser / f(z) dC(z) = limo =J+K. Or,en 


supposant, par exemple, f(x) non-décroissante aussi que C(x), on aura t Sr S T 
avec 


(4) r=  % S(ai) (C(ai) — C(ai)] ; T = S(2x;) [C(z;) — C(a-)] . 


Soient t, et 7’: les valeurs de ¢ et T pour deux modes de divisions quelconques, 
t’ et T’ leurs valeurs pour le mode de division obtenu en combinant les précédents. 
On a évidemment, si, par exemple, f(x) et C(x) sont non-décroissants, 


ass?’ ots. 


Ainsi on a d’abord t; < T; et les ¢; ont une borne supérieure finie m, les T: une 


borne inférieure M avec m < M. 











y 
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Existence des limites de ¢ et de 7. Cherchons d’abord ce que deviennent 
tet T quandé-— 0. La série S(x) est convergente, et on a 


T = a >, & + ry $3 | (C(x) — C(a)] . 


rj 32; 
Cette série double 4 termes = 0 est convergente, et on peut écrire 


T= p> ai > (C(x) — Cla + Ls {2 (C(x) — Cle} , 


7,32; j 2; 


(5) T = > s;(L— Cle) + Ds lL — C@e))I, 


x; et x, étant des points pris dans la suite des 2;, soient 24-1 et x1, tels que 


(6) Zo-4 <M B Mei Yan S 1; < 2. 

Que se passe-t-il pour T quand 6-+ 0? Les deux points z,_;, zs. tendent vers 
rj} Za. tend vers r; par valeurs inférieures, donc le terme C(z;) de T tend vers 
C(r; — 0). Quant A C(z’;), sa limite dépend de la maniére dont 23 tend vers 7;. 
Si, 4 partir d’une valeur assez petite de 6, la division D formée des points z; ne 
comprend pas 1;, C(z’;) = C(ag) > C(r; — 0). Si 7; appartient & D pour p 
assez petit C(x’}) + C(r;). Ainsi la plus petite des limites et la plus grande 
des limites de T quand 6 — 0 sont! 


A= 2) 8j(L — Cr; - 9) + D sj (L — Cr), 


st A’'= Dd sj (L — Cr; — 0)] + Yo sj (L — C(r; — 0)). 
On a 
(8) A’'’-—-A= om 8; (C(r;) — C(r; — 0)). 


i 
On verrait de méme que la plus grande et la plus petite des limites de ¢ quand 6 
tend vers zéro sont 


a= Dsj(L— Cr) + X sj{L — Clr; + OI, 


(9) - ” 
a’ =D) s;(L — C(r; + 0)) + z s;(L — C(r; + 0)], 


et on a 


(10) a—a’= a 8; [C(r; + 0) — C(r,)]. 


i 


1 A vrai dire, ceci montre seulement que les termes de T tendent respectivement suivant 
le cas vers les termes correspondants de A ou de A’. Mais les termes de T, A, A’ sont 
inférieurs 4 ceux de la série convergente et indépendante de 5: > s,[C(b) — C(a)] + 
> s;[C(b) — C(a)], ce qui permet de compléter la démonstration. 
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Si donc f et C sont deux fonctions monotones bornées quelconques, r et par suite 
o n’ont pas nécessairement des limites déterminées, l’intégrale de Stieltjes 


S(x)dC(x) peut ne pas exister. Pour assurer l’unicité de la limite de T et de 
4 


la limite de ¢t deux moyens se présentent. Ou bien restreindre V’arbitraire de la 
fonction f ou bien restreindre celui des divisions D. 

Le premier moyen a été choisi par M. Steffensen® dans une étude ou, d’ailleurs, 
en vue des applications actuarielles, il lui a paru suffisant de supposer que f 
n’a qu’un nombre fini de discontinuités et de prendre &; = (x; + xi)/2. Nous 
allons montrer que la condition suffisante qu’il obtient sous ces restrictions pour 
lunicité de o l’est en méme temps pour celles de ¢ et T dans notre cas plus 
général, et en outre qu’elle est nécessaire pour l’unicité de ¢ et celle de T. 

Pour que T et ¢ aient, tous deux, des limites uniques, il faut et il suffit, d’aprés 
ce qui précéde, que A’ — A = a—a’=0. Les expressions ci-dessus de A’ — A 
et a — a’ étant formées de termes = 0, ces termes devront étre tous nuls. En se 
reportant aux expressions (8), (10), on voit que cette condition peut s’exprimer 
ainsi: en tout point de discontinuité commun a f et C, f(x) et C(x) doivent étre 
continues a la fois d’un certain cété de ce point, ce cdté pouvant d’ailleurs varier 
avec le point considéré. C’est la condition imposée a priori par M. Steffensen 
et dont nous trouvons qu’elle est nécessaire et suffisante pour que T et ¢ n’ait 
chacun qu’une limite, indépendante du choix de la suite de divisions D pourvu 
que 6 tende vers zéro. 

Mais on peut, avec M. Lebesgue, supprimer cette restriction sur f en intro- 
duisant une restriction sur les divisions D. Le raisonnement précédent montre 
en effet que si la condition de M. Steffensen n’est pas réalisée, pour qu’une suite 
de divisions D fournisse une limite unique de T et une limite unique de ¢, il faut et 
il suffit que, si en un point de discontinuité x commun 4 f et C, il n’y a aucun 
cété de x ot f(x) et C(x) soient continues toutes les deux, ce point x doit, a 
partir d’une valeur assez petite de 6, appartenir toujours ou n’appartenir jamais 
4 la suite des D. 

A cet effet, il suffit, par exemple, que tout point de discontinuité de C(z) 
appartienne a la suite des divisions D pour 6 assez petit. C’est la condition 
suffisante indiquée par M. Lebesgue* dans un cas beaucoup plus général, celui 
ot C(z) étant a variation bornée, f(x) est seulement supposée bornée. 

D’aprés ce qui précéde, il y a ici d’autres choix possibles et moins restreints de 
la suite des D. Par exemple, il suffit que tout point de discontinuité commun a 
f(z) et A C(x) appartienne a la suite des divisions 4 partir d’une valeur cor- 
respondante assez petite de 6. 

Il y a des choix encore moins restreints de la suite des D, qui donneraient 
chacun A T une limite unique mais distincte de celle qui correspondrait au choix 
précédent. Pour éviter cette indétermination nous allons faire intervenir 
l’unicité de la limite, non seulement de T et de t, mais encore de r. 


2 On Stieltjes’ integral and its application to actuarial questions, Journal of the Institute 
of Actuaries, vol. 63 (1932), p. 447. 
3 Lecons sur l'Intégration, 2iéme édition, 1928, p. 272. 
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T et t sont deux valeurs possibles de 7 et l'on at S$ r S$ T. Done pour que r 
ait pour une suite de divisions D une limite unique indépendante du choix des £; 
dans les intervalles (x,;,, z;), il faut et il suffit: (i) que T et ¢ aient chacun une 
limite déterminée; (ii) que les limites de T et de ¢ soient égales. La différence 
des limites de T et ¢ est en tout cas au moins égale a 
(I) A-—a= Q's; (Clr) — Clr; — 0)1 + D sj (CC; + 0) — CO,)). 

F ? 
Dés lors, pour que 7 ait une limite indépendante du choix des £ ;, il faut que chacun 
des termes (qui sont tous = 0) de A — a soit nul; c’est A dire: 

ConpiTion (N). II faut que, de chaque cété de chaque point, l'une au moins 
des fonctions f et C soit continue, sans qu’il s’agisse nécessairement de la méme 
fonction quand le cété ou le point change. 

La condition (N) est nécessaire pour que o converge vers une limite indé- 
pendante du choix des £; quand on considére une suite déterminée de divisions 
D telle que 6 tende vers zéro. Mais il faut aussi que T et ¢ convergent chacun 
vers une limite unique. Et comme ces limites sont respectivement comprises 
entre A et A’, a et a’, et que la condition (N) assure seulement |’égalité A = a 
avec A’ > A = a 2 a’, il faut que T et ¢ tendent précisément vers A eta. C’est 
4 dire, comme il ressort de l’analyse faite plus haut, que tout point r; doit 
appartenir 4 la suite des D 4 partir d’une valeur assez petite de 6, si en ce point 


8 (C(r;)) — Cr; — 0)] + 8; 1C(r; + 0) — Cr] ¥ 0. 
Or si la condition (N) est réalisée, on a 
s,(C(r)) — C(r; — 0)] + sf(C@; + 0) — Cr] = 0, 
et par suite, en ajoutant, 
(8; + 8;)(C(r; + 0) — Cr; — 0)] #0, 
ou, puisque 8; + 8; ~ 0, 
C(r; + 0) — Cir; — 0) #0. 


Les points 7; ot cette inégalité est vérifiée sont les points de discontinuité 
communs 4 f et C. Le choix de divisions que nous avions donc indiqué plus 
haut comme simplement suffisant pour assurer l’unicité de chacune des limites 
de T et t, quand on ne sait rien sur la réalisation de (N), devient donc nécessaire 
quand (N) étant supposée réalisée, on veut assurer |’égalité des limites de T' et 
de ¢ et par suite l’indépendance pour la limite de -—et par suite de e—du choix 
des £; dans les (x;-4, 2;). 

En résumé: pour que o tende vers une limite indépendante du choix des &; 
dans les (x;_1, x;), quand on considére les valeurs de ¢ correspondant A une suite 
de divisions D convenablement choisie et telle que 6 tende vers zéro, il faut et il 
suffit que (i) la condition (N) soit realisée, (ii) tout point de discontinuité com- 
mun A f(x) et C(x) appartienne a la suite des D 4 partir d’une valeur correspon- 
dante suffisamment petite de 6. 
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Autre définition de l’intégrale. On retrouve encore la condition nécessaire (N) 
mais qui devient aussi suffisante 4 elle seule quand on se place au point de vue 
de M. Glivenko.* Celui-ci étend au cas de l’intégrale de Stieltjes la définition des 
intégrales de Darboux et il généralise la définition de l’intégrale ordinaire en 
exigeant que leurs valeurs soient égales. Plus précisément, d’aprés M. Glivenko, 
l’intégrale de Stieltjes i S(x)dC(x) existe quand les bornes M et m des sommes 

8 
de Darboux sont égales, et leur valeur commune est la valeur de l’intégrale. 
Mais les intervalles de variation de x et les variations correspondantes de C(x) 
sont supposés définis de fagon convenable.‘ 

Sans avoir besoin de faire varier 6, on voit en comparant les expressions (5) et 
(7) de T et de A, qu’on a toujours T => A. D’ot. M 2A. Or on vient de voir 
que pour certaines suites de divisions, T tend vers A; comme sa limite ne peut 
étre que >M,onadoncA =>M. Finalement M = Aetdemémem=a. Dés 
lors, pour que M = m, il faut et il suffit que A — a = 0 et on retombe bien sur 
la condition ci-dessus. 

D’ailleurs, écrivons 


DL S(a(C(a) — C(xi)] = » g(xi)[(C(a;) — C(aia)] + Zz S(x:)[C(x;) — C(a-)). 


Si M’ et M”’ sont les bornes inférieures des deux premiéres sommes, on voit 
qu’on a 


Dd S(xi){C(ai) — C(z,.)) > M"” + M, 


d’ot M’ => M’’ + M. D’autre part, pour tout « > 0, il existe une division 
D, pour laquelle T < M + e, et, puisque ¢ est continue et bornée, un nombre 
p tel que pour 6 < p, on ait 


> e(x[C(x;) — C(aia)] < M” + €. 


Adjoignons aux points de D, des points choisis de sorte que pour la division 
D obtenue, 6 devienne < p. Cette opération ne peut que diminuer la seconde 
somme. Pour cette division D, la deuxiéme et la troisitme sommes sont done 
respectivement inférieures 1. M’’ + «,M +. Désiors, la premiére quiest = M’ 
sera < M'’ + M + 2¢. Des inégalités ainsi obtenues 


M"+Ms M's M"+4+M + 2¢, 


vérifiées quel que soit ¢, on tire M’ = M+ M’’. Avec des notations analogues, 
on établirait de méme que m’ = m + m’’, d’od M’ — m’ = M — m+ M" — m"” 
= M — m, puisque, ¢ étant continu et borné, on a, comme on sait, M’’ = m’’. 


4 Sur les sommes de variables aléatoires. Ce travail (en francais) qui doit étre imprimé 
dans les travaux du Séminaire des Probabilités de l'Université de Moscou et dont M. 
Glivenko a bien voulu me communiquer une copie, sera reproduit dans l’ouvrage (en russe) 
intitulé Intégrale de Stieltjes, par Valére Glivenko, 1936. 
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Dés lors, pour que M’ = m’, il faut et il suffit que M — m ou encore A — a 

= 0, ce qui conduit encore A la condition déja signalée plus haut. Celle-ci est 

donc la condition nécessaire et suffisante pour que | f(x)dC (x) existe au sens de 
& 

M. Glivenko dans le cas actuel. 

Revenons 4 la limite de ¢. On sait que o; a une limite unique indépendante 
du choix des £; dans les segments (x1, 2;) et de la suite des divisions D, pourvu 
que 6 tende vers zéro. Pour qu’il en soit de méme de o = o; + 1, il faut done et 
il suffit qu’il en soit ainsi pour r. La condition cherchée résultera de la combi- 
naison de la derniére condition obtenue® et de la condition de M. Steffensen. 
C’est A dire que f(x) et C(x) ne devront avoir aucun point de discontinuité 
en commun. 

Remarque. Dans tout ce qui précéde nous avons supposé, par exemple, f et 
C tous deux non-décroissants. On raméne 4A ce cas celui od f ou C ou tous les 
deux seraient non-croissants en remplagant f ou C par —f ou —C. 

En résumé: Soient f(x), C(x) deux fonctions monotones sur le segment ab, 

b 


pour que l’intégrale [ f(x)dC(x) existe, il faut et il suffit 


I. Si on la définit comme la valeur commune de la borne supérieure de 
XL Se) (C(x) — C(x-1)] 
et de la borne inférieure de 
L Sa) (Cle) — Ca, 


que, de chaque cété de chaque point x, l'une ou l'autre des deux fonctions f(x) et C(x) 
soit continue; 
II. Si on la définit comme limite unique de 


c= Dd f(g) (C(as) — C(x-.)] 


pour un choix quelconque des £;, mais pour une suite de divisions convenables 
telle que 6 tende vers zéro, que, de chaque cété de chaque point x, l'une ou l'autre 
des deux fonctions f(x) et C(x) soit continue et alors que la suite des divisions con- 
sidérées comprenne chaque point de discontinuité commun a C(x) et d f(x) a partir 
d’une valeur de 6 assez petite (pouvant éventuellement varier avec ce point); 

III. Si on la définit comme limite unique de ¢ quels que soient le choix des £; 
dans les segments (x;-;, x;) et aussi quelle que soit la suite des divisions D 
pourvu que 6 tende vers zéro, que f(x) et C(x) n’aient aucun point de discontinuité 
en commun. 


5 Car celle ci, nécessaire en général, devient suffisante quand on la combine avec celle 
de M. Steffensen, puisque dans ce cas la différence des limites de T et t non seulement est = 
A — a, mais, étant < A’ — a’, est alors égaleA A —a = A’— a’. 
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Calcul de | S(x)dC(x). Chemin faisant nous avons obtenu tout ce qu’il faut 
4 


pour calculer l’expression de / S(x)dC(x). Quand elle existe au moins selon les 
' 


definitions I ou II, elle est égale 4 A = a, done aussi A 


“+ ow 24; | on te =] + P 8; E os ss Ia +9), 


Mais nous sommes dans le cas ok A — a = 0 et od par suite 
»» s;[C(r;) — C(r; — 0)] = p> s;(C(r; + 0) — C(r;)] = 0. 
Dés lors, on voit qu’on aura 
3(A +a) = 2X sill — C(r;)| + p» sj (L — C(r;)] = > s[L — C(r;)], 


en appelant s; le saut total s; + s; en r;; d’od finalement 


[ scoace = (L —1)(H —h) — ¥ 8,[C(r) — I). 

v 7 

On remarquera que si l’on pose y(x) = C(x) — l pour avoir en y(x), comme en 
S(z), une fonction (monotone bornée) ayant comme borne inférieure zéro et si 
l’on appelle 2, Q les bornes supérieures de S et de y(x), on aura 


[ sane = 2Q — & s7(r)) = [ sac). 


Dans les cas ot [ f(z)dC(x) existe au moins suivant la définition I ou la 
a 


définition II, l’intégrale | C(x)df(x) existe aussi, la condition A cet effet faisant 
Vv 


intervenir symétriquement fet C. Orona 


> f(ziC(ai) — C(aia)] + . C(aja)f(ai) — f(aj-)] = f(an)C(an) — f(a) C(a1), 


et 


> s(x) (C(x) om C(a4-)] + > C(x) [f(2;) — S(a;-)] = LH — lh. 


En prenant des divisions ot chaque point de discontinuité commun a f et C 
s’introduise pour 6 assez petit, et faisant tendre 6 vers zéro, on obtient ainsi la 
formule d’intégration par parties 


8 8 
i f(x)dC(x) = [f(x)C(x)\2 - | C(x)df(z), 


a 
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et plus généralement 


[ sea) = LH — lh — [ecomarey, 


formules établies ainsi dans tous les cas ot f(x) et C(x) sont deux fonctions 
monotones et bornées telles que de chaque cété de chaque point de z du segment 
limité ou illimité V, l’une ou l’autre des fonctions f(x), C(x) soit continue. 


Cas oi f et C sont des fonctions 4 variations bornées. On peut étendre 
les résultats précédents aux intégrales ot F et v(x) sont A variations bornées sur 
V. On sait qu’alors F(x) et v(x) sont chacun différence de deux fonctions 
non-décroissantes F = fi — fe,v = C; — Ce. Et si les variations totales de F 
et de v sur l’ensemble des points de V sont finies, les fonctions fi, fe, Ci, C2 seront 
bornées sur V. Alors, la somme 


=  ® F(&) [v(z;) — v(ai)) 


sera la somme algébrique de quatre sommes analogues mais formées chacune a 
partir de deux fonctions monotones bornées. Or les quatre fonctions fi, fe, C1, 
C2 peuvent étre choisies de fagon 4 n’avoir pas d’autres points de discontinuité 
4 droite que F et v respectivement et n’avoir pas d’autres points de discontinuité 
4 gauche que les mémes fonctions. Dés lors: si de chaque cété de chaque point 
x, l’une au moins des fonctions F et v est continue, la somme o tendra, quand 
5 — 0, vers une limite indépendante du choix des £; dans les (x;_;, x,;) et du choix 
de la suite des divisions D pourvu que tout point de discontinuité commun A F 
et v appartienne A la suite des D A partir d’une valeur assez petite de 6. Cette 


limite unique pourra étre prise comme définition de / F(x)dv(x) quand F et v 
Vv 


sont 4 variations totales bornées sur l’ensemble des points de V. 


Seconde partie. Cas oi dans i g(x)dC(x), o(x) est continu mais le domaine 
d’intégration illimité 

Dans ce qui précéde, nous avons admis comme évident que |’extension de la 

définition et des propriétés classiques de | g ¢(x)dC (x) ot ¢ est continu et C mono- 


tone au cas ot le domaine V d’intégration est illimité (dans un ou deux sens) ne 
présente pas de difficultés quand ¢ et C sont bornés. Dans ce cas, comme ¢ 
reste uniformément continu sur l’ensemble total des points du domaine d’inté- 
gration, on démontre, en effet, que la somme habituelle 


F o(&) (C(x) — Cla), 


qui est ici une série absolument convergente, a une limite indépendante du choix 
des £; dans les (x;,, x;) et du choix de la suite des divisions D pourvu que 6 
tende vers zéro. 
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Nous allons examiner le cas moins simple ot ¢ encore supposé continu n’est 
plus supposé borné sur le domaine illimité V pour lequel, pour préciser, nous 
prendrons (—*, +). 


2 


Une seconde définition de |’intégrale / ¢g(x)dC (x). On peut définir 


lintégrale / 


vers — © et + ©. II est souvent plus commode d’utiliser une définition 
équivalente plus directe. Nous allons indiquer celle-ci. 

Supposons done que C(x) soit une fonction monotone (par exemple, non-dé- 
croissante) et que g(x) soit une fonction continue queleconque. Pour assurer la 
validité de notre raisonnement, il nous sera nécessaire de supposer C(x) bornée 
(ce qui n’est pas indispensable pour que |’intégrale ait un sens, mais qui est en 
tout cas réalisé dans le cas important od C(z) est une “fonction des probabilités 
totales’’). Soit maintenant ---, 2m, ---, L1, Yo, Xi, Za, -++, Lay ++, Une 
suite croissant de — » A + » et provisoirement arbitraire. On aura 


20 


b 
¢(x)dC(x) comme la limite de i ¢(x)dC(zx) lorsque a et b tendent 


c] a 


J= [ g(x)dC (zr) = > t: g(x)dC(z) ’ 


=—e ° 
. zy 1 


si on suppose que l’intégrale du premier membre existe, au premier sens indiqué 


b 
c’est A dire qu’elle est la limite de / g(x)dC(x), de quelque facon que a et b 


tendent indépendamment vers — ~ et + «©. Cette derniére circonstance sera 
b 
sirement réalisée si l’on suppose que i | g(x) | dC(x) est bornée quels que soient 


a et b, c’est A dire que |’intégrale J est absolument convergente, hypothése od 
nous allons maintenant nous placer. 

Or, en désignant par £; un point arbitraire de l’intervalle et par m;, M;, les 
bornes de g(x) dans cet intervalle, il est clair que | ; g(x)dC(x) et 


¢(&)[C(x,) — C(xi-1)] sont compris entre m,[C(xz,;) — C(x;1)] et M{C(z,) — 
C(z;)]. En posant wo; = M; — m; on aura done 


| : g(x)dC(x) _ ¢(éi) [(C(ai) — C(x;-1)] | S a; [C(a) _ C(x;_,)] ’ 


-1 


d’ou 





(1) th e(x)dC(x) — 2 of) (C(x) — C(xi)]| S , wi (C(x) — C(aia)] - 


La fonction g(x) étant partout continue, il est toujours possible de choisir les 2; 
de sorte que les oscillations w; soient inférieures 4 un méme nombre arbitraire 
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w > 0. Supposons qu’il en soit ainsi; on aura, en représentant les deux termes 
de (1) par J;,, et S;,x 


k 
(2) | Jie — Sir|<o . % (C(x) — C(ai.)] = w(B — A) ’ 
i=1-—j 
A et B étant les deux bornes de C(x) sur toute l’ensemble des valeurs de z. 
Appelons Ji. 7 ae w,, J’, w; les valeurs prises par J;,;, S;,x, @;, J, # quand on 
y remplace yg par |y|. Ona nécessairement w, < w; <w. Donc les w;, ont bien 
une borne supérieure finie w’ < w. On aura de méme 


Sj.x <Jj,.+(B-A)s [lie ace) +w'(B— A). 


Quand j et k croissent, S; , croit, ou du moins ne décroit pas, et reste inférieur, 
d’aprés cette inégalité, 4 un nombre indépendant de j et de k. Done la série 


(3) S = a ¢(&) (C(x) — C(xi-)] 


t=—o 
est absolument convergente et l’on a 


S = lim Sj.e- 
j-7-@ 
k++ 


On tire alors de l’inégalité (2), en passant 4 la limite, | J — S| < w(B — A), 
d’oi J = lim S. Réciproquement, lorsque les x; forment une suite croissant 


«0 
de — © A + et choisis de fagon que les oscillations w; de g(x) dans les 
intervalles z;_;, x; aient une borne supérieure finie w, si la série S est absolument 
convergente pour au moins un choix des £; dans les intervalles z;,, z;, alors 
lintégrale J est absolument convergente et on a J = lim S. En effet, on a 


w0 
encore 
| Sj,2 —Jj,2| So (B—A), 
d’ot 
ae < Bsa +o (B-—A)<s pe | o(&) | (C(x) — C(aia)] + w'(B — A). 


‘=—o 


Done J;,, ayant une borne supérieure finie indépendante de j et de k, 
lintégrale J est absolument convergente. Alors en faisant tendre j et k vers 
— «et-+ o dans la formule | S;, — J;,.| S o(B — A), on aura | S—J/| 
w(B — A),d’od J = lim S. 


Remarque. Nous venons de démontrer que si ¢(z) est une fonction partout 
continue, si C(x) est une fonction monotone bornée, la condition pour que 


Vintégrale J = [ g(x)dC(x) soit absolument convergente est qu’il existe au 


moins un nombre w > 0, et, — en divisant la droite illimitée par une suite crois- 
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sante de nombres z; tels que l’oscillation de g(x) dans chaque intervalle (z;_,, z;) 
soit < w — , un choix de nombres £; dans les intervalles respectifs (2;:, x;) tels 
que la série (3) soit absolument convergente. 

Et s’il en est ainsi, on a J = lim S quel que soit le choix des £; dans les 


w—0 
(zi:1, 2:). La démonstration suppose essentiellement que C(x) est borné. On 
peut voir de plus que la validité de l’énoncé cesse si l’on supprime cette con- 
dition. Il suffit de prendre l’exemple suivant. 
Considérons le cas particulier ot C(x) = x et ot g(x) est une fonction continue 
paire, jamais négative, et choisie dans chaque intervalle 2(k — 1) S x S 2k, 
2k 


telle que l’on y ait 0 S g(x) S 1/k, d’ov il résulte 0 < i g(x)dx S 2k, et que 
2k-2 


2k 
de plus l’on ait g(2k — 1) = k“, [ g(x)dz = 2k, pouurk >1. Alors, on voit 
2k-2 


qu’ici l’intégrale J = i o(x)dC(z) sera finie et égalei4 >> k-. Pourtant, soit 


= k=1 

w un nombre positif arbitraire; il existe un entier N tel que 2 < Nw. On pourra 
done prendre les x; tels qu’A partir d’un certain rang q (variable avec w) on ait 
Dg = 2N; tour = AN + 1), --- , Torp = AN + p), ---, car alors dans 
(Te+p-1, Top), ON aura | v(x’) — g(x’’) | < g(x’) + g(x’) < 2/(N + p) <a. 
Alors en prenant £,,, = 2(N + p) — 1, on aura pour tout entier K 


: K+q K 1 
sS> 2D olE) (a r-1) > Lowapa i? 
et le dernier terme croit indéfiniment avec K. Donec J est fini et S est infini, 
et infini quel que soit w; on ne peut done avoir J = lim S. 
a0 

La proposition s’étend d’ailleurs au cas ot C(x) est remplacée par une fonction 
v(x) A variation bornée en spécifiant convenablement les conditions d’application. 

On sait que si V(x) est la variation totale de v(x) dans l’intervalle (a, z), 
(a < x) et si l’on pose v(x) = v(a) + Pi(x) — N(x), V(x) = Pi(x) + N(z), 
les fonctions P,(xz) et N(x) sont non-décroissantes de sorte qu’en posant 
P(x) = v(a) + P,(z), v(x) est la différence de deux fonctions non-décroissantes, 
v(x) = P(x) — N(z2). 

Le résultat subsiste pour x < a, si l’on égale alors 4 — V(z) la variation totale 
de v(x) dexaa. Sidonc v(x) est A variation totale bornée dans tout intervalle fini, 
v(x) est la différence de deux fonctions partout non-décroissantes. Si maintenant 
la variation totale de v(x) dans un intervalle (a, b) est bornée quand b tend vers 
+2 comme on a | v(x) — v(a)| S V(z), v(x) restera borné comme V(x) pour 
x > aetilen sera de méme de P(x) et de N(x); en particulier, P(+ ©) et N(+ ~) 
auront une signification et des valeurs déterminées. Si, méme, la variation 
totale de v(x) dans tout intervalle a une borne supérieure indépendante de cet 
intervalle, les fonctions P(x), N(x) seront bornées supérieurement et inférieure- 
ment. C’est dans ce cas que nous pourrons généraliser le théoréme établi ci- 
dessus, en posant: 


b b b 
I g(z)dv(x) = I g(x)dP(x) — [ o(x)dN(z). 
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La condition nécessaire et suffisante pour que les deux intégrales du second mem- 


bre soient absolument convergentes est évidemment que / | g(x)! d[P(x) + N(zx)] 


et par suite il |y(x) dV(x) soit convergente. On a d’ailleurs 


[Zetaoa)| s [7 \e@ \ave). 


On a alors la proposition suivante. Si g(x) est une fonction continue partout, 
si v(x) est une fonction dont la variation totale dans un intervalle a, b a une 


borne supérieure indépendante de a, b, alors pour que les intégrales ¢(x)dv(x) 


et | g(x) | dV(x) soient 4 la fois convergentes, il faut et il suffit que, pour au 


moins une suite de nombres z; croissant de— © 4 + et tels que les oscillations 
w; de g(x) dans les intervalles (x;:, z;) aient une borne supérieure finie w, et 
pour au moins une suite de nombres pris dans les intervalles (x;_;, z;), la série 


oo 


2. | o(&) | [V(ai) — V(ai-)] 


i=—o 


soit convergente. Et alors on aura 


wo 


/ o(x)do(x) = lim DL ol&) (v(x) — v(x) 
quel que soit le choix des £; dans les (x;-, 2;). 

Observons que si g(x) est uniformément continue non seulement dans tout 
intervalle fini, mais sur la droite illimitée (ce qui, par exemple, a lieu pour 
g(x) = 2, mais non pour g(x) = 2”) la condition que tous les w; soient inférieurs A 
w sera réalisée en prenant tous les x; — x;_, inférieurs 4 un méme nombre 6 assez 
petit. Alors l’égalité (2) subsistera en remplagant w par 6. Si g(x) est continue 
et bornée, g(x) est uniformément continue et le résultat précédent s’applique. 


Mais ce n’est pas le seul cas. En particulier, pour que | rdC (x) existe, il faut 


et il suffit qu’il existe au moins une suite de nombres 2; croissant de — ~ A +0, 
dont les intervalles x; — x;; ont une borne supérieure finie 6 et une suite de 
nombres §; dans les intervalles (x;1, x,), telles que la série 


S= ¥ slC@) — Ce.)I 


oO 
soit absolument convergente. Et alors, on a / xdC(x) = lim S. Ce cas par- 


—o 6-0 


ticulier est intéressant en Calcul des Probabilités ot la valeur moyenne d’une 


rdC (x). 


variable aléatoire s’exprime précisément sous la forme / 
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NEW THEOREMS AND METHODS IN DETERMINANT THEORY 


By Leonarp M. BLUMENTHAL 


Introduction. If to each ordered pair of undefined elements p, q of an 
abstract space (set) S, a real, non-negative number pq can be attached such that 
pq = qp, and pg = 0 if and only if p: is identical with q, the space S is said to 
be semimetric. The elements p, g may be spoken of as points of the space, with 
pq as their distance. A given semimetric space S is characterized metrically when 
conditions are stated (in terms of distance relations) which are necessary and 
sufficient for any semimetric space satisfying them to be mapped isometrically 
(congruently) upon S. Among those semimetric spaces which have been charac- 
terized metrically are the n-dimensional euclidean,' spherical,? and hyperbolic* 
spaces. 

In this paper results obtained in the metric characterization of these spaces 
are introduced for the purpose of deriving new theorems concerning certain 
types of symmetric determinants. The application of isometric geometry to 
determinant theory furnishes a new and powerful impetus for its development. 
By such an application one obtains elegant proofs of novel and interesting theo- 
rems. These new methods are well adapted (1) for proving whole chains of 
theorems, as in §§1 and 5, (2) for the determination of relations between the 
elements of a determinant, as in Theorems 3.1 and 5.2, and (3) for ascertaining 
the sign of certain determinants, whose elements are not explicitly known. 

While only determinants with real elements are treated in this paper, the 
development of the theory of complex metric spaces, already under way, may be 
expected to furnish results that can be applied to determinants with complex 
elements.‘ 


Received August 29, 1935; presented to the American Mathematical Society, October 26, 
1935, under the title Jsometric geometry methods in determinant theory. The author is 
National Research Fellow. 

1 Menger, Untersuchungen tiber allgemeine Metrik, Mathematische Annalen, vol. 100 
(1928), pp. 75-163. This paper is divided into three parts; the second part (Zweite Unter- 
suchung, pp. 113-141) contains a characterization of the n-dimensional euclidean space in 
terms of relations between the distances of its points. In a later paper, New foundation 
of euclidean geometry, American Journal of Mathematics, vol. 53 (1931), pp. 721-745, the 
concept of quasi-congruence order is introduced. 

2? Blumenthal, Concerning spherical spaces, American Journal of Mathematics, vol. 57 
(1935), pp. 51-61. See also vol. 55 (1933), pp. 619-640, as well as L. Klanfer, Metrische 
Charakterisierung der Kugel, Ergebnisse eines mathematischen Kolloquiums, Wien, Heft 4 
(1933), pp. 43-45. 

3 Blumenthal, The metric characterization of the n-dimensional hyperbolic space, Bull. 
Amer. Math. Soc., vol. 41 (1935), p. 485 (Abstract). 

4A. Wald, Kompleze und indefinite Raume, Ergebnisse eines mathematischen Kollo- 
quiums, Wien, Heft 5 (1933), pp. 32-42. 
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Part I. The determinant | r;; |, ri; = rji, ric = 1 (7,7 = 1, 2, --- , m) 


1. In this section we deal with determinants of the above form which satisfy 
the additional hypothesis that r;; > 1 (7,7 = 1,2, ---,m;i # Jj). Since these 
determinants play an important réle in the metric characterization of the 
n-dimensional hyperbolic space, we shall denote them by the letter H. 

We prove first the following 

THEOREM 1.1. Let pi, po, ---, Parr be n + 1 points of the n-dimensional 
hyperbolic space H,,,, of curvature —1/r°. The determinant | cosh p,pj/r | 
(i,j = 1,2, ---,m + 1) has the sign (—1)" tf pu, po, --- , Pasi are independent 
(i.e., notin Hm,,,m <n), and vanishes otherwise. 

If we multiply the first column of the determinant by cosh p,p,/r, and subtract 
the result from the k-th column, k = 2,3, --- , + 1, we find, after applying the 
law of cosines for hyperbolic geometry, 


n+l 
= (—1)"- | cos py: pipy |G, ima, ---, ng) * I] sinh? a 


k=2 


cosh 











PiPi 
r 


where pi: pip; denotes the angle formed at p, by the rays pip;, pip;, and 
| cos pi : pip; | (i, 7 = 2, 3, --- , nm + 1) denotes the determinant of order n of 
these elements. It remains to show that this determinant is positive if 
Pi, P2, +++ » Pays are independent points, and zero otherwise. 

But this is immediate, for the ‘‘n-bein” formed in H,,, by the rays joining the 
points po, Ps, --- , Pn4i to pi may, as is well known, be imbedded isogonally in a 
euclidean n-dimensional space. Since, now, the determinant | cos p; : pip; | 
(t,j = 2,3, --- ,m + 1) is invariant under an isogonal transformation, and since 
this function formed for n rays in a euclidean space is positive if the rays are 
independent and zero otherwise, the lemma is proved.® 

TuHeoreM 1.2. If the determinant H is of order m > n + 3, and if (i) for each 
integer 1 S k S neach principal minor of order k +- 1 of H vanishes or has the sign 
(—1)*, and (ii) each principal minor of order n + 2 vanishes, then H vanishes. 

Proof. Since r;; > 1 (¢ ¥ j), we may write 


r= cosh pip;/r (i,j = 1,2,--- ’ m) ’ 


where the points pi, --- , Pm form a semimetric set and r > 0. From the 
hypotheses (i), (ii) we may conclude that each set of n + 2 of the points 


5 The relation satisfied by the ten distances of five points in a three-dimensional hyper- 
bolic space was first given by Schering, Die Schwerkraft im Gaussischen Réume, Gottinger 
Nachrichten, 1870, pp. 311-321, without, however, indicating how it was obtained. Ina 
later paper (Géttinger Nachrichten, 1873, pp. 13-21) he stated the analogous relation for 
n + 2 points in an n-dimensional hyperbolic space. P. Mansion, Ann. Soc. Sc. Brux., 
vol. 15 (1890-1891), pp. 8-11; vol. 19 (1894-1895), pp. 189-193 deduces Schering’s five-points 
relation. The method used in the lemma above is completely different from that used by 
Mansion. So far as the writer is aware, the determination of the sign of the function 
|cosh p; p;/r| for n + 1 independent points of H,,-, given above, is not in the earlier 
literature. 
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Pi, -*+, Pm is congruent with n + 2 points of the hyperbolic n-dimensional 
space H, of curvature —1/r*, while from the fact that H, has the quasi- 
congruence order n + 2, it follows that the m points are congruent with m 
points of H,..6 

Using Theorem 1.1, we conclude that H = 0. 


2. Let us suppose, now, that 0 S r;; < 1 (7,7 = 1,2,---,m;i #7). De- 
terminants of the form we are considering in Part I which satisfy this additional 
requirement we denote by A,. Concerning determinants A, we prove the 
following ; 

TuHeoreM 2.1. If the determinant A, is of order m > 4, and has each third- 
order principal minor equal to zero, the determinant A, vanishes. 

Proof. From the hypotheses made on the elements r;; we may set ri; = 
cos pip;/r, r > 0 (i,j = 1,2, --- , m), with the points pi, peo, --- , Pm forming 
a semimetric set, and pp; S mr/2 for every pair of indices. 

From the metric characterization of the circle’ we may conclude from the 
vanishing of every third-order principal minor, together with p:p; < mr (i,j = 
1, 2, --- , m), that each triple of points contained in the m points pi, po, --- , Dm 
is congruent with three points of a circle of radius r. (The distance of two points 
of the circle is defined as the length of the shorter are joining them.) Each triple 
is, moreover, linear (i.e., congruent with three points of a line), for otherwise the 
sum of the three distances would equal 2zr, which is impossible. 

Since the line has the quasi-congruence order® 3, and by hypothesis m > 4, 
it follows that the m points pi, po, --- , Pm are linear and consequently con- 
gruent with m points of a circle of radius r (since no distance exceeds mr/2). 
Then the determinant A, vanishes. 

It is believed that Theorem 2.1 is the first link in a chain of theorems which 
we state as follows: 


If the determinant A, of order m > n + 3 (n an integer) is such that (i) every 
principal minor of order not exceeding n + 1 is positive or zero, (ii) every principal 
minor of order n + 2 is zero, the determinant vanishes. 


3. In this section we deal with determinants 


Ayv=|rij|, rg=Tyi,-—l<ryjS0( #9), ri=l1, (i,j =1,2,---,m), 


and concerning them we prove the following 

THEOREM 3.1. If the determinant Ay is of order m > 4, and all of its third- 
order principal minors vanish, then r;; = — 34 (i,j = 1,2, ---,m;i # Jj) and 
Aw = — 3 (3/2)"™"! (m — 3). 


® Blumenthal; see footnote 3. 

7 Blumenthal and Garrett, Characterization of spherical and pseudo-spherical sets of 
points, American Journal of Mathematics, vol. 55 (1933), pp. 619-640. See §1, p. 620, for 
the circle. 

® Menger, New foundation of euclidean geometry, p. 727. 





of 


or 
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Proof. We write ri; = cos pip;/r, r > 0, (it, 7 = 1, 2, --- , m), where the m 
points pi, P2, --- , Pm form a semimetric set. Since —1 < r;; S 0, we may as- 
sume that rr/2 < pip; < xr for distinct values of the indices. From the van- 
ishing of each third-order principal minor of Ay, together with pip; < 7, it 
follows that the set of points pi, po, --- , Pm has all of its triples d-cyclic, with 
d= rr. But no one of these triples is linear, for in a linear triple the sum of two 
of the distances equals the third, while any triple contained in p,, pe, --- , DP» has 
the sum of two of its distances greater than or equal to zr, while the third distance 
is less than zr. It follows readily that the set pi, pe, --- , Pm forms a proper 
pseudo d-cyclic set and hence is equilateral, with each distance equal® to 2d/3. 
Hence p,p; = 2d/3, and r;; = cos 2d/3r = — 3 (i,j = 1,2, --- ,m;i #7). Such 
a determinant is easily evaluated to yield the value — 3(3/2)""'(m — 3). 

Theorems 2.1 and 3.1 are in striking contrast. In the latter theorem the 
hypotheses serve to fiz every element in the determinant. 

There is reason to believe that the following chain of propositions is valid, but 
its proof must await, it seems, further development of the theory of pseudo 
r-spheric (S,,,) sets of points. 

If the determinant Ay is of order m > n + 3 (n an integer) and is such that (i) 
every principal minor of order less than or equal to n + 1 is positive and (ii) 
every principal minor of order n + 2 is zero, then 


rg = —1/(n + 1) (t,j7 = 1,2, ---,m;i #9), 
and 








n+1\n+1 


It is interesting to compare Theorems 2.1 and 3.1 with another theorem which 
the writer has proved in an earlier paper, and which is, in a sense, the “‘union”’ of 
these two theorems.” Let us denote by Ay(—4) the determinant r;;, with 
fy = — 3 (i # J), Vii = 1, (i,j = 1, 2, the , m). 

TuHeoreM. If the determinant A = | rj; |, rij = Ti, —1 < re < 1 FQ), 
ri = 1, (i,j = 1,2,---,m), is of order greater than 4, and has each of its 
third-order principal minors equal to zero, either A vanishes or A = Ay(—}). 

It is noted that increasing the range of the elements r;; (¢ # j) from the inter- 
val (—1, 0), open at the left, as in Theorem 3.1, to the open interval (—1, 1), as 
in the theorem above, does not sensibly increase the class of non-vanishing 


m—1L 
Ay = — . (2+3) (m—n— 2). 


® Blumenthal, A complete characterization of proper pseudo d-cyclic sets of points, Amer- 
ican Journal of Mathematics, vol. 54 (1932), pp. 387-396. A proper pseudo d-cyclic set is, 
by definition, a pseudo d-cyclic set that contains neither a convex tripod nor a pseudo- 
linear quadruple. A convex tripod has three of its triples linear, while all four triples of a 
pseudo-linear quadruple are linear. Hence, a pseudo d-cyclic set that contains no linear 
triples cannot contain a convex tripod or a pseudo-linear quadruple, and is, therefore, 
proper. 

10 Blumenthal, A chain of determinant theorems arising from the characterization of pseudo 
r-spheric (S,,,) sets, American Journal of Mathematics, vol. 56 (1934), pp. 225-232. 
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determinants that satisfy the remaining hypotheses of these two theorems. In 
the first case we have proved that such determinants have each element outside 
of the principal diagonal equal to — 3; while the earlier theorem proves that each 
element outside the principal diagonal is equal to either 3 or —3, and that, 
further, the signs of the elements are so distributed that by multiplying certain 
rows and the corresponding columns by —1, the determinant is made identical 
with Ay(—}4). 

Though Theorem 2.1 is implied by the theorem quoted above, its proof 
demands far less than what is given by this theorem. This reason, together 
with the desirability of developing methods that can be generalized to prove the 
chain of propositions of which Theorem 2.1 is the first link, has led us to give a 
short independent proof of it. 


Part Il. The determinant!! D = l raj |, Tez = "ji > 0 (i ~ j), ‘ui = 0, 
umiteue, €iet1..-.0. 


4. It is convenient to emphasize the bordering of this determinant by writinz 





10 
it symbolically in the form D = | : (i,j = 1,2,---,m). We give first a 
ii | 
theorem concerning fifth-order determinants of this type. 
0 


TuHeEoreM 4.1. If the determinant D = (t, 7 = 1, 2, 3, 4) has each of 








lv ij 
its four bordered fourth-order principal minors negative or zero, then, for0 < k S 3, 
0 1) 


1 ri; 
(1) each of its four bordered fourth-order principal minors negative, (2) D® > 0, 
0<k <3, D® 20, and (3) k = 3 is the greatest exponent for which both (1) 
and D™ 2 0 are valid. 

Proof. Since ri; = rj; > 0 (i ¥ 9), ris = 0, we may set ri; = pip;, 
(7,7 = 1, --- , 4), where pi, pe, ps, ps form a semimetric set. By hypothesis, 
each bordered fourth-order principal minor of D is negative or zero. This is a 


the determinant D® = (i,j = 1, 2, 3, 4), of non-negative elements, has 








11 For the cases m = 3, 4, 5 this bordered determinant has a long history. Early papers 
by Cayley [Cambridge Mathematical Journal, vol. 2 (1841), p. 268] and Sylvester show 
that if five points p; (i = 1, 2, --- , 5) are in a three-dimensional euclidean space, the sixth- 
order determinant D, in which ri; (i, j = 1, 2, --- , 5) is the square of the distance pj p;, 
vanishes. This result had been obtained earlier in a different form by Lagrange, Carnot, 
and others. Since Cayley was among the first to discuss the behavior of this determinant 
when it was formed for the points of a euclidean space, and since Menger (loc. cit.) was the 
first to characterize metrically euclidean space by expressing the (necessary and sufficient) 
distance relations of semimetric spaces congruent with it in terms of the sign of the deter- 
minant D formed for these distances, the determinant may well be called the Cayley- 
Menger determinant. 
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necessary and sufficient condition that each triple of points contained in the 
four points be congruent with three points of a euclidean” plane Re. 

‘To prove part (1) of the theorem, we ea gy a semimetric set of four points, 
Pi» Pt» Ps» Pa» With rt j (p: P; ‘YY Gj = , 4), and show that for 
0 <k S 3, each triple of ‘ ‘primed” points is as with a triple of the plane 
R2, and is not linear. Consider the triple pj, p;, p;. Now it has been shown™ 
that if f(x) is any monotonic increasing function of the real variable x, which is, 
further, a concave function of this variable, and vanishes for z = 0, and if 
#1, 2, Z; are any three positive values satisfying the triangle inequality, the 
positive numbers f(71), f(Z2), (Zs) satisfy the strict triangle inequality. 

Since the positive branch of the function r*,0 < a < 1,z = 0, is evidently a 
function of this type, and since pip2, Pops, Pips are positive numbers and satisfy 
the triangle inequality, we see that the numbers (pips), (p2ps)*, (prps)* satisfy 
the strict triangle inequality (and hence the points p;, Ps, Pp, are non-linear and 
congruent with three points of a plane) for 0 < k S 4. Hence the bordered 


fourth-order principal minor 








|. | Gi = 1,2, 3) is negative. Similarly, the 
ij 


other bordered fourth-order principal minors are shown to be negative, and 
part (1) of the theorem is proved. 

Now, since each triple contained in the four points p, (i = 1, 2, 3, 4) is con- 
gruent with a planar triple, the four points determine twelve angles, namely, 
the angles of the four planar triangles. We remark that each of these twelve 
angles is acute, for0 < k < 4. For, let (DP; Ps. Pis) be any one of these angles. 
We have 


(pi,Pis)” + (Pi,Di.)® — (pi, Pi.” 
2(p;,P:.)(Pi,Pia) 





cos (p;,P:,P.,) = 


ae (D;, Pi) + (pi,p;,)" — (Pi, Pix)” 
2(Pi, Pi.) (Pi, Pi)” 
and since 0 < k < 3, then 0 < 2k < 1, and it follows that (p,,p;,)"* + (pi,p;,)** > 


(pi, pi,)*. Hence cos (Pi, PisPis) > 0, and since (PD; P..Pi,) is an angle of a tri- 
angle, we conclude that it is acute. 





: G,j = 1,2,3, 4) is 


positive or zero. To do this, we assume the contrary and deduce from this 
assumption a contradiction. The function D“ is a continuous function of k, 


To prove part (2), we show first that D® = 








2 This is at once evident upon developing one of these minors. Replacing ri; by pip’, 
we find, for example, that the minor formed for the points py, po, ps can be written in the form 
— (Pipe + Paps + Pips)(Pip2 + P2xps — Pips)(Pip2 — P2Ps + Prps)(—Prp2 + Peps + Pips), 
and since, by hypothesis, this minor is negative or zero, it follows that the numbers pips, 
Peps, Pips satisfy the triangle inequality; that is, the points pi, p:, ps are congruent with 

three points of a plane. 
13 Blumenthal, Note on the euclidean four-points property, Ergebnisse eines mathe- 
matischen Kolloquiums, Wien, Heft 7 (in press). 
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which is positive for k = 0 and, by our assumption, negative fork = 4. Hence, 
there exists at least one value k’ such that D“” = 0, and 0 < k’ < }. 

Consider, now, four points p; (i = 1, 2, 3, 4) forming a semimetric set and 
such that (pip3)? = ri; = (p;p3)". Then pip; = (p,p,)"", and since k’ < 1, 
it follows (1) that each triple contained in the four points p; (i = 1, 2, 3, 4) 
is congruent to a planar triple, and (2) since k’ < 3, each of the twelve angles 
determined by the “starred” points is acute. Combining D“? = 0 with (1), we 
conclude" that pj, p2, P3, Ps are congruent with four points of the plane Re. 
This yields the contradiction sought, for of the twelve angles determined by 
four points of a plane, at least one angle is greater than or equal to a right angle. 
Hence, not all of the twelve angles determined by the points pj, p>, ps, P; can 
be acute. This contradicts (2) above. We conclude, then, that D® = 0. 

In showing that D® = 0 we have shown that D™ cannot vanish for 0 < k < 3, 
and since D is positive, it follows that D® > 0 for 0 < k < 3, and part (2) is 
proved. 

We prove part (3) of the theorem by means of an example. Consider the six 
numbers ris = Tes = 4, Ti2 = Tes = 131 = Tra = 1. It is easily seen that the 
determinant D formed for these values has each of its four bordered fourth-order 
principal minors equal to zero, and hence satisfies the hypothesis of the theorem. 
If, now, we form the determinant D“ where k = }(1 + 6), 0 S « < 1, we find 
that each of the four bordered fourth-order principal minors of D“ is negative, 
while D® = —32.2%.(2*— 1). Hence, for « > 0 the determinant is negative; 
that is, for k > 4, D® < 0, and the proof is complete. 

Several remarks may be made concerning the foregoing theorem. It is of 
interest to observe that though no hypothesis is made concerning the sign of the 
determinant D, the extraction of positive k-th roots of its elements isfor 0 S k < 4 
sufficient to make the new determinant D“ positive, while D® = 0. This fact 
has an important geometrical application, which we give in the following 

Corotiary. If p; (i = 1, 2, 3, 4) are any four points of a metric space, for any 
non-negative number k, not exceeding 3, there exist four points p, (i = 1, 2, 3, 4) 
of a euclidean three-dimensional space such that p; D; = (p;p,)* (i, j = 1, 2, 3, 4). 

Thus, if M is any metric space, and we denote by M™ the space derived from 
M by taking the positive k-th power of its metric, then, for 0 < k < 3, the space 
M™ has every quadruple of its points congruent to four points of a euclidean 
space. 

The question arises whether the property proved by Theorem 4.1 is peculiar 
to the fifth-order determinants of the type considered, or whether, when the 
range of k is kept fixed, the theorem may be extended to determinants of higher 
order, the obvious additional hypotheses on higher ordered principal minors 
being made. In the proof of Theorem 4.1 we made use of the fact, from ele- 
mentary geometry, that not all of the twelve angles determined by four distinct 
points of a plane can be acute. This fact, sufficient for our proof, turns out to be 
also necessary. But for n > 2 it is not true that if n + 2 points are in an 


14 Menger, Mathematische Annalen, loc. cit., p. 136. 
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3 
points can be acute.'® Thus the theorem holds only for fifth-order determinants. 
Its restricted character, when regarded only as a theorem on determinants, is, 
however, atoned for when its interesting geometrical applications are taken into 
account. In addition, it raises two questions: (1) if a fifth-order determinant D 
satisfies the hypotheses of the theorem, what is the most general function f such 
that the determinant f(D) satisfies the conclusions that every fourth-order 
bordered principal minor of f(D) is negative or zero, and f(D) = 0, where 


f(D) = “ ‘tens (i,j = 1, 2, 3, 4), and (2) for what functions can the theorem 
W 


be extended to higher ordered determinants? 

We remark, finally, that if it is assumed that D is non-negative, the range of k 
for which the theorem with this added hypothesis is valid is probably the interval 
(0, 1). 


n-dimensional space R,,, not all of the3- q ‘4 ' angles determined by the n + 2 








1 


5. A chain of theorems concerning determinants D = » (i,j = 1,2,---,m) 





y 
is easily obtained from the characterization of the n-dimensional euclidean 
space R,,. 

THeroreM 5.1. Let D be of orderm + 1 > n + 4 (n an integer) and suppose (i) 
for every integer k = n + 1, each bordered principal minor of order k 4+- 1 has the 
sign (—1)* or vanishes, (ii) every bordered principal minor of order n +- 3 vanishes; 
then the determinant D vanishes. 

Proof. We introduce a semimetric set of points p; (¢ = 1, 2, --- , m), 
m > n+ 3, such that p,p} = ri; (i, j = 1,2,---, m). From hypothesis (i) 
it follows directly that every set of n + 1 points contained in the m points is 
congruent with m + 1 points of the R,; this, combined with hypothesis (ii) 
justifies a similar remark'® concerning each set of n + 2 points contained in 
pi(t=1,2,---,m). But the R, has been shown to have the quasi-congruence 
order n + 2; that is, in order that a semimetric set consisting of more than n + 3 
points be congruent with a subset of the R,, it is necessary and sufficient” that 
each group of n + 2 points contained in the set be congruent with n + 2 points 
of the R,. Since m > n + 3, it follows that the m points may be imbedded 
isometrically in the R,, and hence the determinant D vanishes." 

The order of D plays an essential réle in Theorem 5.1. The hypotheses (i), 
(ii) are not sufficient to prove the vanishing of the determinant in case its order 
equals n + 4. Determinants of this type [i.e., of order n + 4 and satisfying 


18 This fact was called to my attention by Mr. M. Ville. 

16 Menger, loc. cit. 

17 Menger, New foundation of euclidean geometry, loc. cit., p. 727. 

18 This is evident, since D represents, to within a constant factor, the square of the 
volume of the simplex determined by the m points. Since this simplex is degenerate (the 
m points being in R,, n < m — 8), its volume is zero. 
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hypotheses (i), (ii)] are of great interest in the characterization of the n- 
dimensional! euclidean space, but little is known about them for n > 1. For 
n = 1, the characterization of pseudo-linear quadruples’ enables us to prove at 
once the following 

TueoreM 5.2. If the determinant D = “ 
its bordered fourth-order principal minors equal to zero,” then either D vanishes or 
Tig = Ts, T23 = Tia, Tis = Tea, One of the positive numbers Vii, V re» V ris 28 the 
sum of the other two, and D = —32ri2723M13. 

It is easily shown that if a determinant D of order n + 4 satisfies hypotheses 
(i), (ii) of Theorem 5.1 and does not vanish, its sign is the sign of (—1)*, but 
neither the value of such a determinant, nor the relations existing between its 
elements (both explicitly exhibited for the case n = 1 by Theorem 5.2) has as yet 
been obtained for n > 1. 


: | (¢, 7 = 1, 2, 3, 4) has each of 
i 


UNIVERSITY OF VIENNA AND INSTITUTE FOR ADVANCED Strupy. 


19 Menger, Mathematische Annalen, loc. cit., p. 126. 
20 This is hypothesis (ii) for n = 1; hypothesis (i) is automatically satisfied, since r;; > 0 
(i # j). 














TEMPERATURE DISTRIBUTION IN A SLAB OF TWO LAYERS 
By R. V. CHurRcHILL 


The list of solved problems in one-dimensional heat conduction in composite 
walls does not seem to include the cases in which the initial temperature distribu- 
tion is arbitrary. The case of the semi-infinite composite solid with an arbi- 
trary initial temperature distribution has been treated recently by Lowan.! 
The solution of the corresponding problem for the wall of finite thickness seems 
desirable for the sake of completeness. It is solved here by application of the 
Laplace transformation. 

The problem under consideration is that of finding the one-dimensional distri- 
bution of temperature in a slab consisting of two layers of different materials 
whose outer parallel faces are held at fixed temperatures, when the initial tem- 
perature distribution in each layer is arbitrarily given. Let the thickness of the 
layers be a, b, and let x = 0 be taken as the surface of separation. Then the 
boundary conditions on the temperatures 7;(z, t), T(z, t) in the two layers 
may be written 


T,(—a, t) = 0, lim 7T,(z, t) = f(z), —a<2zr<0, 
(1) t-0 
T2(b, t) = ¢, lim T.(z, t) = g(z), 0<2z<b, 
t—0 
(2) T,(0, 1) = T:(0,t),  Ky2-T,(z, t) = Ke-T,(z, 1), z = 0, 
Ox Ox 


where K,, Kz are the thermal conductivities of the two layers. 

It is easily seen that the temperatures 7, and T: can be obtained by the com- 
position of known temperature formulas and a simpler unknown formula. Let 
each of the three pairs of temperature functions uw, we, v1, ve and wW,, We satisfy 
the conditions (2), and let 


u;(—a, t) = 0, v:i(—a, t) = 0, w:(—a, t) = 0, 


ue(b, t) = ¢, vo(b, t) = 0, w2(b, t) = 0, 
lim u;(z, t) = 0, lim »;(z, t) = f(z), lim w;(z, t) = 0, —a<2r<0, 
t—0 t—0 t—0 
lim ue(zx, t) = 0, lim v2(z, t) = 0, lim w(x, t) = g(x), 0O<2<b. 
t—0 t-0 t-0 


Then the temperature functions 


hi=m+%+ w, T2 = U2 + V2 + We 


Received October 28, 1935. 
1A. N. Lowan, Heat conduction in a semi-infinite solid of two different materials, this 
Journal, vol. 1 (1935), pp. 94-102. 
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satisfy the conditions (1) and (2). But the solution of the problem of finding 
the temperatures u;(z, t) and ue(z, t) is known,’ and it is evident that the formulas 
for the functions w, and w2 can be written at once from the solution of the 
problem of finding »;(x, t) and v2(xz, t). Hence we shall consider the problem 
solved when we find the formulas for the temperatures »;(z, t) and ve(z, ¢). 

The classical Fourier method of using an orthogonal set of functions leads to 
difficulties when applied to this problem. The method of the Laplace trans- 
formation which was introduced by Doetsch* enables us to transform our 
boundary-value problem into one in ordinary differential equations, whose 
solution is the Laplace transformation of the required solution. It has been 
shown by Carson‘ that this method is formally the same as that of using Heavi- 
side’s operators. A connection between the method of the Laplace transforma- 
tion and Carslaw’s®’ method of contour integrals will be seen at the end of this 


paper. 
The Laplace transformation of a function ¢(t) is defined by 


(3) Lie} = [ “e-* p(t)dt = y(s). 


The inverse of this operator is defined by the solution of the integral equation (3), 
o(t) = L-"{y(s)}. 


In particular, for real positive values of \ and s, the integral 


1 
—Xt —st]t as 
i ewe ; + i 





gives the useful result 


“> 
(4) a at a 


By integration by parts it is seen that the transformation (3) has the property 
(5) 114 o(0} = sL{6(0} — 6(0) = ay(s) — 600). 


?H. S. Carslaw, Introduction to the Mathematical Theory of the Conduction of Heat in 
Solids, 1921, p. 215. Carslaw’s solution, which he obtains by using contour integrals, can 
be found easily by using the Laplace transformation. 

3G. Doetsch, Ueber das Problem der Warmeleitung, Jahresber. deutsch. Math. Ver., vol.33 
(1924), pp. 45-52. F. Bernstein and G. Doetsch, Probleme aus der Theorie der Warmeleitung 
I, Math. Zeits., vol. 22 (1925), pp. 285-292, the first of a series of papers by these authors. 
Math. Zeits., vol. 22 (1925), pp. 293-306, vol. 25 (1926), pp. 608-626, vol. 26 (1927), pp. 89-98, 
vol. 28 (1928), pp. 567-578. 

‘J. R. Carson, Electric Circuit Theory and the Operational Calculus, 1926. 

5 H. S. Carslaw, footnote 2, Chapter 11. 
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The function ¢ may involve a parameter and the operator L is commutable with 
differentiation with respect to it.* 
The conditions which determine our temperatures v;(z, ¢) and v2(z, t) are 


(6) < nla, t) = ki 2 nz, t) —a<2z<0, 
(7) v;(—a, t) = 0, 

(8) lim v;(z, t) = f(x) —a<zr<0, 
(9) < alc, t) = ha % ala, t) 0<2x<b, 
(10) v2(b, t) = 0, 

(11) lim v2(z,t) = 0 0<2r<b, 

t—0 
(12) v(0, t) = v,(0, t), 
(13) Ki n(z, t) = Ky o(z, t) z= 0, 


where k; and kz are the thermal diffusivities of the materials in the layers 
—a<z< Oand0 < z< b, respectively. 

Let the Laplace transformation (3) be applied to the members of equations 
(6) and (9). Then according to the property (5) and the conditions (8) and (11) 
the transformed functions 


yi(z, 8) = L{v,(z, t)}, y2(z, 8) = L{v2(z, t)} 


must satisfy the equations 


(14) Ls y1(z, 8) = syi(z, s) — f(x) —a<2<0, 


(15) (xz, s) = sy2(z, s) 0<2<b. 


ka ys 


The applications of L to the conditions (7), (10), (12) and (13) give the following 
conditions, respectively: 


(16) yi(—a, s) = 0, 

(17) y2(b, 8) = 0, 

(18) yi(0, 8) = y2(0, 8), 

(19) Ki wile, 8s) = = wlz, 8) zr=0. 


§ For other properties of the operator L and its inverse see reference (3), second paper, 
and G. Doetsch, Die Integrodifferentialgleichungen vom Faltungstypus, Math. Ann., vol. 89 
(1923), pp. 192-207. 
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The equations (14) and (15) can be solved as ordinary linear differential 
equations involving the parameter s. Their general solutions can be written 


(20) yi(x, 8s) = an? + Be? + B(z, s), 
Vi Vi 
(21) y(z,s)= Ae ~+Be *, 
where Aj, B,, Ao, Bz are arbitrary constants and 
o ;: 
(22) ate, ) =e [sinh | 4/2 — 8) | soe 


When the constants are determined so that the conditions (16) to (19) are satis- 
fied, the solutions (20), (21) can be written 


(23) ylz,s) = B(—a, s) 


(ona (o/ 2) ~ en o/ 2) (of 
wn(ey/aouloye) eel s)aa oi) 


(24) ye(r,s) = — o8(—a, s) 


sinh 4/2 (b — | 
sinh (« / i) cosh (s Vt) + ¢ cosh (a i) sinh (3 /) 
where 


_Ki, /h 
(25) oak y/E. 


By changing to circular functions, equations (23) and (24) can be written 


am win fae (oy) om(o/2) 
-eoeoy/ Bono D002 DY/(V 2) 


o $(— a, s) sin 1/52 _ | 





+ A(z, 8), 








(27) yz, s) = — 


We) 
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where 


(28) ¢(z,s) = ree / sin ly — (x — | S(&) dé = B(x, 8), 


and the function F is defined by 
(29) F(a) = sin (aa) cos (bua) + o cos (aa) sin (bua) , 


in terms of the thermal coefficient 
‘ k 


Our problem is now resolved into one of finding the inverse Laplace trans- 
formations of these functions y; and ye given by equations (23) and (24) or (26) 
and (27), since 


v(x, t) = L {y(a, s)}, ve(x, t) = L {ye(zx, s)}. 


We shall proceed by first obtaining a formal solution of this transformation 
problem by a method which is logical provided the series used have the necessary 
properties of convergence. The functions »; and v2 so found will then be exam- 
ined, with the aid of contour integrals, to show that they satisfy the required 
conditions. 

In order to put the functions y; and y2 into a form in which the operator L~ 
may be applied we shall use the partial fractions theorem of Mittag-Leffler,’ 
which shows how a meromorphic function M(z) can be expressed as the sum of 
an integral function G(z) (rational or transcendental) and a series which is 
determined by the poles and principal parts of M(z). If M(z) has only simple 





poles z, (n = 1, 2,3, --- ) with residues p,, the theorem shows that 
M(z) = Gz) + > E on + Pe], 
n=1 — <n 


where P,,(z) are polynomials which may be necessary to satisfy conditions of 
convergence ; the degree and coefficients of P,,(z) depend upon p, and 2,. 

If the function M(z) has the form X(z)/Y(z), where X(z) is integral and Y (z) 
has an infinite number of zeros z,, none of which are repeated [we exclude all z, 
which are not poles of X(z)/¥(z)], the above expansion gives 
X(z) X(z,) 


- = G(z) r ’ 
Y(z) + 2 Y’(z,) (z — Zn) 


(31) 


provided the conditions of convergence are satisfied with P,,(z) = 0. 
It has been shown’ that the zeros a, of the function F(a), defined by (29), 
are all real, infinite in number, and not repeated; it is evident that —a,, is a zero 
7 See for example E. J. Townsend, Functions of a Complex Variable, 1915, p. 303, or K. 


Knopp, Funktionentheorie I1, (Sammlung Géschen, Nr. 703), p. 38. 
* Carslaw, footnote 2, p. 215. 
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if a, isa zero. It follows that the function F(+/ — s,,/k,) in the denominators 
of (26) and (27) has an infinite number of distinct zeros s, which are all real 
and negative except for s = 0, since 


(32) 8, = —k a). 


It is clear that s = 0 is not a pole of either y;(z, s) or y2(x, s); only poles of these 
functions are included in the set s,. 

Let the partial fractions expansion (31) be applied to the functions y; and ys 
in (26) and (27). Let Gi(z, s) and G,(z, s) denote the unknown functions G(z) 
which appear in these expansions; they are analytic throughout the finite s-plane. 
Also let 9; and 2 represent the partial fractions series part of the expansions, so 
that 
(33) yi(x, 8) = G,(z, 8) + n(x, 8) ’ y2(z, 8) = G2(z, 8) + G2(z, 8) ° 


Then according to (31), 


~ 


Ala, s)= —2 Vi, > ZJ VJ _ &. o(—a, 8») 


= 
= (VB—6VZ)--— 6 BaloyZ) 
(34) ry = ‘) (s — 8,) 
a V gars o(—a, s,) sin = Sn (x a a)| cos (6 | =) 


ey F’(4/5 k, *) ( _ 8,,) COs (« y =") , 


since s, are non-zero roots of 


Fy =!) = sin (« V 3) cos (6 4/ V i z) 
1 1 2 
+ 000s (a 4/) Z!) sin (6 4/5 i ‘Veo 


« W—s, 6(—a, ) sin | 4/ = (b — | 
(36) g(z,s) = 20Vk. > : 


Kad mer 
n=l F’ (y =) (s —s 





-2Vk D>, 


(35) 


Likewise, 


In order to determine the functions G; and Gz we can make use of the condi- 
tions (14) to (19) which were used to determine y and ye. It is quite easily 
seen that 9% and gj as given by equations (34) and (36) formally satisfy all the 
boundary conditions (16) to (19). Since y; and ye satisfy these conditions the 
corresponding conditions on G, and G_ are determined, according to equations 
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(33). The formal substitution of the expression for 9; in (34) into the differen- 
tial equation (14) leads to the equation 


3 
ox 


(37) ky ji(x, s) — si(z, s) 


» W/—s, o(—4, s,) sin | 4/ _ (rx +a Joos (3 V 52) 


2Vk, > — ma EY mene, Ge 
‘ (oF) (0 / =) 
, Ss [ sin la,(a + £)] f(é) dé sin [an(a + 2)] cos (a, ub) 


F’ (an) COS (a, @) . 




















n=1 


where a, are the positive zeros of F(a). But it is shown later on [see equations 
(44)—(48)], with the aid of contour integrals, that the series in (37) represents 
f(x) in (—a,0). Hence 9g; satisfies (14). Likewise, 


a 7 
ke andes 8) — 8 H(z, 8) 


0 


(38) » / sin [an(a + £)] f(&) dé sin [ex u(b — 2)] 
=-% > = a? 





ra Fe) 
and this series represents a function which vanishes in (0, b) [see equations (49)- 
(51)]. Since 9, and ge satisfy all the conditions which determine y; and ys, then 
G, = Gz = 0, and 

(39) yi(z, 8) = jz, 8) ’ yo(z, 8) = 92(z, 8) . 

This result was not entirely unexpected, since equations (34) and (36) indicate 
that j, and gj: have the limit zero as s becomes infinite through positive values, 
and this property can be shown to apply to y and y2 by using the expressions 
(23) and (24). 


When the inverse Laplace transformation is applied to the series in (34) and 
(36), the transformation (4) gives, in view of (39), 


L- fyi(ax, s)} 


2 V —8, o(—4, 8x) cin] 4/5 (x +0)| cos (3 V 5°) 
= —2Vk Zz, -- : = 2 


n=1 P'(Y/ =) cos (« V =") € ’ 


2 V—8, o(—4, sn) sin| y = (b — | 
L'y.(x, 8) = 20- Vk; bs —_——____,--= = ( 


_ ; ( 
~> ” — 8p 
= : (y ky ) 
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In terms of the positive roots a, = +~/ —s,/k, of the equation 
(40) F(a) = sin (aa) cos (bua) + o cos (aa) sin (bua) = 


these equations can be written 


sin [a,(a lente + £)] f(é) dé sin [a,(a + “I cos (bua) 


(41) (2, t) = ys Lom 


n=1 


‘ -~Gnk t 


—-—— — ¢ 


~F' (en) COS (dain) 


when —a < x < 0, and 


[is sin tate + &)] f(é) red sinlenu(b—-2)]) 
(42) w(2,t) = -26 >) =: Eee ae J en anit 


5! F’(@n) 





when0 <2 <b. 

It can be seen readily that our temperature formulas (41) and (42) formally 
satisfy all the conditions (6)—(13) on v; and v2 except possibly for the initial tem- 
perature conditions (8) and (11). In the special case ¢ = 1, the series in (41) 
and (42) for t = 0 become Fourier sine series; the first represents a function 
which is equal to f(x) in (—a, 0) and vanishes in (0, bu), and the second represents 
a function which equals of(ux) in (—a/u, 0) and vanishes in (0, b). Hence in 
this case the initial conditions are satisfied. But except in such special cases the 
set of functions sin (a,x) is not orthogonal, so we shall make use of certain con- 
tour integrals which are suggested by the series in (41) and (42) to test our solu- 
tion for the initial conditions. 

Consider the contour integral 


— oda, 





, [ sin [a(a + £)] f(€) dé sin [a(a + x)] cos (ab) 
(43) T(z, t) / — 


F(a) cos (aa) 


where —a S x S 0,t = O, in the complex plane of a = pe® from right to left 
over the infinite path P consisting of the two half-lines 6 = 7/8 and @ = 77/8, 
(0 > 0). (Because of the exponential factor it is essential that cos 26 > 0 for 
large values of p on P.) After being multiplied by any fixed power of p the 
integrand of J; has the limit zero as p becomes infinite on the path P, provided 
that either z or tis not zero. It follows from the ordinary convergence test for 
infinite integrals that the integral (43) converges on each half of the path P for 
either —a S x S Oandt>0O0,or-asS2<OQandt 20. It may be noted 
that this is still true if the integrand is replaced by its derivative of any order 
with respect to either z or ¢. 

The integral J,(z, t) is a continuous function of tatt = 0if -as2z<0. This 
can be shown for each half of the path P, on which a = pe‘*’* and a = pe”*’®, by 
writing the integrals in terms of p. The necessary properties of the real and 
imaginary parts of these integrals can be seen from those of the complex inte- 
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grands to show first that each integral is uniformly convergent in ¢tfor0 sts’, 
x < 0 and second, that the integrals are continuous. These results follow from 
two theorems on infinite integrals given by Carslaw.° 

When we put ¢ = 0, —a S z < 0, in the integrand of (43) and integrate 
around the boundary of a circular sector above the radii @ = 7/8 and @ = 77/8 
and under the circular are p = c, the result is zero because all of the poles of the 
integrand lie on the axis of reals. But the integral over this circular are ap- 
proaches zero as the radius becomes infinite, and so the integral over the path P 
must vanish. Hence we can write 


(44) I(x, 0) = lim I,(2,t) = 0 —-azx27r<0. 
t—0 


To see the relation between J,(z, t) and v;(z, t), note first that the integrand in 
equation (43) is an odd function of a, so the integral over the path P is the same 
as the integral over the path made of the right halves of the lines @ = +7/8, 
directed downward. Let an infinite sequence of circular arcs p = p; be used to 
join these rays and form circular sectors including a part of the positive real 
axis, just one definite p, being selected between each pair of adjacent poles of the 
integrand, say. As p, becomes infinite, it can be shown that the integral over 
the circular are approaches zero. The sum of the integrals over the radii 
approaches I;(z, t). Since the set of poles a, together with the positive roots of 
cos (aa) = 0 are the poles inclosed by this path, the theory of residues gives 
the result 





0 

o / sin [ax(a + £)] f(E) d(é) sin [ax(a + x)] cos (axpb) em *e* 

(45) I(x, t) =2 >) + : ene Ste eee, 
—. F'(ax) cos (a.a) — a F(a) sin (axa) 
where a; are the positive roots of 
F(a) cos (aa) = 0. 
Let X,, denote the positive zeros of cos (aa), so that 

(46) Am = (2m + 1)x/2a (m = 0,1,2,---), 


while a, denotes the positive zeros of /(a). One term in each denominator of 
the series (45) is always zero, and when we separate the terms in which a, = A» 
and simplify, equation (45) becomes 


sin [a,(a + £)] f(édé sin [a,(a + x)] cos (a,pub)emen*! 


I(x, t) = 2 y [. 


= 7 F'(en) COS (ana) 
Bh 0 
_ : b> a COS (Ame) S(E\AE COS (Ama )em mn! 
m=O 


°H. 8S. Carslaw, Introduction to the Theory of Fourier Series and Integrals, 1930, p. 196 
and p. 198. 
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It follows from our expression (41) for v;(z, t) that 


(47) v(x, t) = T(x, t) + : 2, COS (Ame) f(E\dE COS (Amare mts. 


m=0~ % 


According to equations (44) and (47) then, 


0 


lim v,(2,t) = -> COS (AmE) f(E\dE COS (Ama). 


0 _- 
= m=0 bs 


The series on the right is a Fourier cosine series representing f(x) in (—a, 0), and 
hence our function v,(z, t) does satisfy the initial condition 
(48) lim (x, t) = f(x) —a <2 <0. 
t—0 
By using the same paths of integration as before it can be shown in the same 
way that the contour integral 


n [a(a + £)] f(é)dé sin [au(b — x)] e~***i'da 


mt J(P) F(a) 





rc si 
(49) I,(z,t) = — £ f 


converges and is represented by the series (42), so that 
(50) vo(x, t) = I,(2, t) e<cez< b. 
This integral also approaches zero with ¢ so that our initial condition 


(51) lim v2(z, t) = 0 0<2<b, 
t-+0 

is satisfied. Our functions v;(z, t) and ve(z, t) therefore satisfy all the initial 

and boundary conditions. 

Since differentiation of the integrands of J,(z, t) and I(x, t) once with respect 
to ¢ or twice with respect to x introduces a factor @ or a’ as the essential change, 
it is readily shown that the integrals of these derivatives over the path P con- 
verge uniformly. Hence J,(z, t) and I2(z, t) can be differentiated inside the 
integral sign, and they satisfy the heat equations. It can be seen, then, that 
the expressions (47) and (50) for v;(2, t) and ve(z, t) in terms of contour integrals 
satisfy all the conditions of our problem. But it is not easy to see how we would 
arrive at this contour integral solution in this case without the use of some other 
method such as the one used here. 

The solution of our problem then is given in series form by the equations (41) 
and (42), and in terms of contour integrals by (47) and (50). The latter form 
is especially useful for examining convergence properties. 


UNIVERSITY OF MICHIGAN. 














ASYMPTOTIC LINES THROUGH A PLANAR POINT OF A SURFACE 
AND LINES OF CURVATURE THROUGH AN UMBILIC 


By Tuomas L. Downs, Jr. 


1. Introduction. If a regular point of a surface is not an umbilic, there pass 
through it two asymptotic lines (which may be imaginary) and two real lines 
of curvature.! If a point is an umbilic, this conclusion does not follow, for a 
circular umbilic is a singular point for the differential equation of the lines of 
curvature and a planar umbilic is a singular point for the differential equations 
of both families of curves. In a previous paper,’? the author has studied the 
relations between two finite sets of directions at a planar point: the “true 
asymptotic directions’, which are the possible tangent directions of the asymp- 
totic lines through the point, and the “‘true principal directions’’, which are the 
possible tangent directions of the lines of curvature. In the present paper, we 
shall consider the asymptotic lines which are tangent to a given true asymptotic 
direction at a planar point and the lines of curvature which are tangent to a 
given true principal direction at a planar or circular point. 

No previous results for the asymptotic lines appear to be known. The lines 
of curvature have been studied in the general case by Delloue ;* the present paper 
amplifies his conclusions. In special cases, the lines of curvature have been 
treated by several writers, among whom may be mentioned Cayley,‘ Darboux,° 
Picard® and Wahlgren.’ 

The results for the asymptotic lines are not parallel to those for the lines of 
curvature. To an arbitrary true asymptotic direction at a planar point, there 
is tangent a unique asymptotic line in the most general case, two asymptotic 
lines in the next most general case. To an arbitrary true principal direction at 
a planar or circular point, there is tangent in general either a single line of curva- 
ture or an infinite number of lines of curvature, depending upon certain definite 
conditions. 


Received December 24, 1935; part of a thesis presented at Harvard University. 

1 An umbilic is a regular point of a surface at which e/E = f/F = g/G = 1/p, where 
E, F, G and e, f, g are respectively the coefficients of the first and second fundamental 
forms of the surface. In the present paper an umbilic will be called a circular point if 
1/p ~ 0 and a planar point if 1/p = 0. 

2 Downs, Asymptotic and principal directions at a planar point of a surface, this Journal, 
vol. 1 (1935), pp. 316-327. 

3’ Comptes Rendus, vol. 187 (1928), p. 702. 

4 On differential equations and umbilici, Collected Math. Papers, vol. 5, p. 708. 

5 Lecons sur la Théorie Générale des Surfaces, vol. 4, Note VII. 

6 Traité d’ Analyse, vol. 3, chap. TX, §14. 

7 Arkiv for Mat., Astr. och Fys., vol. 1 (1903), p. 43. 
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2. The method of reduction. The problem at hand is a special case of the 
general problem of analysis which seeks to determine the integral curves of a 
quadratic differential equation 


(1) A(z, y)dx? + 2B(z, y)dady + C(2, y)dy? = 0, 


which pass through a point at which all three coefficients vanish—a singular 
point. Let the origin be taken at the singular point of equation (1), and let us 
suppose that the coefficients A, B, C are analytic functions of (x, y) in some 
neighborhood of the origin. Then the differential equation takes the form 


(la) [A,(z, y) + +--+ ]dx* + 2[B,(2, y) + --- |] dedy 
‘la 
+ IC, (2, y) + ---]dyv =0, n 21, 


where A,, B,, C,, homogeneous polynomials of degree n, not all identically zero, 
are the terms of lowest order in the Taylor expansions of the coefficients about 
the origin. 

We shall consider only integral curves of (1a) which approach the origin so as 
to have a definite tangent there, that is, so that dy/dz approaches a definite 
limit, finite or infinite. Integral curves of this kind must be tangent at the 
origin to one of the n + 2 lines 


(2) A, (a, y) x? + 2B,(2, y) ry + C,(2, y) y® = 0. 


Let y/x = \ be a solution of the equation (2); we may suppose that d is finite. 
Then if we set y = (¢ + A)z in (la) and divide out x", we shall get a quadratic 
differential equation with a singular point at the origin in the (z, t)-plane. This 
equation is to be treated as we have just treated (1a). 

A finite number of such substitutions suffices, at least in the cases of most 
frequent occurrence, to transform (1) into an equation 


a(t, nd? + 2B(E, n)dtdn + y(E, n)dn? = 0, 
whose discriminant 
A= B° — ay 


does not vanish at & = » = 0, and which therefore factors into the two equations 


dn a (&, 9) dn _ az (&, n) 


dé by (én)? dE be (E, n)” 
in which a;, b; are analytic at £ = » = 0. The theory of singular points of 
equations of this form is completely known. It is covered by a definitive paper 
of Bendixson’s in the Acta.* 
The method of reduction just described is a generalization of that used by 
Picard? in the special case when n = 1 and ) is a simple root of equation (2). 


8 Acta Math., vol. 24 (1900), pp. 1-88. 
» Traité d’ Analyse, vol. III, Chap. IX, p. 223 (1928 ed.). 
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Provided that \ is such a simple root, only one substitution is necessary no 
matter what the value of n, and the method of Picard applies essentially un- 
changed. A similar method for the treatment of the general case has also been 
suggested by Wahlgren.” 


3. The asymptotic lines through a planar point. The asymptotic lines of an 
analytic surface 


S: i= z(u, v) (¢ 1, 2, 3) 
are the integral curves of the differential equation 
(3) edu? + 2 fdudv + gdv? = 0, 


in which the left-hand side is the second fundamental form of S. The lines 
consist of two families of curves on S, one curve of each family passing through 
each regular non-planar point. A planar point is a singular point of (3), inas- 
much as the coefficients e, f, g all vanish there. 

If we choose the planar point P as the origin of rectangular coérdinates 
(z, y, 2) and the tangent plane to S at P as the (z, y)-plane, the surface in the 
neighborhood of P is represented by the equation 


z = F(z, y) = on(Z, y) + onsilz, y) + ++ » Gn(Z, y) #0, n 23, 


where ¢;(z, y) is a homogeneous polynomial of degree k in x and y. If the 
coérdinates (x, y) are regarded as the surface parameters, the differential equa- 
tion (3) becomes 


(3a) [%s m Penis + 3 2 | dx? + 2 [ Ze: + Oyn41 + ~ | drdy 




















ax? ax? oxdy axay 
Won Pvnit ] 
3 ---|dy = . 


The integral curves of (3a) may be regarded as the orthogonal projections of 
the asymptotic lines upon the (z, y)-plane or as the asymptotic lines themselves 
referred to the surface coérdinates = u and y = v. In either case their pos- 
sible tangent directions at the origin—the true asymptotic directions at P — 
are defined by the equation 








Oon ° on Won 2 
4 > _— 1 n ’ ss 0 . 
(4) ; +3 owt arom ) en(z, y) 


It will be convenient to use the slope-angle @ = are tan y/z and to adopt the 
notation 


Pp 
gx(0) = gx(cos@,sin@), (0) = = ¢gx(cos 8, sin @) . 


Then the following theorems are established by the method described. 


10 Bihang t. Kon. Svenska Vet.-Akad. Handlingar, vol. 28 (1902-1903), Afdelning 1, 
no. 4. 
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Let the angle a be a real root of order p of the equation 
(4a) on(9) = 0. 


1. If gnis(a) ¥ 0, there is a unique asymptotic line tangent at the planar point to 
the direction y:x = tana. It consists of two branches which join to form an analytic 
curve, smooth and without inflection at the origin if p is odd, but having a cusp of the 
first kind there if pis even. If p = 1, these conclusions hold even when gnsi(a) = 0. 

2. Ifp > landif 


Pn+1(a) =0, Pn+1(a) #0, 


then there are precisely two asymptotic lines tangent at the planar point to the direc- 
tion y:z = tan a, unless 


where 





ite 2 onsale) - v, (a) 
[on+1(a))? 


In this case, 

a) if = 1, there is at least one asymptotic line in the given direction; 

2 

b) iff1l<@s a i 2 7 there is an infinite number; 
, n® + 3n)? 
c) if@> a ay , r 

To illustrate the method by which these results are established, we shall out- 
line the proof of the first theorem. We suppose that gn4:(a) ~ 0. If p = 1, the 
method of Picard applies and the one substitution y = (¢ + tan a)z serves to 
establish the fact that there is a unique asymptotic line tangent to the direction 
of slope-angle a. 

Let us then assume that p > 1; we may also suppose, without loss of generality, 
thata = 0. Then 





, there are none. 





gon(1, t) = bP + --- b #0, 


and if we set y = ¢z in the differential equation (3a) and divide through by 
br”, the equation becomes 


P(A +) $e@H es) at. det 
(5) $ 2x [P(B +...) $2 (---) fat (-++)] dade 
$ [C+ +) $C.) $at(--- de =0, 
where A = n(n — 1), B = p(n — 1), C = p(p — 1), and 
- p!n(n + 1) gays (1, 0) ~0. 





5 [on (1, t)]ea0 
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The possible tangent directions for solutions of (5) at the origin in the (z, t)- 
plane are given by the equation az* = 0. Let us therefore set successively in (5) 


x= Mt, 2, = Mot, ---,Lp-2 = Xprt. We obtain a series of differential equa- 
tions (I), (II), --- , (P — I) of which the k-th is 

Plte-(A + ---) + are + rd(---) | dai 
(K) + 2ta;, [t?-*(Bi + ---) + kar, + xit( --- )] dx, dt 


+ 2 [Ci + ---) + Pare + at(--- dP =0, 


where B, = kA + B,C, = PA + 2kB+C. 
The tangent directions for solutions of (K) at the origin in the (z;, t)-plane 
are defined by the equation 


(Ex) (k + lfazi & = 0 (kK<p-—1). 


Corresponding to the direction ¢ = 0 of the equations (E;), (Es), --- , (Eps) 
there will be found in each case only the solutions t = 0, z, = t = 0 of (K), each 
of which yields only the trivial solution z = y = 0 of the original equation (3a). 
On the other hand, the directions x, = 0 defined by (E,), (Ez), --- , (E,-2) have 
been disposed of by setting x; = 2441¢ in (K) and thus proceeding to the equation 
(K + I). 

We consider now the differential equation (P — I); its solutions through the 
origin in the (z,-1, ¢)-plane are tangent to the directions 


(E,-) zit (p’az,, + C,t) =0. 


The root ¢ = 0 of this equation is disposed of as we have done above. To discuss 
the root rz». = 0, we set in (P — I) 2,1 = 2,¢ and obtain a differential equa- 
tion (P) in z, and t whose only solutions through the origin can be shown to be 
the axes z, = 0 and t = 0; but these yield only the trivial solution z = y = 0 in 
the (x, y)-plane. 

Thus we have left for consideration only the direction 


pPatp. + Cyt = 0 
defined by (E,-1). We therefore set in (P — I) 


C 
tp = (u +d), d= ~se 9. 


After division by ¢@ there results the equation 
@((A + ad) + au + --- | dv? 
+ 2t(u + A)[(Bp + par) + pau + --- | dudt 
+ (u + d)*[pPau + --- ]d? = 0. 
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This equation factors into two equations of the first degree in du/dt: 








du _ _ —2(pn? + pn — p — 1) (pn? — 1) ’ 
(6a) Pa a ++ a = a(2pn — p — 1) ~ 0; 
du _ _ —p(pn> + pn — p — 1) 
(6b) Ce eye + +++, bo = 2 (pn? — 1) <0. 


Equation (6a) has a regular point at u = ¢ = 0 and so yields only the illusory 
solution t= 0. To (6b) the criterion of Bendixson" applies: besides the illusory 
solution ¢ = 0 it has just one solution through the origin and tangent to the t-axis 
in the (u, t)-plane. This solution gives rise to a unique asymptotic line in the 
direction of the z-axis in the (x, y)-plane. It is shown by Picard” to be an 
analytic curve 


u=c™4+... c#0,m>0, 


in the (u, t)-plane. In the (2, y)-plane it is therefore represented by the equa- 
tions 


z= NNP+.:---, y = MPti4..., 


and so has a cusp at the origin if p is even, but is smooth and without inflection 
if p is odd. 

This completes the proof of the first theorem. The second is proved in a 
similar manner. 


4. The lines of curvature through an umbilic. The lines of curvature on S 
are the integral curves of the differential equation 


edu + fdv Edu + Fdv 


7 = 
”) jfdu+qgdv Fdu + Gdo 


Let P be an umbilic on S, planar or circular, and choose coérdinates as in §3 with 
P at the origin and the tangent plane to S at P as the (2, y)-plane. The surface 
in the neighborhood of P is then represented by the equation 


S: r= 5+ Vt+elnt-- talento, 


where 1/p ¥ 0 or 1/p = 0 according as P is a circular or planar point. If P is 
circular, the osculating sphere at P will be represented in the neighborhood of P 
by the equation 

2: Z = 


a (P+ U) + flew) +o + hile) +o, 


1 Loc. cit., p. 49. 
12 Traité, III (1928), p. 28 and p. 209. 
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where 
Jui = 9, Sox = (Gg, /p™*—') (2? + y?)* (ay, ~ 0). 


Let Q(z, y, z) be a point on S near P. If P is circular, denote by @,(z, y) the 
principal part of the infinitesimal directed distance z — Z from Q to = measured 
along the perpendicular from Q to the tangent plane at P: 


Pn(z, y) = ¢n(Z, y) oe f.(x, y); 


if P is planar, denote by @,(z, y) the principal part of the directed distance z 
from Q to the tangent plane at P. It is to be noted that if P is planar, g, = ¢n, 
where ¢, has the same meaning as in §3, and that in any case" 


2n(0) = ¢n(@) + constant. 


Under these conditions, the differential equation (7) of the lines of curvature 
takes the form" 


Pen, Pena | ‘ | (=: “e) 
| Ze 4 ay TIS Tae 


FP On4t = Yous) | [ Ze Past | “Sa 
+( ay? a J + duty axray + axdy oa oe 











(7a) 











When we set x = r cos 0, y = rsin 0, equation (7a) becomes 


[¥n(O) + THnsi(O) + ---Jdr* + r[xn(O) + rxn+i(6) + ---]drdé 


(7b) 


where 
vu(@) = (k — 1), (0) = (k — 1)e,(0), xu) = gu (0) — k(k — 2)pu(8). 
We assume that y, does not vanish identically, that is, that 
on(z, y) # a(x? + y*)™ (2m = n). 


The possible tangent directions at the origin for the integral curves of equa- 
tion (7b)—the true principal directions at P—are then defined by the equation 


(8) ¥,(0) = ¥.(cos 8, sin 0) = 0. 


13 Downs, loc. cit., §7. There the directions defined by the equation 3, = 0 are called 
the “true osculatory directions at P’’. 

4 The omitted terms of the coefficients are of order at least n. This rule of formation 
holds only through terms of order (n — 1) if Pis circular and through terms of order (3n — 5) 
if P is planar. 
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The results in the general case have been stated without proof by Delloue."® 
We shall here only sharpen the statement of his Case II by distinguishing two 
important subcases. Employing the methods already outlined, we arrive at the 
following theorem. 

Suppose that y is a real root of order p of ,(6) and also a root of xn(@), but that 
xn(@) does not vanish identically. 

1. If p= 1 and 


(A) [Wnp1Xn - Vn Xntileny ¥ 0, 


there is an infinite number of lines of curvature tangent at P to the direction y:z = 
tany. If isa multiple root of x.(0), the condition (A) becomes simply 


(A’) Xuyi(y) ¥ 0. 


2. If p = 2 and nyi(y) ¥ 0, there is one and only one line of curvature tangent 
at P to the direction y:x = tan y. 


HARVARD UNIVERSITY. 


16 Loc. cit. His results are as follows in the notation of the present paper: 

“T. Let OT be a real ray of the pencil ¥,(z, y) = 0 which does not also belong to the 
pencil whose equation is xn(z, y) = 0; y, the angle it makes with Oz; v'?)(6) the first deriva- 
tive of the function ¥,(@) which does not vanish for @= y. If p is odd and if 


¥?)(y)/xn(y) > 0, 


there is one and only one line of curvature tangent to OT at O. In all other cases, there 
is an infinite number. There is always a real direction to which is tangent only a single 
line of curvature. To every real direction of the pencil ¥,(z, y) = 0 there is tangent an 
analytic line of curvature (C), except in certain cases when (n — 1)¥4.(y)/xa(7) is a negative 
integer. If between two consecutive lines (C) there are an infinite number of lines of 
curvature passing through the umbilic, the latter all have the same tangent there. 

“II. If OT belongs to both the pencils y,(z, y) = 0 and x,(z, y) = 0, there is only a 
single line of curvature tangent to that ray at O in the cases of most frequent occurrence. 

“III. The preceding conclusions suppose that ¢,(z, y) is not of the form a(z? + y*)", 
2m = n. In that case an infinite number of lines of curvature with distinct tangents pass 
through the umbilic.”’ 











FORMAL PROPERTIES OF ORTHOGONAL POLYNOMIALS IN TWO 
VARIABLES 


By DunHAM JACKSON 


1. Construction and properties of symmetry of systems of orthogonal poly- 
nomials. The theory of orthogonal polynomials in two variables, as might 
be anticipated, presents numerous analogies with the corresponding theory in 
one variable, together with extensive and fundamental differences and compli- 
cations, which add materially to the interest of the problem, and at the same 
time limit the scope of an elementary treatment of it.! 

The “Schmidt process of orthogonalization”’ is applicable to functions of an 
arbitrary number of variables. If go(z, y), ¢:(, y), ¢e(2, y), --- form a set of 
functions integrable with their squares over a region R with no relation of 
linear dependence connecting any finite number of them (either identically or 
almost everywhere), it is possible to form a normalized orthogonal sequence 
o(x, y), P(x, y), Pelz, y), --- in which , isa linear combination of go, ¢i, ---, 
¢n. In particular, if R is finite and if p(x, y) is a non-negative integrable 
function having a positive integral over R, application of the process to the 
linearly independent functions p', p'z, p'y, p'x*, pry, py”, ---, taken in this 
order, gives a sequence of polynomials q,,.(z, y), n = 0, 1, 2, ---; m = 
0, 1, ---, ”, such that 


[ [oc yqulx, Y)Qnm(x, y)dx dy = 0, n—k + im—l | # 0, 
© R 


| [ he dekneiiite =. 


The n + 1 polynomials ¢,0, Gai, +++ 5 Yann are of the n-th degree in the two vari- 
ables together, and with respect to p as weight function they are orthogonal 


Received February 6, 1936; presented to the American Mathematical Society January 1, 
1936. 

'T am indebted to Professor Shohat for the following bibliographical indications: 
J. Shohat, Théorie générale des polynomes orthogonauz de Tchebichef, Mémorial des Sciences 
Mathématiques, No. 66, Paris, 1934, pp. 20-22, and references 25, 28, 29; F. Didon, Etude de 
certaines fonctions analogues aux fonctions X ,, de Legendre, etc., Annales del "Ecole Normale 
Supérieure, vol. 5 (1868), pp. 229-310; F. Didon, Développements sur certaines séries de 
polynomes, ibid., vol. 7 (1870), pp. 247-268, and other articles by the same author in vols. 
6 and 7 of the same Annales; P. Appell, Sur une classe de polynomes a deux variables et le 
calcul approché des intégrales doubles, Annales de la Faculté de Toulouse, vol. 4 (1890), pp. 
H 1-20; P. Appell, Sur les fonctions hypergéométriques de plusieurs variables, les polynomes 
d’ Hermite, et autres fonctions sphériques dans l’hyperespace, Mémorial des Sciences Mathé- 
matiques, No. 3, Paris, 1925; P. Appell and J. Kampé de Fériet, Fonctions hypergéométriques 
et hypersphériques—Polynomes d’ Hermite, Paris, 1926. 
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to each other and to every polynomial of lower degree; in these particular 
polynomials, though not generally in the case of the others presently to be 
introduced, the second subscript indicates the degree with respect to y. If 
polynomials pro, Pri, +--+ » Pan are defined in terms Of dno, Gniy-+* 5 Ynn DY a 
real orthogonal transformation of the form 


n 


Pri = , CijQni 5 > Cir; = 0 (i # k), i ct ; = i 
j=0 j=0 j=0 

(whether with determinant 1 or with determinant —1), the polynomials p,; 
are likewise of the n-th degree, orthogonal* to each other, normalized, and or- 
thogonal to every polynomial of lower degree. 

Any polynomial P,,(2, y) of the n-th degree which is orthogonal to every poly- 
nomial of lower degree is necessarily a linear combination’ of the n + 1 poly- 
nomials qno, --:, Yan. For if the coefficient of y" in P,(2, y) is c, times the 
non-vanishing coefficient of y" in gan(x, y), the polynomial P,(x, y) — Cndan(z, y) 
has no term in y", and is still orthogonal to every polynomial of lower degree; 
if the coefficient of ry"! in Py — Cndnn iS Cn. times the coefficient of ry" 
iN Gn.n—1, terms in y” and zy"! are both absent from the remainder 


2 2 . 
I 5 = Cran —- Cn~19n,n—1) 


and by continuation of the indicated process there is obtained ultimately a 
polynomial 


Pe _ Cr nn — Cr 19n,n—1 2 Cn0] no 


which contains no term of the n-th degree, but is orthogonal to every polynomial 
of degree lower than the n-th, and so in particular must be orthogonal to itself, 
and hence identically zero. It is equally true that P, can be linearly expressed 
in terms of any set pro, --- , Pon defined as in the preceding paragraph, since 
the q’s can be expressed in terms of the p’s. 

If wro(a, y), «++, Tan(@, y) is any set of n + 1 normalized orthogonal poly- 
nomials of the n-th degree orthogonal to every polynomial of lower degree, the 
m’s are expressible in terms of the p’s by an orthogonal transformation. For 
each 7,;, as just noted, can be written in the form 


n 
= > ViiPnis 


7=0 


2 This term as applied to pairs of polynomials will be understood in each case to mean 
orthogonal with respect to the weight function under consideration; and a corresponding 
interpretation is to be attached to the word normalized. 

3 This is of course not true in general of an arbitrary polynomial of the n-th degree, 
which involves (n + 1)(n + 2)/2 coefficients. 
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and by the assumption that the z’s are normalized and orthogonal 


0= [ p(x, ymax, y)rne(x, yddx dy = D> vives (i # k), 
d R 


j7=0 


1= | [ he dinda dats a D4. 


j=0 


This leads to a relation between properties of symmetry of the weight func- 
tion p(x, y) and corresponding properties of the orthogonal polynomials. Let 
it be supposed that there is a transformation 


(1) x’ = Ar + By, y’ = Cx + Dy, 


(necessarily* of determinant +1), which carries the region R into itself, and 
under which furthermore the function p is invariant: p(x’, y’) = p(x, y). Let 
p(x’, y’) be a polynomial of the n-th degree which is orthogonal to every poly- 
nomial of lower degree, and let p(x’, y’) = x(a, y). Let o(2, y) = s(2’, y’) be 
an arbitrary polynomial of degree n — 1 at most; the degree is naturally the 
same with respect to either pair of variables. Then 


(2) / p(x, y)r(x, y)o(x, y)dxdy = | i p(x’, y’)p(a’, y’)s(a’, y')dx'dy’ = 0. 


In view of the arbitrariness of o(z, y) this means that x(x, y) is a linear com- 


bination of p,o(z, y), --- , Pan(z, y). If p(x, y) is normalized, the determinant 
of the transformation being +1, 


| I az, ») [n(2, Pde dy = | i ola’, y’) [p(x’, Rd’ dy’ = 1, 


and x(x, y) is normalized also. In (2) the polynomials p, s can be replaced 
by any two polynomials which are orthogonal to each other, in particular by 
any two of the polynomials po, --- , Pan. If 


Trot, Y) = Prolt’, y’), --- y Fnn(T, Y) = Punlz’, y’), 


the x’s forma system of normalized orthogonal polynomials of the n-th degree, 
orthogonal to every polynomial of lower degree, to which the preceding para- 
graph is applicable. A transformation (1) under which R and p are invariant 
defines an orthogonal transformation of the set of polynomials pro, --- , Pan, for 


‘If J is the determinant of the transformation, and E the area of the region, 


E= [ [evar iat [ [aed = iain 
J Jr J Jr 


The transformation on z, y is, however, not necessarily unitary; e.g., the transformation 
x’ = 2y, y’ = 42x carries the rectangle —2 S x S 2, —1 S y S 1 into the rectangle —2 < 
2, -—1 < y’ S 1, and leaves the positive function 1 + 2? + 4) invariant. 
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each value of n. To a group of transformations (1) corresponds for each n a 
simply or multiply isomorphic group of transformations® of the p’s. 

For example, if R is the square —1 S x S 1, —1 S y S 1, if pla, y) has 
the form pi(x)p2(y), and if po(x), pi(x), --- and qo(y), ify), --- are the systems 
of normalized orthogonal polynomials in one variable corresponding to the 
weight functions p,; and pz, respectively, the polynomials pj, --- , Pa» can be 
taken as 


Pr(X)Go(y), Pra(z)qi(y), --- » Polr)gn(y). 


If p, and pe are even functions, the orthogonal polynomials in one variable are 
even or odd according as the degree is even or odd, and the transformation 


, 


x’ = —2,y’ = —y carries over the polynomial p, (7, y) = pr_:(x)qi(y) into 
Tri(X, yY) = Pr—i(—2)gi(—y) = (—D)"pr_(x)qi(y) = (—D)"pailt, y); 


for n even the transformations of the group 


1 0 —1 0 
0 1) e -1 
on x and y both correspond to the identical transformation on p,o, +--+, Pans 


while for n odd they correspond respectively to the identical transformation 
and to its negative. The group 


1 0 = | ') l 0 =f ') 
0 1) 0 1) ; waar , waar 


on z and y gives rise to two or four different transformations on the p’s according 
as nis even or odd. The group relationships which arise in various cases would 
obviously constitute an extensive study in themselves. 

As another example, suppose that p is a symmetric function of x and y, 
p(y, ©) = p(z, y), the region R being one which is carried over into itself by the 
transformation x’ = y, y’ = x. Let q(2,y) bea polynomial of the n-th degree 
which is orthogonal to every polynomial of lower degree, and normalized, 


II p(x, y) (g(a, y) Pde dy = 1. 


By the general discussion above q(y, x) is likewise normalized and orthogonal 
to every polynomial of lower degree. The polynomials g(x, y) + q(y, 2) and 


5 These facts are of course suggested by the applications of group theory in quantum 
mechanics. The writer is not aware that the present formulation is a familiar one. 

5 The same thing is readily seen to be true in the case of any weight function such that 
p(—z, —y) = o(z, y), with any choice of the polynomials p,;. For the terms of the n-th 
degree in pai(—z, —y) are the corresponding terms of p,i(z, y) multiplied by (—1)*, 
and if pai(—z, —y) as well as p,i(z, y) is orthogonal to every polynomial of lower degree, 
it follows that p..(—z, —y) — (—1)"pai(z, y), having no terms of the n-th degree, must be 
orthogonal to itself. 
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q(x, y) — gy, x), still orthogonal to every polynomial of lower degree, are also 
orthogonal to each other, since 


II p(x, y) (g(x, y) + aly, x [a(x, y) — gly, x)\dx dy 


- If p(x, y) (q(x, y)Pdxdy — II p(x, y)[q(y, r)Pdrdy = 1-—1=0. 
kK R 


Unless one of them is identically zero, they can be normalized by means of the 
appropriate constant factors, that is to say, the polynomial g(x, y) either is 
itself symmetric or skew-symmetric, or gives rise to a pair of polynomials of 
similar character of which one is symmetric and the other skew-symmetric. 

If r(x, y) is another polynomial of the n-th degree which is orthogonal to every 
polynomial of lower degree, and not linearly dependent on q(x, y) and q(y, 2), 
it is possible to form a linear combination 


q(x, y) = r(x, y) — eq(x, y) — e’q(y, 2) 


which is not identically zero and is orthogonal to g(x, y) and to g(y, x). Then 
q(y, x) also is orthogonal to g(x, y) and to q(y, x), in consequence of the sym- 
metry of p. This polynomial q;(2, y), like q(x, y), either is symmetric or skew- 
symmetric or gives rise to a pair of orthogonal polynomials, one symmetric 
and the other skew-symmetric. If there is a polynomial r(x, y) of the n-th 
degree orthogonal to every polynomial of lower degree and not linearly de- 
pendent on q(x, y), g(y, 2), m(z, y) and qi(y, 2), the process can be continued. 
It leads ultimately to a set of n + 1 polynomials of the n-th degree, all symmetric 
or skew-symmetric, normalized, and orthogonal to each other as well as to 
every polynomial of lower degree. 

If the construction is based on the particular set of polynomials q,0, -- + , Gan 
defined at the beginning of the paper, it can be said with definiteness that 
Gno(X, y) is neither symmetric nor skew-symmetric (for n > 0), since it contains 
a term in x” and no term in y"; the result of subtracting from q,:(2, y) a linear 
combination of ¢,0(2, y) and q,o(y, x) is neither symmetric nor skew-symmetric 
(for n > 2), since it contains a term in x"~'y and no term in zy""‘, and so on. 
The resulting set consists of $(m + 1) pairs, or of $n pairs and a single poly- 
nomial, according as n is odd or even. In the latter case, the single polynomial 
obtained after construction of the 3n pairs must be symmetric, for it is certainly 
not skew-symmetric, since it actually contains a term in 2"?y"". When p(x, y) 
is symmetric, the polynomials pio, +++, Pnn Of the n-th degree in the orthogonal 
system can be chosen so that the matrix of the transformation by which the poly- 
nomials p,(y, x) are expressed in terms of the polynomials p, (2, y) is in diagonal 
form, with 4(n 4+ 1) or 4n + 1 of the diagonal elements equal to +1, according 
as n is odd or even, and the rest equal to —1. The transformations of the p's 
corresponding to the group 
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1 0 01 
0 1) 1 0 


on x and y are in this case distinct for every n = 1. 

Parts of the above reasoning are applicable under more general circumstances, 
or at any rate under different circumstances. Let a sequence of functions gy 
be given as at the beginning of the paper, and as they occur in order let them 
be grouped into sets in any way, with y; functions in the first set, we in the 
second set, and so on. A linear combination of ¢’s involving one or more 
functions from the n-th set, with or without functions from earlier sets, but not 
containing any from sets beyond the n-th, will be called a sum of the n-th grade. 
Let yu, for a particular value of n be represented for simplicity by the symbol ». 
The Schmidt process, applied to the ¢g’s in order, gives vy sums of the n-th grade, 
which may be denoted by ®,, ®n2, --- , ®,,, normalized and orthogonal to 
each other and orthogonal to every sum of lower grade. Any set of » functions 
Wii, Vie, «+: , Vay expressed in terms of ®,;, --- ,®,, by an orthogonal trans- 
formation of the form previously considered will be a set of normalized orthogo- 
nal sums of the n-th grade, orthogonal to every sum of lower grade. Any sum 
of the n-th grade which is orthogonal to every sum of lower grade is linearly 
expressible in terms of ®,;, --- ,®,,, for subtraction of suitable multiples of 
#,,, ®,,-1, --- im succession leaves a remainder which contains no term of 
the n-th grade, and so must be orthogonal to itself and hence identically zero. 
Any two sets of normalized orthogonal sums of the n-th grade, orthogonal to 
every sum of lower grade, with v sums in each set, must be expressible in terms 
of each other by orthogonal transformation. 

These considerations apply to orthogonal polynomials classified otherwise 
than by the degree of the polynomial in the two variables jointly. If p(x, y) 
is a given weight function as before, and if a polynomial is said to be of the n-th 
grade when the exponent of the highest power of either variable occurring in it 
is n, sets of orthogonal polynomials of grade 0, 1, 2, - - - are obtained by applying 
the Schmidt process to the products of p! by the monomials 1, x, ry, y, 2°, x*y, 
x*y*, ry", y*, --- successively; the value of yu, in this case is 2n + 1. The scope 
of the earlier developments with regard to invariance of p(x, y) under linear 
transformation of x and y is limited by the fact that such a transformation 
does not in general leave the grade of a polynomial unaltered; for example zy, 
which is of the first grade, is in general carried over into a polynomial of the 
second grade. Transformations of the form x’ = +2, y’ = +y, or of the 
form az’ = +y,y’ = +2, do however leave the grade of a polynomial unchanged, 
and if R and p are invariant under one of these transformations, it defines 
orthogonal transformations of the various sets of polynomials of specified grade 
in the orthogonal system. If p(y, r) = p(x, y), the 2n + 1 polynomials of the 
n-th grade can be chosen for each value of n so that n + 1 of them are sym- 
metric and the remaining n skew-symmetric. 

With a corresponding definition of the grade of a trigonometric sum in two 
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variables, the grade of such a sum is unaltered by the transformations x’ = +2, 
y’ = +y and 2’ = +y, y’ = +2, as well as the order, considered to be the 
sum of the orders with respect to the two variables separately, and a theory of 
orthogonal transformation can be worked out for the corresponding sets of 
orthogonal] sums. 

To return to the case first considered, that of a system of orthogonal poly- 
nomials classified according to degree in the two variables together, if qo, «+ ~ , Gnn 
are subjected to a complex unitary transformation, the resulting polynomials 
Pnoy> *** 4 Pun (in which x and y are still to be thought of as real variables 
ranging over the region &) will be orthogonal to each other in the Hermitian 
sense, normalized, and orthogonal to every polynomial of lower degree (with 
real or complex coefficients) in the Hermitian sense as well as otherwise, since 
in the relationship with polynomials of lower degree the real and pure imaginary 
parts are orthogonal separately. So the real orthogonal transformation of the 
q’s induced by a linear transformation of the form (1) which leaves R and p 
invariant can be reduced to normal form by suitable choice of the p’s according 
to the general theory of unitary matrices, and even if the p’s thus introduced 
are complex, they still have a definite significance as orthogonal polynomials.’ 


. Recursion formula and Christoffel-Darboux identity. The processes 
which lead to the recursion formula and the Christoffel-Darboux identity for 
orthogonal polynomials in one variable are applicable also in the case of two 
variables, though the results are naturally less simple, and their utility for the 
theory of convergence of the corresponding developments in series is not so 
readily apparent. 

Let a set of normalized orthogonal polynomials corresponding to a _— 
function p(x, y) in a region R be denoted as before by p, (a2, y), n = 0,1, 2, --- ; 
i = 0, 1,---, n, each polynomial being of the degree indicated by its ‘first 
subscript. For specified n and 7 the product rp, (x7, y), being a polynomial of 
degree n + 1, can be expressed in the form 

n+l om 


UPnil@, y) = : ; CmjP mil r, y), 


m=0 j,=0 


with 
Cnj = | [ o. YLPalr, Y)PnAla, yaa dy. 
k 


This coefficient of course depends on n and 7 as well as m and j, but a corre- 
sponding elaboration of the symbolism is unnecessary. As xp,,j;(a, y) is a poly- 
nomial of degree m + 1, and as p,.(a, y) is orthogonal to every polynomial of 
degree lower than the n-th, ¢,; = Oifm <n — 1. So 


n+l n—-1 


CP = > Cn4, jPn4i,j + pa CnjPni + > Cna-, jPn-,j- 


7 The linear transformations discussed in this section have been further studied by Mr. 
Andrew Sobezyk in a master’s thesis at the University of Minnesota. 
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The three sums on the right represent polynomials of degrees n + 1, n, and 
n — 1 respectively, each orthogonal to every polynomial of degree lower than 
its own. If these are normalized, and if the normalized polynomials are repre- 
sented by Unis i(z, y), Vala, y), and W,_1,:(x, y), the identity takes the form 


LPni(2, y) = OniU nai, s(z, y) + BriVaila, y) + ¥niW ii, (2, y). 


(If one of the sums is identically zero, it can be regarded as the product of an 
arbitrary normalized polynomial of appropriate degree by a vanishing coeffi- 
cient.) The normalized polynomials Uy4:,;, Vai, Wa-1,:, of degrees n + 1, 
n, and n — 1, are each orthogonal to every polynomial of lower degree, and in 
particular, for fixed n and 7, are orthogonal to each other, but (as far as appears 
from the present reasoning) it is not to be supposed that 


Unit.oy Unis, Singhal Unit.n 
are in general orthogonal to each other, or that Vj, --- , Va» are orthogonal 
among themselves, or W,-1,0, --- , Wn-i.., or that V,; is the same as p,;. 


In consequence of the specified properties of the polynomials l’, V, W 


| p(x, y)[xpax, y)Pdxdy = a&%; + Bi; + v3. 
d R 


On the other hand, if G is the greatest value of | x | in R, this integral can not 
exceed G?, since p,; is normalized. So 


at, + B+ 73; 5 G@, 


where G depends only on R, and in particular is independent of n and 7. 
Similar reasoning is applicable to the product yp,:(z, y), or to 


(Ar + By)pri(x, y), 


if A and B are any constants. 

The formulas thus obtained may be regarded collectively as corresponding 
to the recursion formula connecting successive members of a set of orthogonal 
polynomials in one variable. 

An “arbitrary” function f(z, y) can be formally expanded in a series of the 
form 

« k 
tS > Cri Pri(x, y), 
k=0 i=0 


with 
Chi = | [0% v) f(u, v) pei(u, vjdu dv. 
R 


If S,(z, y) denotes the partial sum of this series through terms of the n-th 
degree and if 


n k 
K,(2,y, u,v) = D> DY pula, y)peilu, v), 


k=0 i=0 
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then 
S,(z7, y) = | [ p(u, v) f(u, v)K,(z, y, u, v)du do. 
R 


If the polynomials pyo , --- , Px are replaced by an alternative set by means of 
an orthogonal transformation, the sum 


k 
D pxi(x, y) pai(u, v) 
i=0 


is Invariant under this transformation, and consequently the whole expression 
K,,(2, y, u, v), in conformity with the fact that S,(z, y) is definable independently 
of any particular orthogonal system as that polynomial of the n-th degree (at 
most) for which 


| i a(x, y)[f(z, y) — S,(x, y)Pdx dy 


is a minimum. 

The series expansion of a polynomial amounts to nothing more than a re- 
arrangement of the polynomial itself, and any polynomial P,,(x, y) of the n-th 
or lower degree is reproduced identically by the formula 


P,(2, y) = | fo% v)P,(u, v)K,(2, y, u, v)du do. 
R 


Considered as a function of u and v, the product (u — 2)K,(x) y, u, v) is a 
polynomial of degree » + 1. As such it can be expressed in the form 


n+1 k 


DX Dd ceipei(u, v), 


k=0 i=0 


in which the c’s are functions of x and y given by 


Chey 


| | plu, v)(u — x)K,(2, y, u, v)pe(u, v)du dv 
R 


II puK, pedu dv — r{ [ pK, pridu dv. 
R R 


For k < n, the function uwp,;(u, v) is a polynomial of the n-th or lower degree, 
and hence, by the preceding paragraph, 


/ [ plu, v)upes(u, v)K, (2, y, u, v)du dv = xpp(x, y), 
R 


while the integral with the factor u omitted is identically equal to py(x, y). 
So® c,; = Ofork <n. 
8 For a corresponding argument in one variable, ascribed to J. Geronimus, see J. Shohat, 


On Stieltjes continued fractions, American Journal of Mathematies, vol. 54 (1932), pp. 79-84; 
p. 81. ° 
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Any polynomial of degree n + 1 in wu and v and of the same degree in x and y 
‘an be written in the form 


. os 
} > >. Cen Pult, Y)prilu, v). 
k=0 1=0 (=0 j=0 

If (u — none y, U, v) is so expressed, the fact that for k < n the coefficient 
of pxi(u, v) is identically zero as a function of x and y means that all the coeffi- 
cients Cea; in which k < n vanish. By the skew-symmetry of the function 
for interchange of the pair of variables (u, v) with the pair (2, y) it appears 
that cia; = 0 also whenever 1 < n. When parts of the calculation are made 
more explicit, as will be done presently, everything being expressed in terms 
of p’s, all terms which are of lower degree than the n-th in either pair of variables 
must cancel out in the final result, and such terms need not be traced in detail 
through the intermediate stages of the work. 

It is sufficient accordingly to begin by observing that 


(u — x)K,(z, y, u,v) = (u— 2) YS parila, y)prilu, v) + terms of lower degree. 
i=0 


By insertion of the integral expressions for the coefficients in the recursion 
formula 


n+1 


UPr(u, v) = y {I p(r, S)rpail?, 8)pngr. Ar, 8)Pasr, Cu, odr ds 


> 
+ + | | p(r, 8)rprilr, 8)py(r, 8)pr(u,vdr ds + terms of lower degree, 
7=0 , R 


Similarly, with an interchange of the subscripts 7 and j, which is arbitrary for 
the moment but does not affect the validity of the formula, 


> 


IPni(X, y) 


i=0 


¥ | [oo s)rprilr, 8) Dns, i(r, 8) Pn, (x, y)dr ds 


+> | Lo S)rpnlr, 8)pnuil’, 8)pnilx, ydr ds + terms of lower degree. 
1=0 RK 

On multiplication of the identity for up,;(u, v) by p,i(a, y) and summation 

with respect to 7, the terms of the n-th degree with respect to each pair of 

variables in the expansion of > Pnil®, Y)Pni(u, v) are seen to be 


> | p(r, 8)rpailr, 8) pail, 8)Pnj(U, 0) paix, y)dr ds, 


> > 


® Here and elsewhere it is to be noted that linear independence of the p’s, obvious from 
the manner of their construction, is also immediately deducible, without reference to 
the details of that process, from their property of orthogonality. 
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while the terms of like degree in the corresponding expression for 
2D) Pri, Y)Pri(Uy 2) 
7 
are the same, so that these terms cancel out of the representation of 
(u — x)K,(z, y, u, v), 


together with all terms of lower degree, and only the terms which are of degree 
n + 1 in one or the other pair of variables remain. Let 


L,(2, y, u,v) = K,(z, y, u,v) — K,alz, y, u,v) = DY pac(x, y)paiu, v). 
i=0 


Then the aggregate of terms of degree n + Lin wand vin u > paix, y)pas(u, v) 


has the representation 
| [ o 8)rLnyilu, v, 7, 8)L,(2, y, 7, s)dr ds, 
R 


and the terms of degree n + 1 in 2 and y in x > Pri(X, Y)Prj(u, v) are repre- 
2 


sented by 
| i p(r, s)rL,(u, v, 7, 8)Lnya(z, y, 7, 8)dr ds, 
Ld 
and as all other terms destroy each other, 


(u — x)K,(z, y, u,v) = | [oo s)rM,,(2, y, u, v, 7, s)dr ds, 
R 
where 
M,(2, y, u,v, 7,8) = Lasi(u, v, r, s)L,(2, y, 7, 8) — La(u, v, 7, 8)Dnsa(x, y, 7, 8). 


Similarly, 
(v — y)K,(2, y, u,v) = If p(r, s)sM,(z, y, u, v, r, s)dr ds. 
R 


Combination of these results gives immediately an identity of corresponding 
form for 


[(Au + Bv) aed (Ax + By)|K,.(2, Y, U, v) 


with arbitrary A and B. This general identity in a sense takes the place of 
the Christoffel-Darboux formula for orthogonal systems in one variable. 
Similar reasoning is possible in the case of polynomials classified according to 
the highest exponent attached to either variable. If this exponent is called once 
more the grade of the polynomial, there are 2n + 1 normalized orthogonal poly- 











434 DUNHAM JACKSON 


nomials of the n-th grade, which may be denoted by p,:(z, y), 7 = 0,1, --- , 2n. 
In this case xp, (7, y) may be either of grade n + 1 or of grade n, and in the 


identity 
rpri(t, Y) = aniU ngs i(@, Y) + BriVnilZ, y) + Y¥uiWn-ail2, y), 
where the first subscript of U, V, W now indicates the grade of the polynomial 
in each case, it may be that @,; = 0. The relation 
ans + Bi; + Yn <4 


holds as before. The only difference in the form of the identity for 


(u — x)K,(2, y, U, v) 


is that K,, consists now of (n + 1)? terms and L,, of 2n + 1 terms, 


n 2k 


K(x, y, u,v) = D> D> pei(x, y) pase, v), 
k=0 1+=0 

Lilx, y, u,v) = D> pailz, y)pni(u, v). 
i=0 


Analogous considerations apply also to the theory of orthogonal trigono- 


metric sums in two variables.” 


UNIVERSITY OF MINNESOTA. 


For the case of one variable see the writer’s paper Orthogonal trigonometric sums, 
Annals of Mathematics, (2), vol. 34 (1933), pp. 799-814. 











ON LOCALLY-CONNECTED AND RELATED SETS 
(Second paper) 
By 8. Lerscuetz 


The subject matter of three recent papers by the author [1, 2, 3]' has called 
forth remarks from Borsuk (on retracts), from Hurewicz (on fixed points) and 
corrections from Morse (on critical sets), which, together with some further 
developments induced thereby, we propose to consider in the present paper. 


I. Chain-retraction 


1. We shall need to refer in the sequel, explicitly and separately, to the 
following three characteristic properties relating a topological space ® to its 
retract, the closed set S (Borsuk [6]): 

(a) there exists a single-valued transformation T: R — S; 

(b) T is continuous; 

(c) T = 1lonS. 

As a special case we might have for T a deformation over % onto S leaving 
S point for point invariant. We should then call T a deformation-retract. 

Once retracts are defined, the notions of AR, ANR follow. We have estab- 

lished in [1] the equivalences between types: 


(1.1) ANR ~ LC, 
(1.2) AR ~ LC, 





where LC designates in essence LC sets, in which in addition all spheres are 
homotopic to points. These two equivalences characterize absolute retracts 
by properties of local connectedness. 

Now one of the chief features of our theory of chain-deformations [2] was the 
dissociation between the homotopic deformations of a set and its chains, and 
operations on the chains alone, regardless of what happens to the set itself. 
The degree to which this was accomplished there did not yield the extension 
of (1.1), (1.2) to HR sets. In truth we had not looked earnestly for it and 
were content in our paper to obtain certain other extensions of [1] from LC to 
HLC. A chance observation by Borsuk, to whom we mentioned this point, 
led us to the expected generalization as we shall now show. It will be profitable, 


Received April 22, 1936. 

1 Numbers in square brackets refer to the bibliography at the end. The general nota- 
tions and terminology are as in Topology [5]; the abridged notations are the same as in 
{1, 2]: LC = locally connected, H = homology, R = retract, NR = neighborhood retract, 
A in a compound abridged symbol stands for ‘‘absolute’’. 
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however, to re-examine the HR notions and indicate how they should be put 
forth. 


2. By referring to [2], No. 9, it will be seen that the HR property there given 
would be the analogue of (a) if JT were a deformation-retract. Furthermore, 
the analogue of (b) or (ce), which would demand, in particular, invariance of 
the chains on S, was not imposed. In these two deviations may be said to lie 
the chief difficulty in extending our equivalences. We shall see in fact that 
when we conform quite strictly with point set retraction, the difficulty vanishes. 
Agreeing then that the property given, loc. cit., is to be termed chain-shrinking 
and not described as retraction, we proceed to build up the analogues of point 
set retraction for chains. 

Let then R = f{e,}, R’ = {e.} be two quasi-complexes ({2] No. 4). We 
shall call chain-transformation of R into &’ a single-valued transformation r+ 
of the chains of & into those of &’, of form c, > ¢.,r < q, which induces on 
their chain-groups homomorphisms commutable with the boundary operator 
F (Fr = rF). It is an e-transformation whenever diam (| ¢, | + | re, |) < ¢ for 
every c, of 8. Whenever &’ is on a set S, we shall also say that & is chain- 
transformed onto S. In particular, if + merely e-transforms a suitable sub- 
division of R, we shall still say that & is e-chain-transformed onto S, 

We now define the closed subset S of R as an homology retract (= HR) of 
M whenever the following conditions hold: 

(a) any finite 8 = {c'} of R is chain-transformable onto S; 

(8) for every ¢ there exists an » such that every & of the spherical neighbor- 
hood S(S, n) is echain-transformable onto S; 

(y) in both cases (a), (8), the c’s C S remain invariant. 

Whenever (8), (vy) alone hold, S is called a neighborhood-HR (= HNR) of ®. 
This is essentially the extension of the NR notion to the present case. 

In the applications to a single chain c,, we must consider it as the R or sub- 
chain of the & consisting of c, and F(c,). 


3. The analogy of our three conditions with (a), (b), (c) is obvious. The 
chief difference is that we do not demand simultaneous transformations of all 
the chains of R onto S. 

In the case of an HR one may also have occasion to drop the continuity con- 
dition (8). We shall say then that we have a weak HR. 

A special case of retraction is one in which the transformations are always 
chain-deformations.*? We shall say then that we have a deformation HR, 


2? The observation made to us by Borsuk, alluded to at the end of No. 1, consisted pre- 
cisely in assuming that, everything being immersed in the Hilbert parallelotope , the 
associated deformation-chains be merely taken C § and not C &. A mild step further 
consists in frankly replacing chain-deformation by chain-transformation. The latter is 
intrinsic and may be defined without regard to § and in fact, if so desired, for any topo- 
logical space whatever. The relation between the notion put forth by Borsuk and the 
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or HNR as the case may be. In particular the first may likewise be weak or 
otherwise. 

The only retraction theorem of [2] is Theorem IX and under our present 
definitions we must replace in it HNR by “deformation-HNR”. We shall 
refer to the theorem under this form as Theorem IX’ of [2] and similarly for 
other restated or modified theorems in the sequel. 

Let us observe in passing that, as pointed out to us by Morse, the proof of 
Theorem II of [2], page 9, only establishes the following weaker result, which 
as a matter of fact, is not really needed in the applications: 

TuHeoreM II’. Jf a closed set B is chain-deformed in such a manner that the 
chains on a closed subset A remain on A, then B may be weakly chain-deformation 
retracted onto A. 


4. The absolute-HR or -HNR (= AHR, AHNR) are defined as for point set 
retraction (Borsuk [6]): S is an AHR or AHNR, whenever the HR or HNR 
property, as the case may be, is possessed by any topological image of S relative 
to the containing space. It is proved also as for ordinary retraction (Borsuk [6] 
p. 160) that the necessary and sufficient condition for a compact metric space 
R to be AHR or AHNR is that its image on § be HR or HNR for © itself. 
And now we are able to prove that in fact 


(4.1) AHNR ~ HLC, 
(4.2) AHR ~ HLC. 


These are the expected analogues of (1.1) and (1.2). 

Let us identify R throughout with its topological image on § and let it first 
be HLC, with &() as its “gauge-function’’, or the function designated by » 
in [2], No. 15. Given ¢ and » < } &(4e), we shall show that ® satisfies the 
HNR conditions with (e) as the function in condition (8). Let &, be a finite 
quasi-complex C S(®, 7) and let mesh R, < ». If this last condition is not 
fulfilled, we replace &,, by a suitable subdivision of mesh <n. Let us suppose 
also that &, has a subcomplex Y on M. We define a +r: 2 — ®, such that 
7% = Yas follows. Take for transform of any ¢ of &, — Ya zero-chain ¢4 = reo 
consisting of a point of R whose distance to | co! = d(\co|, WR). Then ¥ and 
the chains cj, make up a partial realization R* of R,, whose mesh < 3n < £(}e). 
Therefore, by the HLC condition, &* may be completed to form an image &’, 
of &,,, or transform of &, whose mesh < ¢; the ¢-transformation thus determined 
is precisely a 7 of the required nature. This shows that the right side of (4) 
implies the other. 


5. Conversely, suppose that % is an AHNR (i.e., an HNR for ), with n(e) 
as the function in condition (8). Let &’ be a partial realization on % of a certain 
one adopted here may be said to be that whereas he requires some deformation quasi- 


complex DR, at least in H, we only demand a partial realization of DR, in the sense of [2], 
No. 12, with all deformation-chains left out (unrealized). 
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R, with mesh R’ < n(e), where € is given. We first complete &’ to a realization 
R”’ of R in H, by the process of [1], No. 17,* except that cells are replaced by 
chains, spheres by chain-boundaries. The sets (¢), loc. cit., being convex, 
any cycle on a (¢) bounds on it. Hence if the boundary F(c,) of an expected c, 
is on (¢), we may insert ec, C (¢). This takes the place of the construction by 
segments, loc. cit. 

The complex &’’ thus constructed being of mesh < mesh R’ < n(e), there 
exists an e-chain-transformation 7 of &’’ into a R* of R preserving KR’. It is 
clear that mesh R* < ¢ and that it completes &’ on MR in the manner prescribed 
by the HLC condition. ‘Therefore the left side of (4.1) implies the other also 
and (4.1) is proved. 

The same procedure likewise yields the proof of (4.2), since in the two cases 
now involved the upper bounds of the y’s with increasing ¢ are the same as 
for e. 

Corotiary. The sets AHNR have the properties of the sets HLC considered 
in [2], No. 16. In particular, their Betti-groups have the same structure as for a 
finite complex. The sets AHR have the Betti-groups of a point. (See [1], No. 20.) 

It is important to bear in mind throughout that we may have any type of 
chains for which the postulates of [2], §I, hold. In particular we may have 
either singular or regular chains (in the sense of that paper). 


6. In both [2] and [3] we have had repeated occasion to consider chain shrink- 
ing away from a set (see notably [2], p. 16). It was pointed out to us by Morse, 
however, that the theory of critical sets requires a more delicate notion which 
we may describe as “local’’ shrinking away from a set (see Morse [9]). As 
usual in topology, the term “local’’ refers to the fact that the given operation 
may be confined to any preassigned neighborhood of the given set. More 
precisely, if A, B are subsets of R, we say that A may be locally chain-shrunk 
away from B, whenever given any open set U > B, it is possible to find another 
V such that U > V D> Band that A may be chain-shrunk onto A — V over AU. 
By the statement: A may be chain-shrunk away from B at a point 2, we shall 
mean that about x it may be chain-shrunk away from both B and xz. That is 
to say, forevery U D x there isa V Dz such that V C U, and also a neighbor- 
hood W of B on U, such that for any quasi-complex & there is a chain-deforma- 
tion displacing only the elements of R on U, and this away from both B and x 
(not bringing them nearer to B or x), those on V being chain-deformed to the 
outside of W + V. Under these definitions we may apply the reasoning of 
Theorem III of [2], p. 10, without having recourse to Theorem II, the V’s 
of the proof being now as described above, and the W’s, loc. cit., being now 
such that their intersection with the corresponding l’ plays the réle of the W 


3 We recall the following errata given at the end of vol. 35 of the Annals of Mathematics 
and referring to [1], No. 17: line 14 of No. 17, replace ‘“‘convex sets of §’’ by “spheres of 
’’, line 15, cross out ‘‘convex’’. Cross out line 16. 

















ON LOCALLY-CONNECTED AND RELATED SETS 439 


considered above. We shall designate this stronger theorem as Theorem III’ 
of [2]. 


II. Local connectedness and the fixed point formula 


7. When we first undertook to extend our basic coincidence and fixed point 
formulas ([5], Chapter VI) we found that the LC properties of the sets played 
an important réle. Confining our attention to the fixed point problem, it was 
shown that the fixed point formula was valid for a compact LC subset of 
euclidean spaces ([5], p. 347). Since every finite dimensional space can be 
mapped topologically onto some euclidean space, this implies the validity of 
the formula for all finite dimensional compact metric LC spaces. Later we 
showed ({2], p. 129) that the restriction as to finite dimension could be dropped. 

Now the coincidence and fixed point formulas belong in their essence strictly 
to algebraic topology. One would expect, therefore, to have their range of 
validity limited, if at all, by restrictions on cycles and the like, that is to say, 
by HLC rather than LC restrictions. Furthermore, the HLC should refer, 
preferably, to the more “purely’’ algebraic homology theories such as those of 
Vietoris or our own (regular cycles of [2]). 

Now in a private communication (Dec. 1935) Hurewiez indicated to us a 
most ingenious method for establishing the validity of the fixed point formula 
for finite dimensional compact metric HLC sets in the sense of singular cycles. 
Soon after we succeeded in showing that the formula holds for any compact metric 
HLC set, regardless of type or of dimension. This result does have the requisite 
degree of generality, and we shall establish it in the present section. The 
treatment is independent of the type considered. As a matter of fact, HLC 
in the sense of singular chains and, say rational coefficients, implies the same 
for regular or Vietoris chains. 


8. Let the general notations be the same as in Topology, p. 358, except that 
the pair (©’, L’) is merely a topological image of ($, L). We assume then L 
to be compact and HLC and shall prove that the basic fixed point formula (49), 
Topology, p. 359, holds for every ¢.s.v.t. T of L into itself. Or 

TueoreM. Let T be any c.s.v.t. of a compact metric HLC-set into itself and 
let g” be the matrix of the transformation which T induces on a basis for the rational 
p-cycles of L. Then the number of signed fixed points of T, 


(8.1) 6 = X(— 1)” trace ¢’, 


is a topological invariant of T, and if @ # 0, T has at least one fixed point. 

It will be observed that owing to [2], Theorem VIII, the matrices ¢ and also 
the sum in (8.1) are all finite. 

Application. If L is an AHR, every ¢.s.v.t. of T into itself has at least one 
fixed point. 

Coincidences. While we shall not consider them here, we may remark that 
the same proof would enable us to show that if L, L’ are two HLC spaces and 
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T, T’ are two transformations L — L’ such that T and T’" are e.s.v.t., then 
the number @ of their signed coincidences given by formula (48) of [5], p. 359, 
where all elements are finite, is a topological invariant of the pair T, T’. In 
particular, when 6 # 0, there are coincidences of T, T’. Generally speaking, 
accented elements are to represent the analogues in 9’, but not necessarily the 
topological images, of the corresponding elements in §. In particular, the 
choice of associated pairs N‘, NV” is to be made as follows. Having determined 
sequences {3}, {M’*| converging respectively to L, L’ on H and H’, let «; 
be the width of 93%. For a given j we choose an MN” such that WW, N”’ are 
related exactly as the associated euclidean space neighborhoods of [5], p. 353. 
Their projections on §,, 5, i sufficiently high, are precisely N‘, N’’. These are 
open neighborhoods of L‘, L’‘ whose closures N‘, N” are i-manifolds which 
may be assumed covered with simplicial complexes of mesh n; < «. 

Our next step must be the choice of the extensions T,, T:, loc. cit. The first 
is to be the c.s.v.t. § — H’ image of the identity for 5. The second is to be 
determined in terms of 7 as follows. Noticing that as usual we may take L 
connected, we see that the same will hold for the complex N‘ which is then a 
relative z-circuit. As such it has a fundamental 7-cycle T';, whose vertices shall 
be denoted by 2. They are of course likewise the vertices of the complex N‘. 
Now let L”’ be the image of Tin L—L’. Since T is a e.s.v.t., L’’ is homeo- 
morphic to L and hence likewise HLC. Let then y,;, be a point of L such that 
d(x, Yin) = A(x, L) and let za = Tyan, wn = yn X 2 CL’. The simplexes 
of N‘ make up a quasi-complex & of which the set {w,,} is a partial realization 
R’. Clearly mesh R’ — 0 with e;. Hence when it is small enough, 8’ may be 
completed so as to form a realization &’’ of R on L’’ whose mesh is less than a 
certain assigned £, and in &”’ the chain I; will have a certain image C;. The 
projection of C; on Ni X N’‘ shall be taken as the component G?' defining the 
term T2; of the sequence {T.;} = T.. The rest is then as in [5], p. 358. It 
is a simple matter to verify that I, is a finite contraction. Therefore the pair 
T,, T: comes under our conditions of applicability of the fixed point formula. 
This proves our assertion. 


9. The process which enables us to weaken the LC of Topology (pp. 347 
and 359) into HLC is rather obvious: under the LC assumption we could con- 
sider R’ as a partial realization of the true-complex N‘, then complete it to a 
true singular image K’’ of N‘ on L’’. K”’ is then a true image of a c¢.s.v.t.: 
Ni — L’ and hence even Ni — L’. Since all that we needed for our purpose is 
an 7i-cycle image of T;, it was sufficient to obtain a partial realization &’’ of 
the chains, elements of &, and this the HLC condition enabled us to do. 


III. Critical sets 


10. Morse has justly criticized certain definitions and results in [3]. From 
his criticisms it appears that the modifications to be indicated presently must 
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be made. Except for Theorem XI which must be modified, our proofs are 
adequate throughout, as they were intuitively based on the proper definitions. 

As a preliminary remark it is to be understood that all retractions of [3], or 
those to be mentioned, are deformation-retractions in the sense of No. 3 of the 
present paper. 

The basic modification required is that throughout [3], with exceptions to 
be noted presently, chain-shrinking and deformation are to be strengthened by 
demanding that they be “local” in the sense of No. 6. The exceptions are [3], 
4, 6 (from Theorem IV on), 17, where the same must be replaced by chain- 
deformation retraction. 


11. Analytical and topological critical sets. The proof of the second part 
of Theorem XI of [3] ({3], No. 23) rests on the faulty Theorem II of [5] and in 
fact the theorem is not correct under our definition of t.c.p. This is clearly 
shown by the function f = 2%, the point x = 0 being an a.c.p. but not a t.c.p. 
More generally, an a.c.p. may lack any topological features. This is the type 
known by experts as inflectional. The example just given, however, points 
also to the necessary modification. ‘ 

Consider the plane curve y = x*. The reason why the a.c.p. at the origin P 
is not topological is that the sections of the curve by the lines y = constant 
are all homeomorphic. On the other hand, if we cut the curve by the parallel 
pencil y + mx = const., m ~ 0 and arbitrarily small, we obtain two ordinary 
critical points (contacts of ordinary tangents) — P when m— 0. This may be 
interpreted as follows: the a.c.p. P is the limit of t.c.p.’s of the “modified” 
function z* + mr as m-— 0. Hence Theorem XI is restored provided that we 
add to t.c.p.’s the points which are limits of t.c.p.’s as m — 0. This is what 
we propose to do. 

In the case of a function f(x) of n > 1 variables a linear increment may be 
insufficient, notably if the Hessian H(f) = 0. This corresponds, for n = 2, 
to surfaces f = k which are developable. We therefore modify f, by adding a 
quadratic polynomial, into 


(11.1) F= f + Uujri+ Ny ViT;, Ni; = Aji - 


We shall now call P a quasi-topological critical point (quasi t.c.p.) whenever, 
given any open set U > P on Q and any « > 0, the parameters u may he chosen 
<e in absolute value and such that F has t.c.p.’s in U. 

It is now easy to show that Theorem XI holds provided we replace t.c.p. 
by quasi t.c.p. (Theorem XI’). For this purpose we may assume Aj; = \4j;, 
and show that \ and the u’s may be chosen as required. 

Now the reasoning of [3], No. 23, shows that for \ fixed # 0, H(F) # 0. 
Taking | \ | < ¢«, H = 0 will represent about P an analytical locus of dimension 
<n, and there will be points Q on a given open set U D> P for which H = 0. 
Since f.; ~ 0 when Q — P, we may find a point Q for which 


| uy| = | — (fz; + dijr;) | < ¢é 
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Now Q will be an ordinary a.c.p. of F, and hence a t.e.p. of F, on U, with 
the variables u, \ restricted as required. Therefore P is a quasi t.c.p. and 
Theorem XI’ follows (compare Morse [10], p. 178). 


12. General remark regarding analytical critical points. From the geometric 
point of view the definition of a.c.p. appears to be too strictly dominated by the 
analytical features of the basic function f(x). Thus the usual formulation does 
not cover the case of the function 


z=V1l-v-y, +s. 


To treat this case, it is necessary to consider z as a point-function on an Me, 
the sphere. The usual formulation applies even less, of course, to the implicit 
two-valued function z(z, y) given by 


P+yY+2=1. 


Even more striking exceptions could be obtained by taking algebraic surfaces 
or varieties with singular loci. 

From this point of view it would seem more natural to designate as a.c.p. 
any “new” singular point appearing in the horizontal sections y = f(x) as com- 
pared with those below it. This is the analytical analogue of our topological 
treatment in [3]. This would automatically do away with Whitney's para- 
doxical are of critical points, not in a horizontal section [8]. The preceding 
analytical considerations and similar topological considerations were among our 
prime notions for attaching the theory of critical sets, not so much to the prop- 
erties of f as to those of the whole locust y = f. 
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THE ALMOST PERIODIC BEHAVIOR OF THE FUNCTION 1/¢(1 + it) 
By AuREL WINTNER 


It is known! that the prime-number theorem implies the convergence of the 
development 


(1) 1/¢(s) = > u(n) n-*, 


which is obvious in the half-plane o > 1, at every point of the line « = 1 also. 
The object of the present note is to show that the trigonometrical series 


(2) 1/¢(1 + it) = p> u(n)n-“rio = > u(n)n- exp (—it log n) 
I I 
is the Fourier series of the function which it represents, i.e., that 
(3) 1/¢(1 + it) ~ yw u(n)n- exp (—it log n), 
i 


where the sign ~ refers to the class B? of Besicovitch.2. In other words, the 
function 1/¢(1 + 7) is almost periodic (B*), and, on placing 


Mi f(O} = lim Mri fo}, 
T=+% 


where 
T 

(4) Mri fl} = [ f(odt/T, 

0 
the mean value 
(5) Mife™/E(1 + i)} 
exists for every real \ and is 0 or u(n)/n according as \ + log n or A = log n, 
where n = 1, 2,---. On choosing n = 1, it follows, in particular, that 
(6) MpA/EC. + w)} 


exists and is equal to u(1) = 1. 

Since (3) refers to the class (B?), it also follows that Mt{| ¢(1 + it) |-*} exists. 
The latter result, proved by Landau on pp. 801-804 of his Handbuch, suggests 
but does not imply (3); it does not even imply the existence of the Fourier 
constants (5), (6). 


Received December 28, 1935. 

1 Cf. p. 811 of the article by Bohr and Cramér in vol. 2, III, of the Encyklopédie der 
mathematischen Wissenschaften, where several references are given. 

2A. 8S. Besicovitch, Almost Periodic Functions, Cambridge, 1932, Chap. IT. 
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The result is independent of Riemann’s hypothesis. On Riemann’s hypoth- 
esis the almost-periodicity (B*) of (1) easily follows for ¢ > 3, at least. For 
on denoting by «, the abscissa of convergence of the series (1) and by o, the 
abscissa of its absolute convergence, Littlewood has shown’ that o = 4 on 
Riemann’s hypothesis, while, of course, ¢, = 1. Thus it is seen from the mean 
value theorem of Schnee that, on Riemann’s hypothesis, Jt{| 1/¢(¢ + it) |*} 
exists for every ¢ > } = 3(o, + a.). Hence, since Littlewood’s treatment of 
the Lindeléf hypothesis implies* 1/¢(¢ + it) = O (| t |*) uniformly for ¢ > 4 4+ «, 
the statement follows from a general result of Besicovitch (op. cit., p. 164). 

Let q > 0 be a fixed integer, x > q + 1 a variable which will tend to infinity, 
and put 


(7) ft) = . u(n)n-Orin 
and 

(8) S,(t; 2) = > p(n)n-A+, 
Thus 


S(t) — S,(t;z) = > p(n)n-+i0 | 


z+l1 


Hence, on placing M(x) = » u(n) and using the identity 
i 


n+l 
nti) _ (n + 1)-(+in = (1 + it) / pa (2+it) dr, 
it is seen by partial summation that 


oo n+1 
f(t) — S,(t; 7) = (1 YF it) is M(n) / r-2+i0 dp — M(x) [x + 1)-"+*, 
z+1 n 


Now® 
M(x) = O(2/log® x) ast — + &, 
so that 
2 n+l 2 
> M(n) r?dr = > O(n/log® n) O(n) = O(1/log* x) 
z+1 n r+l1 
and 


M(x) [x + 1} = O(2/log® x) O(a") = O(1/log* x). 


8 Cf. E. C. Titchmarsh, The Zeta-function of Riemann, Cambridge, 1930, p. 78. 
* Ibid., p. 77. 
5 More than this is known; ef. Bohr and Cramér, loc. cit. 
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Consequently, if ¢ > 1, 

(9) f(t) — S,(t; 2) = tO(1 og! r) as x > + &, 

where the O-term holds uniformly for 1 < t < + & and q is fixed. Also,® 
1/¢(1 — it) = O(log t) ast +4 =x, 


while 
Kian 
Ad) — 1/7 -— #) | = > i u(n)n-2-* | < g = const. 
i 
in view of (1) and (7), so that 


(10) F.(t)| < C logt 


for every t > e and for some C = C, > 0. On combining (9) and (10) with 
the formal identity 


lta | cat S, r= — \fe sl S, |? + 2M (Sy ‘naj S)Sal, 
it is seen that 
| f(t) |? — | S,(t; 2) |? = COC log® x) + tO(1/log* x) log tas x > 4+ &, 


where the O-terms hold uniformly for e < t < + «. Hence, on using the 
notation (4), 


( Mert fol) 2} —Mrt Salt; x) 73 
11) 
= O(1/ log’ r)T? + O(1/logt xr) T log Tas xr — + =, 


where the O-terms hold uniformly fore < T < + «. 
On the other hand, since? 


S,(t; x) |? — > oy | p(n)/n = ’ u(n)u(m)(nm)-m/n)* 
q+1 a+1 q+1 
in view of (8), it is clear from 
r , 
[ (m/n)* dt = —i{(m/n)'™ —1}/log (m/n), where m # n, 
0 
that the absolute value of the difference 
(12) Mr{| S(t; 2) 2} — | u(n)/n 
qt+l 


® Cf. Titchmarsh, op. cit., p. 17; also p. 24. 
7 The accent in the double summation means that m # n. 








446 AUREL WINTNER 


is not greater than 


i > 2 | u(m)u(m) (mn)-'{ (m/n)'* —1} /log (m/n) | 
atl q 
<TD) do’ 2| mn log (m/n) |“. 
atl qt+l1 


Since q is fixed, the last double sum is* O(log? x) asx — + x. Finally, on 


z oO 
replacing in (12) the sum by one commits an error which is but 
’ 


q+1 q+l1 
| +» (u(n)/n)?| < } 1/n? = O(1/z). 
zt+l1 z+l1 


On combining these estimates of (12) with (11), it follows that 


x 


Mert | folt) 7} — DO | w(n)/n |? 


qt 1 
= O(1/log’x)T*? + O(1/log*x)T log T + O(log?x)T-! + O(1/r) asx > +2, 
where the O-terms hold uniformly for e < T < + x. On choosing T = log’ z, 
the error terms become functions of x alone and tend to zero as x > + ~, Le., 
asT—+ x«. Thusif T7— + ~ and q is fixed, then Mr}| f,(t) \?} tends to 
the limit 


De | u(n)/n|?, 
q+l1 


so that M}\ f,(t) *} exists and is equal to the last series. Since this series has a 
nx 


positive value less than }> n~, it follows that 


q+1 
Mii fi. 2} ~Oasq—~4+ ~. 
This may be written, according to (1) and (7), in the form 
(| 4 |) 
My > u(n)n arin — 1/e(1 + it)| p> 0, gq 4+ &. 
i i 
This completes’ the proof of (3). 
Tue Jouns Hopkins UNIVERSITY. 
8 Cf. Titchmarsh, op. cit., p. 30, where the double sum is shown to be 


o> ni > ry, 


n=l r=1 


where o = 1, so that the estimate becomes 


o( > n-! Li r ' = (> n~' log *) = (> n~' log :) = O (log? z). 


n=1 r=l n=1 n=l 


* Cf. Besicovitch, op. cit., p. 100 et seq. 














MOMENTS OF INERTIA OF CONVEX REGIONS 
By Fritz Joun 


Let R denote a closed and bounded two-dimensional convex region. Let d 
be the greatest, A the smallest diameter of R, a diameter being defined as the 
distance of two parallel lines of support.' Let A be the area and L the circum- 
ference of R. It was recently proved by F. Behrend that there exist for any R 
affine transformations transforming R into convex regions for which any one of 
the following inequalities is satisfied: 


d . A d . 

a ——_ eu _ $14/2; 

A = V2: Pies ’ i =i v2: 
if, moreover, R has a center, i.e., if R is symmetrical with respect to some point, 
then there are also affine transformations transforming R into regions for which 
any of the following inequalities hold: 

2 2 
4 <2, w L 

The corresponding equalities are all satisfied in the case of a square. 

Now let \ denote the ratio of the major and minor axes of the ellipse of 
inertia of R corresponding to the center of mass of R in a homogeneous mass 
distribution, i.e., of the “central” ellipse of inertia of R. We shall prove in 
this paper that the inequalities 


d - A 
ae .. 
(1) a = V2 (2) an = 1 
hold; if R has a center, then also 
&? 
ae 
(3) A =? 


These inequalities include some of Behrend’s results; for every R can be easily 
transformed by an affine transformation into a region for which the central 
ellipse of inertia is a circle, i.e., for which \ = 1, and in this case d/A < ¥/2, 
A/A* < 1, and if R has a center d?/A < 2 also. 


In a second paper I intend to show (1) that if R has a center, 
d > V2 + Vv 100, 
AX 3 : 
Received December 3, 1935. 
1 For notations see Theorie der konreren Kérper by Bonnesen and Fenchel; we shall 


refer to this book as B.-F. 
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(2) that if B is the area of Legendre’s ellipse of inertia of R, then? 


2r 


< —, 
Bs 3734. 


The same methods applied to convex regions in space no longer yield the 
best possible constants. For example, we obtain for a convex three-dimensional 
region with a center, for which the central ellipsoid of inertia is a sphere, the 
inequalities 


d 10 V T 
4 Ss \/ :? P = >? 
4) A=V3 a= 3 


where V is the volume of the region. 

We shall now prove these statements. Let R be a closed and bounded convex 
region of area A,. If g is a straight line, we call the distance Dp(g) of the 
lines of support of R which are orthogonal to g the diameter of R in the direction 
of g® Thus d = Maximum Dz; and A = Minimum D,. Let Jg(g) denote the 
moment of inertia of R with respect to g. If then g and A are any pair of 
orthogonal straight lines through the center of mass Cz of R, the inequalities 


! 1 


(5) 5 AeDih) S Lng) S 75 Audi), 
(6) An ¢1,(h) 

12 D2(h) ~ 
hold. 


In order to prove (6) we do not need the assumption that R& is convex. We 
consider the rectangle Q of the same area Ag which is bounded by the two lines 
of support of R parallel to g and by two equidistant parallels to h. Its sides 
are De(h) and Ag’ D,(h). Obviously the points contained in Q but not in R 
have a smaller distance from A than the points contained in R but not in Q. 
Thus 


3 
In(h) = Ig(h) = Ss. , 
12 Dj th) 
and (6) is proved. 

We now apply Sleiner’s symmetrization with respect to h; i.e., we carry every 
secant of R which is normal to h along its straight line, until the center of the 
secant lies on h. Let 2,2 = R, denote the region generated from R by sym- 


2 This inequality, together with the more elementary one B 2 A, was proved in a 
different way by Blaschke, Uber affine Geometrie, X1, XIV, XIX, in Ber. Verh. d. siichs, 
Akad. d. Wiss., vols. 69 and 70. Cf. also Vorlesungen viber Di fferentialgeometrie, vol. II, p. 64. 

3 In B.-F. called “Breite’’ of R. 

‘ Blaschke, Areis und Kugel, p. 45, or B.-F., p. 69 et seq. 
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metrization with respect to h. R, is again convex and is symmetrical with 
respect to h. Moreover Arg, = Ar, Cr, = Cr (as Ce already lies on h) and 


(7) Dr (h) = Dr, Tx,(g) = I,(g), Teh) S Ipglh). 


h is a double normal of R,,° i.e., the lines perpendicular to h in the points of 
intersection of h with the boundary R, of R, are lines of support of R,. 

Now let R2 = >,R;. Then R2 will be symmetrical with respect to g and h. 
Moreover, 


(8) Trg) S Ing), Dah) = Dr, (h), 


since h is a double normal of Ri. We consider the rhomb S of area Ax, = Ar 
which has as two opposite vertices the points of intersection of h with the 
boundary of R,. Because of the convexity and symmetry of R, the distance 
of any point of R2 not belonging to S from g is greater than that of any point 
of S not belonging to R.. Thus 


1 ° 
Trg) 2 I(g) = 54 Ax, Dy, (h). 


From this relation, together with (7) and (8), it follows that 


1 
Ty(g) 2 54 An Diath). 
We now return to R;. Let g* be the perpendicular bisector of the secant 
h of R,. As g passes through the center of mass of R; and is parallel to g* 
we have 


(9) Tr,(g) = Te,(g*). 


We reflect that part of R; which lies on one side of A with respect to g*, leaving 
the rest of R; unchanged. As R, is symmetrical with respect to h, the result 
will be a convex region R; symmetrical with respect to the point of intersection 
P of g* and h. Moreover, 

(10) Dr(h) = Deh), Ar, = An, In,(g*) = Ie,(g*). 


Finally let Ry = >,Rs. Ry is symmetrical with respect to Ah and P and 
therefore with respect to g*. Let Q be the rectangle of area Ax, = Ag, which 
is bounded by the lines of support of R, parallel to g* and by two equidistant 
parallels to h. Then, since Ry is convex and symmetrical with respect to g* 
and h, 


(11) Ix,(g*) = In,(g*) $ Io(g*) = i Ap, D?,(h). 


5 See B.-F., p. 52. 
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From (7), (9), (10), (11) we may then conclude 





1 
I,(g) s 2 A, Di th). 
Thus (5) is proved. 
From (5) it follows that 
. = aeatnee De <2 Maximum /, = 2y2, 
& Minimum D, Minimum /, 


according to the definition of the ellipse of inertia. 
Moreover, we get from (5) and (6) 








AR 1 " 
—_ < ) \s — A De 
12 Di = I,(h), I,(9) s 3 1, Di(h), 
or 
vp ath). Aa 
~ Ip(g) Diath) 
or 


A, Sd Dih) 
for every h. In particular, 
A, & dA?. 

Now let & have a center (i.e., R is symmetrical with respect to C,). Let 
D,(h) be a greatest diameter: D,(h) = d. Then h will be a double normal 
of R& Let Re = 2,R. Re will be symmetrical with respect to g and C, and 
therefore with respect to h. Moreover, Ar, = Ag, Ie(h) = Ig(h), Defh) = 
D,(h) = d. We consider the same rhomb S as we did after (8); from the 
convexity and symmetry of R, it follows immediately that 

1 Aj; 1 A} 
In(h) s Ih) = = —*- =5=—. 
R, ) = ‘ 6 D3 (h) 6 dP 
As according to (5) 


1 1 


54 A,@ = om A, Djth) < I,(Q), 
it follows that 
Apt < Tn(9) : An < yr Ae 
24 ~ 1[,(h) 6@~ 6 


Thus d? < 2\Ay,, if R has a center. 
In the case of a three-dimensional convex solid B with a center C we may 


5 See B.-F., p. 52. 
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proceed in the same manner. Let H denote a plane through C; let g be the 
normal of H in C. I,(H) and I,(g) may denote the moments of inertia of B 
with respect to H and g respectively, and V the volume of B. Let D,(g) be 
the distance of the planes of support parallel to H. 

Then J,(H), D,(g) and V are unaltered, and J,(g) is diminished, if we apply 
to B symmetrization with respect to some plane containing g. By a sequence 
of such symmetrizations one can transform B into a solid of revolution B, 
with axis g. This is the construction of Schwarz,’ which consists in replacing 
every plane section of B parallel to H by a circle of the same area and with 
center ong. B, is again a convex solid (Theorem of Brunn) of the same volume 
V and is symmetrical with respect to H. Besides 


In(H) =1,(H), — [n,(g) S Ie(g), — Da,(g) = Dalg). 


If the Q is the cylinder of revolution with axis g and volume V which is bounded 
by the two planes of support of B, parallel to H, then 


Vv? 


In(9) 2 109) = sy 


i.€., 


r2 


eee. ae 
(12) I,(g) = 2=Dalg)’ 


Moreover, as B, is a convex solid of revolution and symmetrical with respect 
to H, we have 


In(H) < I(H) = 4 V Dio); 
thus 
(13) 1,(H) 5 45 V Dig). 
Finally, we compare B, with the double cone S of volume V which is sym- 


metrical with respect to H and the vertices of which are the points of inter- 
section of g with the planes of support orthogonal to g. Then 


In(H) = IH) = 7. V D3), 
i.€., 


(14) In(H) = ip V Dig). 


In the particular case where the central ellipsoid of inertia is a sphere, we have 


T,(g) = 21,(H) = const. 


7 Blaschke, Kreis und Kugel, p. 86, or B.-F., pp. 71-72. 
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Thus it follows from (13) and (14) that 


d _ Maximum Dy — , /l 
A Minimum Dz ~ a” 


and from (12) and (13) that 


V< ; Di(q) 


for every g, 1.€., 


UNIVERSITY OF KENTUCKY. 














BLOCH FUNCTIONS 


By RapHaret M. Rosinson 


In this paper we prove the following theorem. 

If f(z) = x +.--- is regular in |x| < 1, and maps |x| < 1 on a (many- 
sheeted) region such that the upper bound of the radii of circles contained in a 
single sheet of the region is as small as possible, then the unit circle is a natural 
boundary for f(x). 

In proving this, we introduce a method which can probably be used to obtain 
much more extended results about the functions which map | x | < 1 on regions 
not containing circles any larger than necessary. 

This paper is divided into three sections. §1 contains some preliminary 
material concerning Bloch’s Theorem. §2 contains some lemmas about special 
mapping functions. §3 contains the above theorem and another similar 
theorem, and some remarks concerning further results about the functions 
mentioned above. 


1. Let R be a region in the complex plane, and let f(z) be regular in R. Then 

(1) f(z) is said to be univalent (= schlicht) in R, if f(71:) ¥ f(z2) for 2, # 22, 
x, and ze in R. 

(2) If S is a point set in the complex plane, f(x) is said to assume S in R, 
if for every y in S there is an z in R, such that f(x) = y. 

(3) If S is a point set in the complex plane, f(z) is said to assume S uni- 
valently in R, if f(x) is univalent in a subregion R, of R, and assumes S in R,. 

Bloch’s theorem may be stated in the following form. 

If f(x) = x + --- ts regular in |x| < 1, there is a complex number yo such 
that f(x) assumes | y — yo| < P univalently, where P > 0 is an absolute constant. 

Here yo depends on the function f(x), but P does not. 

There are three constants connected with this theorem defined as follows. 
% is the upper bound of constants P which satisfy the theorem. is the upper 
bound of P if we strike out the word “univalently”. is the upper bound of P 
if f(x) is assumed to be univalent; here it is immaterial whether the word 
“univalently” is present or not. 

The relations 8 S % S Aare obvious. Landau! has given numerical bounds 
for the values of the three constants; in particular, he has shown that ¥ < YW. 
An improved upper bound for % was given by me.? 


Received February 3, 1936. 
1. Landau, Uber die Blochsche Konstante --- , Math. Zeitschrift, vol. 30 (1929), pp. 
608-634. 
? Robinson, The Bloch Constant U --- , Bull. Amer. Math. Soc., vol. 41 (1935), pp. 535-540. 
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Let © = &, ¥, or A, with the following understanding. When € = &, 
“assume properly’’ shall mean “assume univalently”; otherwise, it shall mean 
simply “assume”. By an admissible function we shall mean a function 
f(z) = x + --- which is regular in |x! < 1, and, if € = A, is univalent 
in|x| <1. 

Then the definitions of 8, %, and % go into the following statement. C€ is 
the upper bound of constants P such that every admissible function f(2) assumes 
the interior of a circle of radius P properly. 

If ¢ > 0, not every admissible function f(x) properly assumes the interior of a 
circle of radius © + ¢«. We wish to show that there are admissible functions 
which do not properly assume the interior of any circle of radius greater 
than GC. 

For every positive integer n, choose an admissible function g,(z) which does 
not properly assume the interior of any circle of radius € + 1/n, and which 
satisfies the inequality® 


lg.(z)| S — : for |x| <1. 


1 —|2? 





Then the sequence g,(x) is uniformly bounded in the circle |x| S r, if r < 1. 
Hence a subsequence can be chosen which converges uniformly in any circle 
|2| S r,r < 1, and hence converges to an analytic function f(z) in| xr, < 1. 

We thus obtain a sequence f,,(x) such that f,,(x) is admissible, does not properly 
assume the interior of a circle of radius € + 1/n, and such that f,(x) converges 
uniformly to f(z) in |x| S r, for every r < 1. 

We wish to show that f(x) is an admissible function which does not properly 
assume the interior of any circle of radius greater than ©. In the first place, 
f(x) is admissible. For f(x) is regular in |z| < 1, of the form x + --- , and 
if the f,(x) are univalent, so also is f(z). It remains to be shown that f(r) 
does not properly assume the interior of any circle of radius >€. We shall 
distinguish two cases, according as “‘assume properly’? means (1) “assume” or 
(2) “assume univalently”. 

(1) Suppose that, in |x| < 1, f(x) assumes the circle | y — yo| < © + 2e, 
where y is a complex number, ande > 0. Toany point y,;in | y — yw | S$ C+. 
there is an 7; > 0, and a function /,(y) regular in | y — y: | S 2r, except possibly 
for a branch point of finite order at y:, and such that in |y — y:| S 2n, 
| hilfy) | < 1 and f(hi(y)) = y (ie., hi(y) is a branch of the function inverse to 
f(x)). We can find a finite number of points y:, --- , ym, and corresponding 
Ti, +++ Tm, Such that the circles | y — yx | <r, (1 S k S m) cover | y — yo| S 
© + ¢. The values assumed by the function h,(y) (corresponding to the point yx) 
in| y — yx | < 27, forma region R, in the interior of the unit circle. f(x) maps Ry 
on the circle | y — y | < 2r,, possibly counted multiply. Since f,(7) converges 
to f(x) uniformly in Ry, f,(2) maps Ry on a region containing | y — ye| <r, 
forn > N,. Hence forn > N (= max N,), f,(x) assumes | y — yo | < © + €. 


§ Landau, loe. cit., p. 618. 

















erges 
< Tk, 
+e. 
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(2) Suppose that, in | z| < 1, f(x) assumes the circle | y — yo| < € + 4e 
univalently, where yo is a complex number, and « > 0. There is a region R, 
in |x| < 1,such that f(z) is univalent in Ry and maps R, on | y — yo | < € + 4e. 
There are regions R; and R; within Ry, which f(z) maps on | y — yo | < € + 2 
and |y — yo| < © + 3¢, respectively. In R;, f(x) is univalent, and f,(2x) 
converges uniformly to f(x); hence for n > Nj, f,(x) is univalent in R,.. For 
n > N (2 %M,), f,(x) assumes | y — yo| < © + ein Ro. 

Hence (in either case), if f(x) properly assumes the interior of a circle of 
radius © + 4e, f,(x) properly assumes a circle of radius € + ¢, for n > N; 
but since f,(z) does not properly assume a circle of radius € + 1/n, this is 
impossible. Therefore f(x) does not assume a circle of radius >€ properly. 

Any admissible function which does not assume the interior of a circle of 
radius greater than © properly will be called a Bloch function. It will be 
said to be of the first, second, or third kind, according as € = &, &, or . 

We have shown that there are Bloch functions of each kind. If f(x) is a 
Bloch function, so also is f(axr)/a, where | a | = 1; it is not known whether Bloch 
functions are unique except for this trivial transformation. Bloch functions 
of the first and second kind are not univalent, since 8 and ¥& are less than %; 


it is not known whether Bloch functions of the first and second kinds are 
different from each other. 





2. By the inner radius of a region R with respect to a point a in R is meant 
the number p > 0 such that |x| < pcan be mapped on R by a function of the 
forma+2-+---. An equivalent statement is that |x| < 1 can be mapped 
on R by a function of the form a + pr + .-.--. 

Lemma l. LetO0<@6< 7,15 a2. Let R(a, 8) be the region obtained from 
|y| < 1 by modifying the boundary in the following way. Replace the arc of the 
unit circle joining e*, and passing through 1, by another circular are joining 
e+, and such that the angle formed at e**, measured within R, is ar. Then the 
inner radius of R(a, 0) with respect to the origin is 


a sin (6/a) 


sin 6 


Proof. Ii Sw > 0, we may write w = | w| e*, where 0 < ¢ < 7; by w* we 
shall mean | w *e’**,. The transformation 


y—c* z — e\a 

e*®y — 1 e*z — 1 

takes |z| < 1 into R(a, @). The point y = 0 corresponds to a point z deter- 
mined by e®/« = (z — e)/(e®z — 1). This gives z = — a, where 





e* — e#le _ sin [(@ — 6/a) 2) 
ea — 1 sin [(0 + @/a)/2)’ 
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so thata 2 0. If we put 





_u-a 
1 — aw’ 
which takes < linto|z| < 1, then u = 0 corresponds to z = —a, and 


|u| 
hence to y = 0. 


Now 
du u=0 . dz J z= —a (e*®a + 1)? 


Multiplying these together, substituting the value of a, and simplifying, we have 


(‘”) _ «sin (8/a) 
du] .—0 sing — 


Since the transformation from u to y takes | u| < 1] into R(a, 6), in such a way 
that u = 0 goes into y = 0, this is the required inner radius. 

Lemma 2. The inner radius, with respect to the origin, of the region obtained 
from |u| < 1 by removing the points r S u < 1, where 0 <r J 1, is 


vee. ae 
(1 + r)* 


Proof. Since v = x/(1 + x)* maps |x| < 1 on the v-plane excluding the 
half-line v = 1/4, the required mapping function is determined from 


P 


u zx 


Guy "G+ay 
with the condition | u| < 1, and this transformation is of the form 
u=pr+:-::-. 


Lemma 3. The inner radius, with respect to the origin, of the region obtained 
from R(a, 0) by excluding the points for which y = 1 — b@ is greater than 1, 
ffl <a < 2, 


b < cot (@ are cot [2(a? — 1)/3]"*) = 8B, 


and 6 is sufficiently small. The number 8 cannot be replaced by any larger number. 
Proof. The circle |x| <1 can be mapped on the region obtained from 
|u| < 1 by excluding the points r < u < 1, by a transformation of the form 


u= pr +---. |u| <1 can be mapped on R(a, @) by a transformation of 
the form y = ([a sin (@/a)]/sin @)u + ---. Combining these transformations, 
we map |x| < 1 on a region obtained from R(a, 6) by excluding certain points 
on the real axis. We can choose p so that y = x 4+ --- (i.e., so that the inner 


radius of the last region with respect to the origin is 1). Suppose that in this 
case the end of the cut is at the point 1 — «6, where « depends on @ and a. 
The proof will be complete if we can show that «x — 8 as @— 0. This in turn 
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will follow if the position of the end of the cut is given by a power series in @ 
of the form 1 — B@+..-.-. 
Choosing the value of p stated above, we have 


sin 6 1 1 
>= ——_—_. = ] - 1——)@+.--. 
f a sin (6/a) 6 ( 5) : 
the series containing only even powers of 6. If 7 has the meaning of Lemma 2, 
then it can be calculated from p, and the result is 





r= 1 — 2c6 + 2c?@2? 4 ..., 
where 
ce = [(1 — 1/a*)/6}"/. 


Let a have the meaning of Lemma 1. Then a simple calculation shows that 





y a—l - 
6 ane Ee a coe |, 
a+l1 ( + 6a + ) 
rhe point x» = 1 goes into uw = r. The transformation used in Lemma 1 takes 
this into the point z = (r — a)/(1 — ar). Putting in the values of r and a, 
and simplifying, we find that 
a = 1 — 2acé on 2a°c"6* oa ere, 
” With this value of z , we find that 
», 1) ; 
Zo — ¢ 2ac + 7% , 
ceiedetittninaint SB a (J ARP oo 
e*z,— 1 2ac—1 a+ + , 
where A depends on @ in a manner which we do not need to determine. Hence, 
for the corresponding point yo, 
yo — e”* , 
— ae SS ACI oa aA on *¢*e ), 
ved e*®yy — 1 
1, where 
_ (2ac + 1\* 
2ac — 1 
ber. Solving this for yo , we find 
rom. 
orm A+ 1. 
a ee wr 
n of Yo —. + 
ome Now (2ac + 7)/(2ac — 7) is a quantity whose absolute value is 1, and whose 
ints argument is 2 are cot 2ac; hence 
nner . 
this om, 
id a. where 


turn y = eare cot 2ac. 
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Therefore 


A$1._ er peri, 
J i= ——_——- ¢ = cot +. 
rd ea. 1 e'? Sidi ey Y 


But 
cot y = cot (a arc cot Zac) = cot (@ are cot [2(a? — 1)/3]'/*) = 8B, 
so that 
yo = 1— pO+-:--. 


Thus the position of the end of the cut is expanded in a series of the required form. 
Lemma 4. The inner radius with respect to the origin of the region R,(@) ob- 
tained from |y| <1 by adding the interior of the circle through e® and e~* 
orthogonal to the unit circle, and then taking away the points for which y = 1 — 6/3, 
is greater than 1, if 0 < @ < 0, where % is an absolute constant. 
Proof. This is a special case of Lemma 3, since 


1/3 < cot [(3/2) are cot (5/6)"?] = 0.336... . 


3. The theorem stated at the beginning of the paper may now be easily 
proved. 

THEOREM 1. Every Bloch function of the first kind has the unit circle as a 
natural boundary. 

Proof. Suppose that the unit circle is not a natural boundary for every 
Bloch function of the first kind. Then we can find a Bloch function f(x) of the 
first kind which is regular at 1. There is a positive 6 < % such that f(x) is 
regular in the region R,(@) of Lemma 4 for every 6 < 6, and maps R,(@) on a 
region which is obtained from the map of |x| < 1 by replacing an analytic 
boundary are J, which is approximately a straight line, by a new analytic 
are K, which is approximately a semicircle on J as a diameter, and which lies 
on the outside of the map of |2| < 1, and then making a cut L along an 
analytic are which is approximately a diameter of K, orthogonal to J, the length 
of the cut L being approximately two thirds of the diameter of K. When we 
say that this part of the boundary is approximately a certain shape, we mean 
that it can be made as nearly that as we please, by taking @ sufficiently small. 
Since F,(@) has an inner radius greater than 1 with respect to the origin, its 
map must contain circles of radius greater than 8. But it is clear that, if @ 
is sufficiently small, replacing the boundary are J of the map of |x| < 1 by 
K and L could not increase the size of the circles in the map. 

THEOREM 2. Let f(x) be a Bloch function of the third kind, and map |x| < 1 
ona region R. If an analytic arc D is part of the boundary of R, there are points 
of R on each side of D in the neighborhood of any point of D. 

Proof. Dis the map of an are £ of the unit circle along which f(x) is regular 
and f(r) # 0. Without loss of generality, let 1 be a point of E, and f(1) the 
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point of D under consideration. Then there is a positive 6 < 6), such that 
f(x) is regular in the region F,(@) of Lemma 4 for every 6 < 6. If R lies 
only on one side of D in the neighborhood of f(1), then R,(@) is mapped by 
f(z) on a region which does not overlap itself, if @ is sufficiently small. The 
further argument is then as in Theorem 1. 


Suppose that fi(x) is a Bloch function of the first kind, and maps |x| < 1 
on a many-sheeted region R,. We should like to show that 2; has no boundary. 
(Branch points of any order in R,; would serve to limit the size of the circles 
assumed univalently by f,(x), provided that these branch points were suitably 
distributed.) R, cannot have an analytic boundary arc, as Theorem 1 shows. 
On the other hand, 2, cannot have a boundary are which is not sufficiently smooth 
to allow a circle of radius $ to roll along it, on the inside of R,;. For in this 
case we could modify R; by simply adding something to it, in such a way as to 
increase its inner radius with respect to the origin, but not the size of the 
circles in it. The remaining case is intermediate between these. In this case, 
one might try to use a method similar to that used in the case of an analytic 
arc; but in this case the modification of the boundary would have to be made 
directly in the map, instead of first in the unit circle. The difficulty comes in 
calculating the effect of such a modification of the boundary of R, on the inner 
radius of R, with respect to the origin. Were this difficulty overcome, and the 
results as anticipated, the same method could be used to show that a Bloch 
function fo(x) of the second kind maps | z | < 1 on a boundaryless many-sheeted 
region #, (branch points of infinite order in R: serving to limit the size of the 
circles assumed by j2(x)), and that a Bloch function f;(x) of the third kind maps 
|z| < 1 on a plane region R; of which every point in the plane is an interior 
or boundary point. 


Brown UNIVERSITY. 











AN EXTENSION OF THE TABLE OF BERNOULLI NUMBERS 
By D. H. LexHmMer 


As part of an investigation on Fermat’s Last Theorem being conducted by 
Professor H. S. Vandiver under the auspices of the American Philosophical 
Society, it was thought advisable to extend the existing tables of Bernoulli 
numbers to be used for direct divisibility tests in seeking to establish the regular- 
ity' of primes. Congruence properties of Bernoulli numbers are important in 
other branches of the theory of numbers such as class-number problems, and 
it is hoped that the table given below will prove useful in these and other 
connections. 

Previous tables of the numbers of Bernoulli have been given by Euler, 
Ohm,* Adams* and Serebrennikoff.6 The table of Adams gives the first 62 
(non-zero) numbers of Bernoulli, while Serebrennikoff’s calculations extend 
to the first 92 numbers, the first 90 of which have been reprinted by J. Peters® 
and H. T. Davis.?. The present table gives 20 additional numbers, thus making 
available the first 110 Bernoulli numbers. 

The method used by Adams and Serebrennikoff (except for his last number) 
was based on the fundamental though inefficient umbral recurrence relation 


(1) (B + 1)* = B* (n > 1), 


the fractional terms being elimimated by means of the von Staudt-Clausen 
theorem.‘ A recurrence with fewer terms than (1) would have been preferable. 
Many such recurrences involving either B, or numbers closely allied to B,, 
are available.6 As the recurrences become shorter, however, the coefficients 
ultimately increase in complexity. The best compromise® seems to be the 
following lacunary recurrence for the so-called numbers of Genocchi," which 
occur in the expansion of tan(z/2) and are connected with the Bernoulli 
numbers by 


(2) G, = 2(1 — 2")B,. 


Received January 29, 1936. 

' A prime p is said to be regular if it does not divide the numerator of any of the first 
(p — 3)/2 non-zero Bernoulli numbers. 

2 Euler, Acta Petropolitane, vol. 5 (1781), Part 2, p. 46. 

3 Ohm, Jour. fiir Math., vol. 20 (1840), p. 11. (Table computed by Rothe.) 

* Adams, Jour. fiir Math., vol. 85 (1878), pp. 269-272; Cambridge Observations, vol. 22, 
appendix; British Association Reports, Sectional Transactions, 1877, pp. 10-11; Collected 
Papers, vol. 1, 1896, pp. 425-451. 

5 Serebrennikoff, Academia Nauk, (8), vol. 16 (1905), No. 10; ibid., vol. 19 (1906), No. 4. 

6 J. Peters, Zehnstellige Logarithmentafel, vol. 1, Berlin, 1922, table 8, p. 83. 

7H. T. Davis, Tables of Higher Mathematical Functions, vol. 2, Bloomington, 1935, 
pp. 230-233. 

8 See N. Nielsen, Traité Elémentaire des Nombres de Bernoulli, Paris, 1923, for references. 

® Other methods for computing large Bernoulli numbers such as that used by Serebrenni- 
koff in computing Ng and the central difference method of Joffe [Quarterly Jour. of Math., 
vol. 47 (1916), pp. 103-126, and vol. 48 (1917-20), pp. 193-271] were also considered. 

 Genocchi, Jour. fiir Math., vol. 99 (1886), pp. 315-316. 
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The recurrence" in question is 


{n/3] 2n (f Qn,ifn = 3K —1 
‘ Jae . G * = J ’ a ’ 
(3) 4G2, + 3 >> 4 2n—60 oe \—4n, otherwise. 


As it stands, this is not in the form best suited for calculation. If we define 


(4) — e 2n G 
g\n, ~ N6r | 2n—6a |» 


(3) becomes 


" ae. FF ‘ 50 n(—1)" 
(5) 100 | Ge, | = 75 >> (—1)'g(n, d) + \ 100 n(— 1)", 


For \ > 0, the g(n, d) are readily computed from previous g’s by 


g(n, A) = g(n — 3, — 1) f(n, A), 
where 

fia, 2) = Be — D- - & 

GA(6X — 1) --- (A — 5) 

Since g(n, A) is an integer, the denominator of f(n, 4), when it is in its lowest 
terms, must divide g(n — 3, \ — 1). This serves as a useful check in com- 
puting g(m, A). The disadvantages in handling the somewhat larger numbers 
G, are more than offset by two advantages: (A) the G’s are integers, so that 
no additional work in eliminating fractions is necessary, and (B) an excellent 
check is afforded by writing 


(6) | Ban | _ N,/D,, 
and thus obtaining from (2) 
(7) N, = Gen, ‘d, , 


where d,, is the integer 2(4" — 1)/D,, and thus obtaining N, as the quotient 
of an exact large-number division. (For n = 110, Ge» has 313 digits, while dj.» 
has 63 digits.) This check assures the correctness not only of the present table 
but also of the tables of Adams and Serebrennikoff inasmuch as their values 
were used in computing the table” of G,,. 

The arrangement of the numbers in the table of Bernoulli numerators is a 
departure from the traditional method of writing large numbers in a few long 
lines. It will be found that with standard computing machines large numbers 
may be dealt with more effectively if grouped in “periods” of 9 figures and 
written in columns. 


1D. H. Lehmer, Annals of Math., vol. 36 (1935), pp. 637-649. The No; computed in 
this paper by a recurrence with gaps of 12 agrees with the value found from (3). Two 
errors in this paper should be noted. These occur in the recurrences for EZ, with gaps of 
6 and 12, where 2 and 2"A~! should read 2°~? and 2", respectively. 

2 A complete table of G, (n = 1 to 220), together with material by means of which the 
present table may be readiéy extended, is deposited in the library of the American Mathe- 
matical Society. 
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The corresponding denominators are as follows. 


n 


91 
92 
93 
94 
95 
96 
97 
98 
99 
100 


LeHIGH UNIVERSITY. 


D. 


D, 
6 
1410 
42 
6 
12606 
868841610 
6 
171390 
244713882 
1366530 
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n 


101 
102 
103 
104 
105 
106 
107 
108 
109 
110 





D, 
6 
281190 
6 
27030 
9225988926 
3210 
6 
15270994830 
6 
7590 











BOOLEAN FUNCTIONS AND POINTS 
By J. C. C. McKinsey 


One of the important topics of algebraic geometry is that of the relations 
between points and rational curves. How many arbitrarily selected points, 
for example, can lie on a curve of degree n, and how many points are necessary 
to determine a curve of degree n? In this paper, I treat analogous problems in 
Boolean algebra. I take, for the analogue of the rational function in ordinary 
algebra, the Boolean function in Boolean algebra; that is to say, a function of 
Boolean variables which can be expressed by a finite number of applications 
of the Boolean operations +, X, and ’. For the analogue of the points of 
ordinary algebra, I define Boolean “points’’ as follows: 

DeriniTi0on 1. By an n-space Boolean point is meant an ordered set 


(21,1, piel: » Ba.03 Se) 


of Boolean elements. The ordered set (21,1, --- ,2n,1) is called the abscissa 
of the point. By saying that the point satisfies the function f(x, --- , 2), 
I mean that f(71,1, --- ,2n,1) = 2; I also say that the function passes through 


the point, or that the point lies on the function, and use other terminology as 
in algebraic geometry. 
My results are now developed in a series of theorems; a summary will be 
found at the end of the paper.' 
LemMa TO THEOREM I. [f in the complete expansion of 1 in terms of n variables 
, , te 
(1) Tiss In Mees Teak, Hee $Me ay 
. 2 , . . 
we substitute uv; for x; and u;v; for x;, the resulting expression, namely, 
=, ee 0 .¢ 
Uy +++ UnVy f UW +++ UnWnal,Y, + +++ + ULV, +++ UAY,, 
is equal to 
(ujAv;)(ueAve) --- (U,Av,). 
Proof. The expression (1) is equal to 


(2) (x, + 21)(2 + 1g) --+ (2a + 2,). 


Received December 16, 1935. 

! Besides the ordinary operations of Boolean algebra, I make use of the operations 
e and A defined by a ° b = ab’ + a’b, and aAb = ab + a’b’. These operations are associative 
and commutative and mutually dual. A systematic discussion of them will be found in 
B. A. Bernstein’s paper Postulates for Boolean algebra involving the operation of complete 
disjunction to appear in the Annals of Mathematics. I take this opportunity to acknowl- 
edge the valuable suggestions made by Professor Bernstein in connection with the present 
paper. 
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It is clearly immaterial whether the proposed substitution be made in (1) or 
in (2). Making it in (2), we have 


(ur + U0, ) (Ur, + UyV) +++ (adn + UAD,), 
or 
(u,Av,)(ueAve) -- + (u,Av,,). 
THeoreM I. Jf r points {(21,:, --- ,2%ni32%)},t = 1, ---, 7, be given, there 


exists a function passing through the r points if and only if 
a > (zjezj)(a1,: Aa), ;)(%2, Are,;) - ++ (@n Arn, ;) = 0. 
j=1i=1 


The r points determine a unique function if and only if in addition 


I] Gust +s tad +--+ UT Git -s +22.) = 0. 
t=1 i=1 

Proof. There exists a function passing through the r points if and only if 
there exist elements A;, --- ,A, (where, for typographical reasons, I write 
m = 2”) such that 


(1) ee Se ee re a  F (@ 


II 
— 
~ 
~ 
~" 


Conditions (1) are equivalent to 


Ax(n.: re Fn, 86) + A,(z,, wien In. 2;) +--+ + AL(2; ; 2» en Ln, 2) 


+ A.(2;.; +++ 2, 2;) = 0 (¢ = 1, ..-, 94). 


Conditions (2) are in turn equivalent to the single condition 


r 


A’ > (1,3 22+ In iti) + At . (x4, ; a t,. 2;) + -:> 

. i=1 i= 

(3) 
“ A,. # (x; tae 2, 2) + Ay » (23.3 tae x, 2.) = 0. 
i=1 t=1 

Thus a necessary and sufficient condition that there exist a function passing 
through the r points is that there exist a solution of (3) considered as an equa- 
tion in A;,---,Am. But there exists such a solution if and only if the follow- 
ing conditions obtain: 


ys (ai.5 +++ Ln.iti) - - (21,5 «++ te.2,) = 0, 
i=1 i=1 


(4) 


> (x; , ool rz) . > (2; .; er ae = 0). 
i 1 
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Conditions (4) are equivalent to the conditions 


y > (2; + z;) (x1, 1, j “ss Ln, Fn, il = 0, 


(5) i adh oe 
>» , (2,°2) (21,21, °°" x, a, J=0. 


Conditions (5) are equivalent to the single condition 
©) x x (25+ 2)) [rita s+ Pn ing Hove + 21.1.3 cee 0a = 0. 
By the lemma, condition (6) is equivalent to 
(7) > > (:+84) (x,,cArn.4) «++ (te.chte,,) = 0. 
im i 


Condition (7) is the condition specified in the first part of the theorem. 
For the second part of the theorem, we see? from (3) that we must have 


(x (m1,i ++ rc) s(¥ (m1,i ++ ra.) = 0, 


(8) fa ce Pa ate 
(> leis a red) a( (21.4 al r1.20)) = 0. 


To say that (8) holds is equivalent to saying that (4), and hence (7), holds, 
while in addition, 


(x (m5 ++ ri) . (x (ai ++: r..2))) = 0, 


i=l i=1 
(9) | wh a at 
(> (2). vies v1.) (> (ri; wee r.,21)) = 0). 
i=1 i=1 


Taking the negatives of both sides of (9), we have 


, 


> (a4, ; ee In, Zs) + z. (x1, i iy In, i2;) = 1, 


i=1 i=1 


(10) Ay ee ony 
> (2:4 ose 2a 5) + + (x3; 52, 2 =. 


i=1 ‘=1 


? A condition that a Boolean equation have a unique solution is given by A. N. White- 
head, Memoir on the algebra of symbolic logic, American Journal of Mathematics, vol. 23 
(1901), pp. 140-150. A simpler derivation for this condition is given by B. A. Bernstein, 
Note on the condition that a Boolean equation have a unique solution, American Journal of 
Mathematics, vol. 54 (1932), pp. 417-418. 











468 J. C. C. MCKINSEY 
This is again equivalent to 
r r ’ P 
(11) > tc -++ tac @ 1,---, > 215 °°° Zac = 1. 
t=1 


1=1 


Taking the negatives of both sides of (11), we have 
(12) [J (i,t: +20 =0---, TT Git --- +m.) = 0. 
ad i=1 


Writing (12) as a single condition gives the condition specified in the second 
part of the theorem. 

THeoreM II. Jf P, is an arbitrary point, there exists a function passing 
through P,. But if r > 1, there does not necessarily exist a function passing 
through r arbitrarily selected points. 

Proof. If we take r = 1 in Theorem I, then 


SF Gsm) Geshe. «++ (eacdeed 


j=l i=l 


= (2, ¢ 21) (a1,1.Aa1,1) «++ (@n Arn.) = 0. 


Hence there always exists a function passing through one point. Indeed, such 
a function is f(71, --- ,2%,) = 2. 

To show the second part of the theorem, we need to show that if r > 1, we 
can choose r points so that the expression 


> Dd (z+ 2) (ais Axi, j) ++ (ani Ata, 5) 

j=1 t=1 
does not vanish. But this can always be done by taking 

21 = Ze, M1,1 = 71,2, oo Init = Xn,2; 
for then 
DY Dd (2:02) (are Aas, j) +++ (emi Aan, 5) > (ere 22) (a1.1 Ate) «++ (tna AXn,2) 
j=li=l 
= (21 °2;) (a1. Ar1,1) +++ (2n,1 Atn,.1) = 1, 


so that 


> >> (2; + 2;) (a1,¢Aa1,;) «++ (tn, Aen) = 140. * 
 7=1 1=1 


It will be observed that in the last part of the proof just given we have 
employed two points (21,1, --+ , 2n,13 21), (Wa, +++ » Enya} z1), which are such 
that if a function passed through both of them it would not be single-valued. 
This is impossible for the functions we are considering. In this connection 
the following theorem is of interest. 

THeoreM III. Let P,, Pe, ---,P, be a set of r > 1 arbitrarily selected 
points having distinct abscissas; then 
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1. In a two-element algebra, there always exists a function passing through 
Pe, «++ 5 Pes 

2. In any other algebra, there does not necessarily exist a function passing 
through P,, ---,P,. 

Proof. 1. Consider, for a two-element algebra, the expression 


> > (2; + 2) (aa,¢ Ami, j) +++ (@n,¢ An, ;). 


j=l i=1 


Any term of this sum for which 7 = 7 vanishes because z; +z; = 0. Now let 
T = (2; + 2;) (ai: Ari, j) +++ Cn, Ata, ;) 


be a term for which «7 # j. Since, by hypothesis, P; and P; have different 
abscissas, there must be some k such that 2; # a,;,;. But 2x,,; and 2x, ; 
are each either 0 or 1, hence z;,;Az;,; = 0, since 0Al = 140 = 0. Hence 
T = 0. Thus we have 


DY Dd (e+) (ar. Aan.;) +++ (ane Arn, ;) = 0. 
j=l i=1 
Hence, by Theorem I, there exists a function passing through P,, --- , P,. 
2. An algebra which is not a two-element algebra must contain an element a 
which is different from 1 and 0. Consider now P;, P2, ---,P,, where 
P, = (1, 1, ---, 1; 1), P2 = (a, a, --- , a; 0) and P;, --- , P, are arbitrarily 
chosen. Then 


.m > (2; + 2j)(aie Mary) +++ (Xn Any) > (21+ 22)(t1a A aie) +++ (tn An) 


7=1 i=1 


= (1-0)(lAa) --- (lAa) = (1)(a) --- (a) =a # 0. 


Hence 


U r 


b (2; + z(t Aaj) «++ (tn ATny) ¥ 0, 
j=1 i=1 
and thus by Theorem I there does not exist a function passing through the r 
points. 
The first part of Theorem III is of some interest in connection with the logic 
of propositions, since the logic of propositions is a two-element Boolean algebra. 
The theorem then asserts that we can find a function of the r propositional 


variables pi, --- , Pp; Which will assume any pre-assigned sets of truth-values 
for any pre-assigned sets of truth-values of p,, --- , p,; this property has been 


called “symbolic completeness” by C. I. Lewis.? The theorem is proved in 
another way by E. L. Post.‘ 


® Lewis and Langford, Symbolic Logic, p. 231. 
4 Introduction to a general theory of propositions, Amer. Jour. Math., vol. 43 (1921), pp. 
163-185. 
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The following theorem sets a lower limit to the number of points that can 
determine a unique function. 
THEorREM IV. Jf r < 2", then there exists no set of r points 


{ (21,4) +++ » 2n.c3 2e)}, t=1,---,7, 


which determines a unique function. 

If r = 2", there exist sets of r points which determine unique functions as well 
as sets of r points which do not determine unique functions. 

Proof. To show that r < 2" points never determine a unique function, it is 
sufficient, by Theorem I, to show that the expression 


(1) I] Git +++ +n) +0 + TD Git + +2) 
i=1 t=1 
does not vanish for any choice of the x’s if r < 2". 
Let us find the discriminants in the complete expansion of (1), putting 
t = 2™, 


, , 
(2) Aytyitie ++ Bap Bee + A,X 47 1,2 98 Ba pe 


These A’s are all either 0 or 1, for they are found by substituting 0 and 1 for 
the z’s in (1). Suppose now that some A; = 0. This means that there exists 
a set of values (all 0 or 1) for the z’s in (1) which makes (1) vanish. Let us 
denote such a set of values of the 2’s by y’s. Then we have 


(3) I] (Gact --- +%o+°°° + I] (yictees tyna) = 0. 


Hence, in particular, 


(4) II (YW: + or + Yn a) = 0. 
i=l 


Each factor of the product (4), however, is either 0 or 1. Hence there must be 
at least one factor which vanishes. Suppose this is the first factor; then we have 


(5) Yrt--s + Yn = 0. 
Hence 

(6) Y= sss = Yi = 9, 
and 


(7) Yr tees + Yaa t+ Yor =1,:-:- xs +--+ + Bias i a = 1. 


Substituting (5) and (7) in (3), we obtain 


Oe Tia, +--+ +i 48.0 +--+ Ll Gi + --- +9.) @ @ 
t=2 i=2 


We can now repeat the argument using the next product 


(9) II (yh, +--- tye + yy) = 0, 














BOOLEAN FUNCTIONS AND POINTS 471 
obtaining, say, 


TT (us +--+ + ynee + yrs + ym) ee: 

(10) ‘=’ ; 

+ II (yet ty) = 0. 
At each step we reduce by one the number of terms in the sum (of products) 
represented in (1); at each step we assign values to n of the y’s. Hence at the 
end of r steps we shall have assigned values to all of the y’s. But r steps will 
not have exhausted the terms of (1); for (1) has 2" terms, and by hypothesis 
r <2". Hence, at the end of the r-th step, we shall be led to the contradiction 
1 = 0. Therefore it is not the case that there exists ak so that A, = 0. But 
we have said that A; = 0 or Ay, = 1. Hence, for all k, A, = 1. Then (2) 
becomes 


, , 
(11) U1,1T1,2 **> nr + cee aa Ti 11,2 eee Lair 


which equals 1. Hence, from (1), 


(12) TI] @ast--- and +--+ I Git +s +21) =140 
i=1 t=1 
for r < 2". Thus r < 2” points never determine a unique function. 
To show that it is always possible to find 2” points that determine a unique 
. function, it is sufficient to consider the 2” points 


(13) (1,---,1;1),---,@,---,0;1). 


These points clearly satisfy the first condition of Theorem I, since z;+2z; = 
1-1 = 0. Also, each product in the second condition of Theorem I will contain 
a vanishing factor, and hence will itself vanish. Thus the second condition is 
satisfied. 

For r > 2", we can choose 2” of the points as those specified in (13) and the 
others as (u, v, --- , w; 1), where u, v, ---, w are arbitrary. 

To show that for r = 2" we can always find r points which do not determine a 
unique function, it is sufficient to take r points which do not satisfy the first 
condition of Theorem I. Hence the points can be taken as in the proof of 
Theorem II. 

Summary. Given a set of r n-space Boolean points {(21,;, --- ,2n,i}20}3 
if r = 1, a function can always be found passing through the point. If r > 1, 
a function can sometimes, but not always, be found passing through the points 
of the set; with the important exception that, if the algebra is a two-element 
algebra, and the r points have distinct abscissas, such a function can always 
be found. 

If r < 2", the set of points will never determine a unique function. If 
r = 2", the set of points will sometimes, but not always, determine a unique 
function. 


THe UNIvEeRSTY OF CALIFORNIA. 











THE NULL DIVISORS OF LINEAR RECURRING SERIES 
By MorGan Warp 
1. Let 
(u) Mes Ma Mey +> > May *** 
be a particular solution of the difference equation 


(1.1) bf 


~ 


atk = C1Qne kA — sae + Qn, 


where uo, +--+, “ea, G1, °** , Ce are given rational integers and c¢, + 0. If 
all of the terms of (uw) beyond a certain point are divisible by a given integer m, 
then m will be said to be a null divisor of (u), and (u) a null sequence modulo m. 
In this case there is an integer v called the numeric of (uv) modulo m such that! 


u, = 0 (mod m), n2y», u,_1 # 0 (mod m). 


In a previous paper,? I have solved the problem of determining the numeric 
of (uv) given its & initial values, the recurrence (1.1) and the null divisor m. 
In this paper I propose to determine all of the null divisors of (u).* 

If a and 6 are null divisors, then ab is also a null divisor provided a and b 
are co-prime. It suffices then to consider only the case when m is a power 
of a prime. If p is a prime null divisor of (uv), the exponent of the highest 
power of p dividing all terms of (u) with large suffixes will be called the index 
of p in (u). If, for example, from a certain point on all terms of (uw) are 
divisible by p? but not by p’, p is of index two. 


2. My main results are summarized in the following two theorems. 
THEOREM 1. If in the difference equation (1.1) we have 


(2.1) Ce = Ce) = +++ = Ce_sa1 = O (mod p), ¢.-. # 0 (mod p), 


where p is a prime, and if d, denotes the greatest common divisor of the k — s 
consecutive terms 


Unisy Untsity * °° » Untk—-l (n = 0) 


Received March 10, 1936. 

1 We exclude from consideration the trivial common divisors of up, ui, -*- , Ue. 

2 The arithmetical theory of linear recurring series, Trans. Am. Math. Soc., vol. 35 (1933), 
pp. 600-628. This paper will be cited here as Theory. 

3 The present paper is a condensation and completion of some earlier results on the same 
subject. Cf. Abstract 22, Bulletin Am. Math. Soc., vol. 41 (1935), p. 24. Very recently, 
Abstract 11, Bulletin Am. Math. Soc., vol. 42 (1936), p. 25, Mr. Marshall Hall has given 
some results on null divisors which we shall discuss in the course of the paper. 
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of (u), then a necessary and sufficient condition that the index of p in (u) be X 
is that p* be the highest power of p dividing d,,. If d,, = 0 (mod p**'), then 
un, = 0 (mod p**'), n = (A + Ds. 

It follows from this theorem that the prime null divisors of (uw) are common 
divisors of c, and u,_;, and that if \ is the index of p in (u), then the numeric 
of (u) modulo p* is less than or equal to As. 

Let A(u) denote the k-rowed persymmetric determinant 





Mas May -** 5 Mae | 
Uy, Us, <*>, Ue | 
(2.2) A(u) = |. 
eee coe eee eee | 
| 
Uk—1y Uk, coe » Um.9 
THEOREM 2. Jf A(u) is not equal to zero, and if the coefficients c,, C2, «++ , Cx 


of (1.1) are co-prime, then for any prime null divisor p of (u), the index of p in (u) 
is less than or equal to the highest power of p contained in the first elementary 
divisor of the matrix associated with the determinant A(u). 

If A(u) vanishes, then (u) satisfies a difference equation of order® less than k. 
If ¢,, €2, --- , ce have p as a common factor, there is no limit to the size of the 
index® of p. 

These two theorems give a simple and practicable way of determining both 
all the null divisors of any given linear recurring series, and the numeric of any 
null divisor.” For if », is the numeric of (wu) modulo p’, and if p’ is the power 
of p associated with A(u) described in Theorem 2, we have the inequalities 
ve Ss S as. 


3. My proof of Theorem 1 will be based on the following 

Lemma. Under the hypotheses of Theorem 1, if mo is any positive integer, 
then d,,, = 0 (mod p) when and only when dy = 0 (mod p). 

For let A, B, C, D and E denote rational integers, and m any modulus. 
Clearly 
(i) If A = B (mod m), then’ (C, A) = 0 (mod m) when and only when 


(C, B) = 0 (mod m),. 


‘For other arithmetical properties of A(u), see Theory. On pp. 604, 608 of Theory 
the element in the lower right corner of A(u) should be u2,_2 instead of wox_1. 

5 See §5 of this paper. 

® This result is given by Mr. Marshall Hall, abstract cited. See also §5 of this paper. 

? The method for determining the numeric which I have given in Theory, pp. 614-616 
is only of theoretical interest. 

* We use the customary notation (z, y, 2, --- ) for the greatest common divisor of 
TY, 2, °*° 
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(ii) If (D, m) = 1, then (EZ, C) = 0 (mod m) when and only when 
(DE, C) = 0 (mod m). 
Now take p for the modulus m, and let 
A = Unit; B= Cylngh—1 + Cotlngk—2 ++ H Chants, 
C m (Unieri, Unt+or8, °° * » Un+k—a)s D= a., EB = t,;.. 


Then (A, C) = d,.,, (B, C) = (DE, C) and (E, C) = diyi. Also by (2.1), 
A = B (mod p) and (D, p) = 1. Therefore, by (i) and (ii), d,,, = 0 (mod p) 
when and only when d, = 0 (mod p), and the lemma is evident. 


4. Proof of Theorem 1. First, suppose that the index \ of p in (uw) is zero. 
Then by the lemma, (p, do) = 1, and conversely if (p, do) = 1, the index is zero. 
By the lemma again, if® p/do, then p/u,,n = s,s + 1,---. Therefore the 
theorem is true when A = 0. 

Assume that the theorem is true when \ = k. Then it is also true when 
» = k+ 1. For by our assumption and the theorem itself, \ > k when and 
only when u, = 0 (mod p**'), n 2 (k + 1)s. Write 


(4.1) Unt(k+1)s = pettu’ (n = 0, 1, 2, --- ). 
Then (u’) is a solution of (1.1). 


Let do = (uc., Urecaiy ++ Unge_1). By the lemma again, the index of p 
in (u’) is zero when and only when (p, d,) = 1 and if p/dj, then p/u,,n 2s. 

Therefore by (4.1), the index of p in (u) is k + 1 when and only when 
pe" |! dass, and if p*** / dasys, then u, = 0 (mod p***), n 2 (k 4+ 2)s. 
Thus Theorem 1 follows by induction. 


5. We preface our proof of Theorem 2 by some results from the algebraic 
theory of recurring series which are of arithmetical importance.” 
Let 


F(z) = 2* — ca" — cg? — .-- — 
be the polynomial associated with the recurrence (1.1). Then if 
F(z) = 2 — or" — --- —& (ry = 0,1, --- ,&), 
so that Fo(r) = 1, F.(x) = F(x), I have called the polynomial 


(5.1) U(x) = wo a(t) + WP ror) + «++ + Ue-iFo(x) 


® We write p/ x for p divides x. If the highest power of p dividing z is p’, we shall 
write p? // z. 

” Most of these results may be found in chapter XII, §II of the well known Treatise on 
the Theory of Determinants, by Muir and Metzler, Albany, 1930. 

















NULL DIVISORS OF LINEAR RECURRING SERIES 475 


the generator of the sequence (u). (Theory, p. 606.) We have in fact for 
sufficiently large values of x the identity 


(5.2) U(2)/F(2) = 3 ue/2™, 
Furthermore," 
(5.3) A(u) = (—1)**-? Res { U(z), F(zx)}. 


It is obvious from (5.2) and (5.3) that if A(u) vanishes, (uv) satisfies a differ- 
ence equation of order less than k. 
If we write the left side of (5.2) as 


z*U(z)(1 — e,/t — +--+ — oy /2*)-, 


it is also obvious that if c;, --- , c, have a common divisor m, all terms of (uw) 
beyond a certain point are divisible by m* for any preassigned value of X. 
Thus” the index in (u) of a prime dividing ¢:, --- , c, is unbounded. 


Finally, since by (5.1) 
U(r) = uer* + (uy — cytig)ah? + «++ + (Une — Cittene — +++ — Cy_ite), 
the coefficients of U(x) are relatively prime when and only when 


Uo, M1, *** » Uk 


are relatively prime. 
6. We conclude with the proof of Theorem 2. I have shown in Theory 
that the numeric of (uw) modulo p* is the least value of N such that 


(6.1) x*U(r) = 0 (modd p*, F(2)). 


But I have also shown" that for fixed polynomials U(x) and F(x) (where F(x) 
is primary and the coefficients of U(x) are not divisible by p) the congruence 


(6.2) Y(x)U(xz) = 0 (modd p*, F(x)) 


has solutions Y(x) not divisible by p only if \ S o, where p’ is the highest 
power of p dividing the first elementary divisor é of the matrix of the resultant 
of U(x) and F(z). But it is easily shown that é is also the first elementary 
divisor of the matrix of A(u), so that the theorem is established. 


1A proof of this formula is indicated in Theory, pp. 608-609. The sign, however, is 
incorrectly given there as (—1)*. The result goes back to Netto, Journal fiir Math., vol. 
106 (1895), pp. 33-49; Muir’s History, vol. III, p. 326. 

12 Marshall Hall, abstract cited. 

'S Trans. Am. Math. Soc., vol. 35 (1933), pp. 254-260. 








476 MORGAN WARD 


7. In the congruence (6.1), let us regard p, N and U(x) as unknown, and A 
and F(z) as pre-assigned. On observing that Res {x*, F(z)} = + cj, we see 
that if p divides c, then by the result just stated for the congruence (6.2) 
we can choose a polynomial U(x) not divisible by p to satisfy (6.1) provided 
that N is taken so large that the first elementary divisor’ of the matrix of 
Res {x%, F(x)} is divisible by p’. In the corresponding sequence (u), the 
index of p is = X. Hence no upper limit exists for the indices of p in all the 
solutions of (1.1) admitting p as a nuil divisor. 


CALIFORNIA INSTITUTE OF TECHNOLOGY. 


' It is not difficult to show that the power of p which divides this elementary divisor 
increases with N. 
% Marshall Hall, abstract cited, 

















THE SIMPLE GROUP OF ORDER 25920 
By J. S. Frame 


1. Among the 53 known simple groups of composite order less than one 
million,' 42 may be represented as linear fractional modular groups on two or 
three variables. There are in addition three other alternating groups, and 
three multiply transitive groups. Of the five remaining groups, three are hyper- 
orthogonal groups on three variables, whose irreducible representations were 
discussed in a recent paper.2. The other two, of orders 25920 and 979200 
respectively, may be defined as the “abelian linear groups’ A (4, 3) and A(4, 4). 
The first of these is also isomorphic to the hyperorthogonal group HO(4, 4) 
on four variables in a modular field of four marks. Thus it is the smallest 
example both of the “abelian linear group” and of the hyperorthogonal group 
on more than three variables. We are interested here in its properties from 
the latter point of view, and shall obtain the complete table of characters of 
its irreducible representations. 

The group HO(m, q*) may be defined as the quotient group with respect to 
its central of the special unitary group G,, , consisting of matrices of degree m 
and determinant 1, with coefficients from a finite field GF(q*) of qg? marks. 
Here q = p* is a power of the prime p, and “conjugate imaginaries”’ are defined 
by the equation = x’. We may think of the transformations of the group G,, 
as operating on a set of vectors in an m-dimensional space where the coérdinates 
are marks of the GF(q*). All multiples of a given vector will be said to form 


m 


a ray. The inner product, (a|b) = , * ab; , of two vectors a and b is in- 


variant under each transformation of the unitary group G,, , so that the iso- 
tropic vectors—those for which (a | a) = 0—are permuted among themselves, 
and so are the remaining non-isotropic vectors, for which (a | a) # 0. It has 
been shown’ that the permutation groups thus induced on the rays of each of 
the two types are transitive. If a single vector be selected from each ray, 
these vectors undergo a monomial substitution under the group G,, , with multi- 
pliers which, although appearing as marks from the GF(q*), may be replaced 
by suitably chosen (q? — 1)-th roots of unity from the field of complex numbers. 
It has also been shown that for m = 3 the permutation group on the isotropic 
rays is doubly transitive, and the corresponding monomial groups either have 
just two irreducible components, or are irreducible. In this way more than 


Received February 20, 1936. 
1L. E. Dickson, Linear Groups, 1901, p. 309. 
2 J. S. Frame, Some irreducible monomial representations of hyperorthogonal groups, 
this Journal, vol. 1 (1935), p. 442. 
3 J. S. Frame, Unitdére Matrizen in Galoisfeldern, Commentarii Mathematici Helvetici, 
vol. 7 (1935), p. 97. 
477 











478 J. S. FRAME 


half of the irreducible representations of HO(3, q) could be found. But the 
problem of reduction is more complicated when m > 3, since the permutation 
group on the isotropic rays is then only simply transitive, and has always just 
three irreducible components, while the corresponding monomial representations 
have either three or two irreducible components. This problem, as well as 
the problem of reducing the permutation group on the non-isotropic rays, can 
best be solved by studying the hermitian invariants of these groups. 


2. We know that the number of irreducible components of a linear group 
is equal to the number of linearly independent hermitian invariants. If we 
let the variable x, correspond to the vector a from the isotropic ray R,, we 
find that the three hermitian forms }> #22, >> fat» with (a |b) # 0, and }>z.x, 

a 


with (a ¢) = 0 are linearly independent, and are invariant under the permu- 
tation group P,, on the isotropic rays. In order to display the reducibility 
of P,, , these hermitian invariants must be transformed to diagonal form. Let 
us denote by J, J, and H respectively the matrices of their coefficients, and 
also introduce the matrix K = J + J + H, in which each element is 1. Each 
of these matrices is symmetric and has n = QnQm—1/Qe rows and columns, if 
we set Q, = gq” — (—1)". J is the unit matrix, and H is a matrix having 
h = POQm—2Qm—,/Q2 1's in each row and column, with zeros elsewhere. The 
inner product of two rows of H which correspond to the variables x, and 2, 
will be ho = Qe + G'Qm—iQm—s/Qe if (a|b) = 0, and hy = Qm—2Qm—3/Qe if 
(a b) # 0. Hence 
H? = hI + hyd + hH = (h — hy)I + (hy — h)H + WK. 


Now since HK = hK, the matrix H satisfies the minimal equation 


(H — hI) (H? — (hy — h)H — (h — hy) = 0. 


9 


The roots of this equation are h, (—q)""* — 1, and (—q)™* — 1. Knowing 
that H has in all n roots, with sum zero, and with sum of squares equal to hn, 
the multiplicities of the three distinct roots may be calculated. They are 
found to be 1 for the root h, ¢Qn—1Qm—2/Q2Q; for the root (—q)""* — 1, and 
PQ mQm—sz/Q2Q; for the root (—q)""* — 1. It is found similarly that the matrix 
J satisfies the minimal equation 


[J —-(n—1—A)]][J? — (ur —h — 29 —(h -h& — DN =O, 


9 


whose roots q?"~*, —(—q)""’, and —(—q)"~ occur respectively with the 
same multiplicities as those of H. These multiplicities are precisely the degrees 
of the irreducible components P,, . 

TueoreM 1. The representation of the hyperorthogonal group HO(m, q*) as a 
permutation group P.,, of degree QmQm—1/Qz2 on the isotropic GF-rays in m dimen- 
sions has just three irreducible components, whose degrees are 1, ¢'Qm:Qm—2/Q2Qi 
and @QnQm—s/Q2Qi respectively. (When m = 3, the last of these vanishes.) 

The q”"Q,,/Q, non-isotropie rays R,, with (a | a) # 0, are permuted tran- 
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sitively among themselves by the transformations of G,,. The subgroup leaving 
one ray invariant permutes the others in q transitive sets; except that when 
m = 3 and 3 divides q + 1, there are q + 2 transitive sets. Hence the permu- 
tation group ®,, on these rays has just gq + 1 (or g + 3) irreducible components. 
The corresponding invariant hermitian forms are the unit form }>z.27,., and 
the q forms }> #2, where (a| b)(b| a)/(a|a)(b|b) = k, and k is in turn 
each one of the g marks from the GF(q). (In the special case mentioned above, 
the form for which k = 0 splits into three separate parts.) Proceeding as for 
the permutation group P,, , we now have g + 1 different matrices instead of 
three to reduce to diagonal form. The computations are too complicated to 
be given here at length, so we shall merely state the results without proof. 

THEOREM 2. The representation of the hyperorthogonal group HO(m, q*) as a 
permutation group ®,, of degree q™"Q»/Q: on the non-isotropic GF-rays in m 
dimensions has in general q + 1 distinct irreducible components. These include: 
the identity of degree 1 and one representation of degree @Qm—1Qm—2, Q2Q: which 
are components of P,, , and in addition one of degree qQmQm—1/Q2Q: if q is odd, 
[q/2] — 1 of degree QnQm—1/Qe2 , and [q/2] of degree QuQm—1/Qi . When m ='3, 
and 3 divides q + 1, one of the last type splits into three components of equal 
degree. 

For the monomial substitutions on the isotropic rays the unit hermitian form 
is invariant, and also the form Dd eaFars , With (a|b) = 0,a # b, and suitably 
chosen coefficients ¢ = a. There is a third bilinear form }°caFaxs , (a |b) = 
k # 0, where cq is the complex (gq? — 1)-th root of unity which is made to 
correspond to the mark 1/k; but this is hermitian only when all the coefficients 
Cap are (q + 1)-th roots of unity. In this case the degrees of the three irreducible 
components are QnQmn—3/Qe, @"?Qm/Qi, and g”Q,,/Q:. Otherwise, these 
last two components are to be replaced by a single one of degree q”"—*Q,,. In 
either case the first component vanishes when m = 3. The monomial sub- 
stitutions on the non-isotropic rays have similar hermitian invariants, q + 1 
or g in number, depending on which roots of unity are used to correspond to 
the marks of the finite field. Two component representations of degree 
q”Q,»/Q, and q”*Q,,/Q; or a single one of degree g”—“Q,, are the same as 
above. The remaining g¢ — 1 components in the reduction are of degree 
q™Q,, and (¢q — 1)q”*Q,,/Qi , except for one of degree g”-?Q,,/Q, if q is odd. 


3. In order to illustrate how these reducible representations split up in 
general into irreducible components, we consider in this paper the simplest 
example of these groups, namely, the simple group HO(4, 4) of order 25920. 
We denote the marks of the GF(4) by the symbols 0, 1, w, w*, where 


1+ wo + w = 0 (mod 2). 


Then the 135 four-dimensional isotropic vectors (the null vector not included), 
of which either two components or none are zero, line up in 45 isotropic rays; 
and the remaining 120 vectors, of which one or three components are zero, 
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form 40 non-isotropic rays. Each set of rays is permuted transitively when 
transformed by the matrices of G,. In each case the subgroup leaving one 
ray R, fixed permutes the remaining rays itn two transitive sets, so the permuta- 
tion groups P, and #, each have three irreducible components. The corre- 
sponding monomial representations, two conjugate imaginary representations 
of degree 45, and two of degree 40, each have just two irreducible components. 

The operations of the group G, fall into 20 sets of conjugates. In the following 
analysis we display the matrices of 10 of the sets in normal forms generalized 
for all m, and evaluate the indices of the sets for the particular group HO(4, 4), 
for which m = 4 and q = 2. To obtain the remaining 10 conjugate sets, we 
examine the subgroups permutable with selected matrices. We denote by d 
the order of the central of G,, , and abbreviate gq" — (—1)™ by the symbol Q,, . 

(1) Only the d matrices of the central of G,,—those which correspond to the 
identity in the quotient group H,, = HO(m, q*?)—can leave m linearly inde- 
pendent isotropic rays invariant, and they leave Q,.Qn—1/Q2 = 45 isotropic, 
and g”"'Q,,./Q; = 40 non-isotropic rays absolutely invariant. These numbers 
are the respective degrees of the reducible monomial representations we are 
investigating. 

(2) The Q,.Q,1/Q: = 45 matrices of order p, of the form (4;; + ed,a;), 
where (a| a) = € + @ = 0, leave absolutely invariant the 


1+ PQn—Qm—3/Q2 = 18 


isotropic, and q”~'Q,,2/Q: = 8 non-isotropic, rays orthogonal to the vector a. 

(3) The qQmQm—1Qin—2Qm—3/Q2Q: = 270 matrices of order p, of the form 
(6;; + ab; — bja;), where (a| a) = (a|b) = (b|b) = 0, a # kb, constitute 
a single set of conjugate matrices leaving absolutely invariant 


1+ PF + F'QmsQm—5/Q2 = 5 


isotropic, and g”~'Q,,_,/Q; = 0 non-isotropic rays. 

(4) The q”?Q,,.QmiQm—2/Q: = 540 matrices of order p* (if p = 2), or p 
(if p > 2), of the form (6;; + ba; — ab; + ed,a;), where (a| a) = (b| a) = 
(b| b) + € + € = 0, constitute a single set of conjugate matrices when (b | b) # 0, 
m > 3, which leave absolutely invariant 1 + ¢°Q,,-;Q,—,/Q: = 1 isotropic and 
gq” Qm—s/Q, = 4 non-isotropic rays. 

(5, 6) The g”"'Q,./Q; = 40 matrices [66;, + 0(@-”" — 1)éc;/(e | c)], where 
(c | c) # 0, 06 = 1, 6 ¥ 1, form for each admissible 6 a set of conjugate 
matrices whose periods divide Q;. Each of these leaves relatively invariant 
with factor @ just Q»1Qn—2/Q2 = 9 isotropic rays, and q”—Q,,1/Q; = 12 non- 
isotropic rays, and also with factor 6'~™ the non-isotropic ray R, itself. There 
are Q,/d — 1 = 2 such sets in H,, . 

(7, 8) The ¢g”"Q,,.Qn:Qn—2/Q{] = 360 matrices 


[96,; + bd; 4; + 0(0 _— 1)é,¢; (e | c)], 


where ¢ + @ = (a. a) = (c|a) = 0, ea(e|c) # 0, 66 = 1, 6 ¥ 1, form for 
each admissible @ a set of conjugate matrices whose periods divide pQ; , each 
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of which leaves relatively invariant with factor @ certain 1 + @?Qn—,Qmn—4/Q2 = 1 
isotropic and g”Q,,_;/Qi: = 4 non-isotropic rays, and also with factor @!-™ 
the non-isotropic ray R, itself. There are Q,/d — 1 = 2 such sets in H,, . 

(9, 10) The @”-*QnQm1Qin—2Qm—3/Q,> = 2160 matrices 


[0(4;; ot bia; _ ajb; a €a;4;) a a(e-™ _ 1)éec; (c c)], 
where (a | a) = (a|b) = (a\c) = (b\c) = (6/6) + e€ + 2@=0, (c\e) #0, 


06 = 1, 6 * 1, form for each admissible @ a set of conjugates whose periods 
divide p*Q,;. Any such matrix can leave invariant only the one ray R,., and 
relatively invariant only those rays such as R, which are orthogonal to a, b, 
and ¢c. Relatively invariant with factor @ are 1 + @Qn—Qn-s'Q2 = 1 iso- 
tropic, and q”*Q,,,.Q2/Q: = 0 non-isotropic rays. There are Q,/d — 1 = 2 
such sets in H,, . 

We obtain the numbers of conjugates in the other 10 sets by determining 
for each set the subgroup permutable with one of its matrices. To save space, 
we let J, P, A, B denote respectively the identity, a matrix of order p, and two 
arbitrary matrices, each on 2 or m — 2 variables, and let @ and @ be multi- 
pliers ~ 1 such that 66 = ¢@ = 1. Then by [@, 6, P], for instance, we mean a 
matrix of order pQ, which multiplies the first two variables respectively by @ 
and 6, and transforms the remaining m — 2 variables by an operation of order p. 
Using this notation, we obtain the sets of conjugates as follows. 

(11) The multiplication [¢, 6, 7] is permutable only with matrices of the 
form [¢*, ¢8, A] when 6 # +1. There are Qjg,,.—2 of these in G,, , corresponding 
to Qih,—2 in H,,, if g» and h,, denote the respective orders of G,, and H,, . 
Hence the matrix (6,0, J] belongs to a set of An /hm—2Q7 = 2" Qn Qm—1/Q]i = 480 
conjugate matrices. It leaves Q,,-2Qn_,/Q2 = 3 isotropic rays invariant, multi- 
plies two non-isotropic rays respectively by 6 and @, and leaves g”“"Q,—2/Q: = 2 
others invariant. Using the various possible values of 6, we have altogether 
[q/2] = 1 such set. For g odd, we should also have a set for @ = —1, con- 
sisting of h,./h»—2g2 matrices, whose characters in the representations we are 
studying would be +Q; + Qn—2Qn-s/Qe and +(q@ — 9g) + g™ Qn 2/Qi 
respectively. 

(12) The matrix [6, 6, P] is permutable only with Qjgn—2 Qn—2Qm—3 = qQi = 
18 matrices, and belongs to a set of ¢?”"-QnQm—1Qm—2Qm—2/Q? = 9Q,Q,Q2/Qi = 
1440 conjugates. It leaves one isotropic ray invariant, and two non-isotropic 
rays relatively invariant with factors @ and @ respectively. There is [q/2] = 1 
such set in H,, . 

(13) The multiplication [6/7, 67] is permutable only with matrices of the form 
[A, B]. There are g»—292Q,/d = @Q3Q,/d = 108 of these in H,,. Hence there 
are hyd /gm—goQs = @"-*Qn.Qm—1/Q2Q; = 240 matrices in this set of conjugates. 
Q, isotropic rays and g? — gq non-isotropic rays have the multiplier @, and 
(when m = 4) an equal number have the multiplier 6. There is [¢g/2] = 1 
such set in H,, . 

(14) The matrix [@P, 6P] is permutable only with (¢Q1/d)gn—2Qi1/Qn—2Qm—s = 
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12 matrices of H,,. It belongs to a set of g?"-*QnQmsQm—2Qm—3/Q] = 2160 
conjugate matrices whose periods divide pQ;. Two isotropic rays are rela- 
tively invariant, with the respective multipliers @ and 6. No non-isotropic 
rays are invariant (for m = 4). There is [qg/2] = 1 such set in H.,,. 

(15, 16) The matrix [@P, 67] is permutable with just g,.—29Qi/d matrices 
of H,,. Hence there are ?"~“Q.Qn1/Q: = 720 matrices in the corresponding 
set of conjugates. One isotropic ray is relatively invariant with multiplier @, 
and Q, with multiplier 6. Also g? — q non-isotropic rays are multiplied by @. 
There are Q,/d2 — 1 = 2 such sets in H,,, if de is the h.c.f. of Q, and 2. 

(17) The eyclie permutation T: (x;72%;2,) is permutable with the subgroup 
H, of order 8 generated by itself and the matrix (1 — 6;;). It leaves invariant 
one isotropic ray and no others, and belongs to a set of 3240 conjugates. 

(18) The matrix 


is of order 5, and can be shown to be permutable only with its own powers. It 
leaves no rays invariant, and belongs to a set of 5184 conjugates. The matrices 
S and T? generate the entire group HO(4, 4) of order 25920.4 

(19, 20) The matrix ST? is of order 9, is permutable only with its own powers, 
and is therefore one of 2880 conjugate matrices. It leaves no isotropic rays 
invariant, but multiplies the vector (1, 0, w, w?) by w(= @). Its inverse belongs 
to another conjugate set with multiplier w (= 8). 

We now have accounted for all the 25920 matrices of HO(4, 4) in 20 sets of 
conjugates, and have found the characters of six reducible representations of 
degrees 45 and 40. They were seen to be the foliowing, where @ is in turn 
each of the three cube roots of unity, 1, w, and @: 


45, 13, 5, 1, 90, 98, 6, 6, 6, 6, 3, 1, 36 + 36, 6 + 6, 36 + 6, 6 + 36, 1, 0, 0, O; 
40, 8, 0, 4, 1+ 126, 1 + 126, 1+ 40,14 46, 1,1,2+ 6+ 6,6 + 6, 26 + 26, 0, 20, 20, 0, 0, 0, 6. 


These six representations can be split into nine distinct irreducible components 
by applying the theory developed in §2._ The two representations corresponding 
to permutation groups have in common the identity representation and one 
other irreducible representation of degree ¢@°Q»—:1Qm—2/QeQ: = 24, and their 
remaining irreducible components are of degree 20 and 15 respectively. The 
four monomial representations are conjugate imaginary in pairs. Those of 
degree 45 have a common real component of degree QnQn—3/Q2 = 15, and the 
remaining components of degree 30 are also found as components in the mono- 
mial representations of degree 40, whose second components are of degree 10. 


4H. R. Brahana has already given two such generators for the simply isomorphic group 
A(4, 3). See Annals of Mathematics, vol. 31 (1930), p. 533. 
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The arithmetical properties of tables of characters now enable us to deter- 
mine the characters of all the irreducible representations. For instance, the 
conjugate set (18) can have only five characters different from zero—rational 
integers the sum of whose squares is 5—which can only be +1. Hence only 
these five of the 20 irreducible representations can have degrees not divisible 
by 5. If these be 1, 24, x, y, z, we shall have 1 — 24 +2 +y+2 = O, where 
+x = +y = +2 = 1 (mod 5), and g, y, z each divide 25920. The choice 
x = 4,y = 54, 2 = 81 leads to incompatible conditions for the characters in 
the conjugate set (2); the choices x = 9, y = 4, 16, or 64, z = 36, 16, or 96 
do likewise for the characters in the conjugate set (5). There remains only 
the choice x = 64, y = 81, z = 6. Since there are no other representations of 
these degrees, the characters must be integers, and are found to be uniquely 
determined from the table. The degrees of the remaining 8 irreducible repre- 
sentations are all divisible by 5, and it can be shown that half are divisible 
by 2, half by 3, and an odd number by 4. Since the sum of the squares of 
these degrees is 11800, they can be determined uniquely: 5, 5, 40, 40, 45, 45, 
30, 60. After some further computations of a similar nature, the complete 
table of characters can be filled in, as shown on page 483. Each row corresponds 
toa complete set of h, conjugates, and each column to an irreducible representa- 
tion. The numbers hy, are given on the left, and h/h, on the right. 


Brown UNIVERSITY. 




















ABEL-POISSON SUMMABILITY OF DERIVED CONJUGATE FOURIER 
SERIES 


By A. F. Moursunp 


1. Introduction.' In this paper we give theorems concerning the Abel- 
Poisson summability of the r-th, r = 0, 1, 2, --- , derived series of the conjugate 
series of the Fourier series.2. These theorems may be considered as extensions 
of theorems given by B. N. Prasad,’ A. Plessner,* and others for the sum- 
mability of the conjugate series and its first derived series. 


2. Notation. Throughout this note we assume that the function f(x) is 
Lebesgue integrable on (— 7, 7) and of period 27. The letters r and p always 
represent positive integers or zero, and the letter K represents a positive absolute 
constant which need not be always the same even in a single discussion. For 
convenience we designate a fixed value of x by x instead of the usual 2. 

We set 


(1) p(v, s) = (1 + s* — 2s cos v), P(v, s) = sp(v, s) sin v; 
r+pt+l 
(2) M,™ (v, 8) = otte*t oti {P(v, s) — 1/2 cot v/2}; 


z (—b, cos nx + a, sin nx)s" (0s s <1), 


n=1 


(3) V(s, x) 


Received September 5, 1935; presented to the American Mathematical Society, Sep- 
tember 13, 1935. 

' The author is indebted to the referee for many helpful suggestions. 

2 No theorems concerning the Abel-Poisson summability of the r-th, r > 1, derived 
conjugate series appear in the literature. For theorems concerning the summability of 
such series by other methods see the following papers: A. F. Moursund, On summation 
of derived series of the conjugate Fourier series, Annals of Mathematics, (2), vol. 36 (1935), 
pp. 182-193, and American Journal of Mathematics, vol. 57 (1935), pp. 854-860; A. H. Smith, 
On the summability of derived conjugate series of the Fourier-Lebesgue type, Bulletin of the 
American Mathematical Society, vol. 40 (1934), pp. 406-412; A. F. Moursund, On the r-th 
derived conjugate function, Bulletin of the American Mathematical Society, vol. 41 (1935), 
pp. 131-136. Since Abel-Poisson summability follows from Cesaro summability it is possible 
to obtain theorems which are similar to, but less general than, some of the theorems of 
this note from Theorem 6.4 of the second paper cited here. Theorem | does not follow 
from any of the results of papers listed here. 

3B. N. Prasad, Contribution aU étude de la série conjuguée d’ une série de Fourier, Journal 
de Mathématiques Pures et Appliquées, (9), vol. 11 (1932), pp. 153-205. This paper gives 
a long bibliography. 

4A. Plessner, Zur Theorie der konjugierten trigonometrischen Reihen, Mitteilungen des 
Mathematischen Seminars der Universitit Giessen, Heft 10 (1923). 
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where a, and b, are the Fourier coefficients of f(x); 


[(r—1)/2] 


AM(r) = f(z + v) + (-—D fle —») —2 YO argv, 
Ce . a 
A?*(y) = 1/p! [ (v — t)PA\(t) dt, 


where a,_1, @_2,---, @-1-%, 0 S 22 < r — 1, are arbitrary numbers; 
|r/2]-1 {[r/2]}-1 oj—9; > 
gti-ti pitt 
C, = aa. (¢ — 1 — 2) ——nnenens aoe OO o/ZD 
ee  Qj+1= daar a 
(5) 
(r) [(r+p)/2]-1 oi . (ith 
CC’)? = —C 1/2 Art Frey) —- 604 8/2 : 
7 slates Pe (e) dy © - ree 


In (4), (5), and elsewhere in this note, a summation is to be interpreted as 
zero whenever the upper limit is less than the lower limit. Thus Co, Ci, 
and C‘° are equal to zero, and 


(6) AV) =f(@ +r) —f@—v), AYO) = fle +0) + f(x — 0) — aw. 
We find it convenient to set 

(7) ¢ = arc sin (1 — s) (O<s <1), 

and write o(1) as e — 0 for o(1) as s > 1. 


3. Results. Our principal results are given by the following theorems. 
THEOREM 1. Jf 


(8) A’?*) (ny) = o(vttrt), as v > +0, 
then 
ar wa r+p "8 [rtp 
(9) es V(s, x) + (- _ | A‘(v) oe cot o/2dv-4C\”, ase 08 
5 An equivalent definition for A'?*)) (y) is given by 
A(P+1)(y) = i A‘) (t) dt (p = 0, 1, 2, «+ ), 
0 


i.e., A°’*)) @) is the (p + 1)-fold integral of A‘°)(v). For either definition A\’*'? (v) is 


the g-fold integral of APT l—a yy 


§ The condition A (p+1) (yy = o(v"*?*!) is equivalent to the condition 


[ ACP(t) er? dt = o(v). 
0 


See Smith, loc. cit., pp. 411-412, for a proof of the equivalence of two similarly related 
conditions. By setting r = 0, p = 1, in Theorem 1 we obtain Prasad’s Theorem 3 (loc. 
cit., p. 173) for the summability of the conjugate series; and by setting r = 1, p = 0, we 
obtain Plessner’s Theorem 3 (loc. cit., p. 4) for the summability of the first derived series 
of the conjugate series. Plessner’s Theorem 1 for the conjugate series is included by 
Prasad’s theorem. 
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THEOREM 2. Jf 


(10) lim [ 4‘”)(y) cot v/2 dv 
3 ri dv"+? : 


5—+0 
exists, then (8) holds and accordingly 


(—1)rteHt 


(1) “~V(e,2)> lim | Av) cot v/2 dv + C, 
Ox” : 3 a dyt+P 


6-++0 
ass—1 — 0. 

THeoreEM 3. Jf r = 1 and d*'f(x)/dx* is of bounded variation, then (10) 
and consequently (9) and (11) hold for every p and almost all x with 


dif (x) /dxt!-* 

(r — 1 — 22)! 

Tueorem 4. Jf r = 0 and f*(2), the generalized derivative of order r + 1 
of f(x), exists at the point x [i.c., if f(x) satisfies an equation of the form 


f(z +) + (-)D"Y (a — ») 
2 





Qi131 = 





I(r+1)/2] ” +1—2i ( pth 
= (r+1—2i) 

2, S00) a + OO) Ea 
where w(x, v) — 0, as v — +0], then (10) and consequently (9) and (11) hold for 
every p with a,1-9 = f° (x)/(r — 1 — 22)!. 


4. An expression for a’V(s, x)/ax’. In this section we obtain the expression 
for a’°V(s, x)/ax" used in the proof of our theorems. 
Lemma l. For0 Ss < 1 


nos 2k 
P(v, 8s) = DS s*sin nv; ae P(v, s) = 0] : 
n=1 ov veO, 3 
and 
grktt gett ; 
ape P(v, 8) = § 2 jpeti cot v/2 + o(1), ass — 1. 


Proof! For v # 0 


gzkt! 


{P(v, s) — 3 cot v/2} 


oven 
1 — 5)? ge 

-* 2 : ape {p(v, s) cot v 2} = o(1), ass— 1. 

7 For a proof of the first part of this lemma see T. J. I’a. Bromwich, Theory of Infinite 


Series, 1926, pp. 186-187. The second part follows from the first part, so we prove only 
the third part. 
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Lemma 2. Forr = 0 


_yyren fr rch sr 
(12) DD aso tt ™ Pe, ») do = — C, + of), 
- | 


0 1=0 


as s => 1. 


Proof. Integrating by parts and using Lemma 1, we have 


7 r r—2i-1 = ? ! r 
i pri ~ P(v, s) dv Z (—1)" a”~ ~ Plo, s) (r ~ — 21)! yt a] 


j=1 av" (r— —j)! 0 
{[r/2]-1 2+ y . 
a (r—1—23)! 2... 
= (—1)" 2 ; mod » 2k —2i+1 
p> ay%*+! J tv | rer (2k — 2i + 1)! 
1!" 2}-1 (—1)" (r a 1 — 9)! ee gzktt 
“Se a tL 408 eet 


The lemma follows when we multiply by the proper constant factor and sum 
with respect to 7 between 7 = 0 and [(r — 1)/2]. The term which is not in C, 
introduced when 7 is odd vanishes. 

Lemma 3. Forr = 0 


r rn r+-p+1 [{F +; 
(13) *. V(s, x) = (1) : oa 4°” ww - P(v, 8) dv +C\” + 0(1) ass—1. 
Ox" T 0 ouv"*P 


Proof. Upon differentiating (3) r times, taking into account the fact that 
f(x) is of period 27 and simplifying, subtracting the left member and adding 
the right hand member of (12) (Lemma 2), integrating by parts p times, and 
using Lemma 1 we have 











8,z) = cos nx + a, sin nr) «\ 
ae! 
ow : S(v)P(v — 2, 8) iv} = crf f(a + 2 — ~ Pl, 8) dv 
= f : rl s} av p» » os) de 
rape ee iy fe + +v) + (-1) fia — Bp a Pah 
a 1m f A‘"'(v) # P(v, s) dv — C, + o(1) ass— 1 
us 0 ou’ 


_4)rtt 
% = {e (- A\(0) S = ~ Plo, of 


7=1 


+ = A‘”)(v) — a P(v, 8) iv} — C, + o(1) ass—l1 


(—1)rtett (p) (p) 
ee * 4( ’ (v) a — P(r, s) dv + C'\”’ + o(1) as s > 1. 
T 0 
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5. Final lemmas. We use the following lemmas in proving our theorems. 
The reader can readily supply the proofs and parts of proofs which are omitted. 
Lemma 4. For0 <v<S-r 


d [(r—1)/2] 
£ cot v/2 = (—1)" [ ayer cot™+ty/2 + = a; cot "2p, 2], 
' 1=0 


ais d* 
where the a’s are positive constants. Hence — cot v/2 = 0 when v = z. 


dv* 
Lemma 5. For0 < v S wand0 S s < 1, the function v'** A p(v, s)/adv' is 
uniformly bounded. 
Proof. Obviously 


0 
re p(v, s) = v*{(1 — s)? + 48 sin® v/2}-! < K. 
Assuming that the lemma is true for 7 = 0, 1, 2, --- , n — 1, we have forj = 


| n—l | 
"ded wi s)| | = | ae a {sin v[p(v, s)]?} — 





a - dv™ n—1 
n—1 | ~=j—¢ 
di d” 1-i 
<u SK | —— {sin vp(v, s)} 7 p(v, s) 
Fem 4 dyi ym i 


n—1 | qi 
.. Je 
prt? } & /e a {sin v- p(v, s)} | 


i=0 


IIA 


n—1l 


yt? > K/ytt- | sin 0 plo, s) | +t xK 
=1 


1=0 


” 


di-* | 
= sin v apne pv, 8) \ <K 





The lemma follows by mathematical induction. 
Lemma 6. For 0 Sv S rand0 <8 < 1, &*! @'P(v, s)/dv" is uniformly 


bounded. 
Proof. Using Lemma 1 we have 


<< ae 


n=1 (n — -1)! 


= et. s.r! (1 — 8)-@+) < K. 


, a 2 ' 

r n— 1 

ot! = po,s)| set D wer sen y Cte, 
1 | n=1 


Lemma 7. For0 < v S rand0 Ss <1, the function v"**a"} p(v, s) cot $v} /au’ 
is uniformly bounded. 

Proof. Upon differentiating the product by Leibnitz’ formula the lemma 
follows from Lemmas 4 and 5. 

Lemma 8. For0 < v Ss rand0 < 8s <1, the function v'«?M”(v, 8) is uni- 
formly bounded. 

Proof. Since 


_(—s) garter 


M‘”(v, 8) = a in {p(v, s) cot v/ 2} -vrte*! 
prt + 


the reader ean establish the lemma with the aid of Lemma 7. 
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6. Proof of Theorem 1. Upon making use of the expression for 
art! V(s, x)/av"*! 


given by (13) [Lemma 3] we see that proving Theorem 1 is equivalent to showing 
that when (8) holds 


"4 Oy 
(14) i Py : 


as e — 0. 
Upon integrating both of the integrals in (14) by parts and using Lemmas 
1 and 6 when r + p is odd and Lemmas 1 and 4 when r + p is even, we have 





y— sf" AM) —— ~ cot 3u dv — 0, 





-[ ayes (v ) gurtett ~~, Plo, 8) dy ‘sii gAPT (YD s du rtp =, cot io 
+1 
+ sf APT (y ) 5 cot dy dv + o(1) 


= - [as y. ty 2 s) — 4 cot 4v} dv 


re — gr+p+ 

-| Aleth(y) ~ te, sdvet+h. 
1) ov'tPt ! 

By Lemma 8 


I,| = | {Ayr '(v) rey M (0, 8) de) 


IA 


6 r 
{o(1) as 6 > O] i] Kev dv + ral ev dv 
€ 6 


= [o(1) as 6 — 0] + [o(1) as « > 0, for a fixed 4]; 


and by Lemma 6 





€ r+pt+l } { 
_ f Aber 1 (v) prtptt } {iptnss a 2 P(v s) dv 
0 ee rN oy tert f 





our tpt 


¢ | r+ptl 
= [o(1) ase —> af e7! | ert pt? ai P(v, 8) | dv = o(1) ase — 0. 
0 i 


Consequently (14) holds and Theorem 1 is established. 


7. Proof of Theorem 2. ‘To establish Theorem 2, it will be sufficient to show 
that (10) implies (8), for (8) implies (9), and (9) and (10) imply (11). 
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When (10) holds 
APtD(y) = [es cot a — 
0 dv'* Pp 
wie 
0 \ t 


= o(y"trtt), 














cig 


dr » r+p \-1 
dutt? cot 3u au nae cot $t> an 





as a consequence of Lemma 4; for when (10) holds, 
+p 
ia A’, <i (u ) ai cot du du— 0, as v, {— +0. 


8. Remarks about Theorems 3 and 4. The existence of the limit in (10) 
for p = q implies its existence for p > q, but the converse is not necessarily 
true. When the a’s are chosen in terms of the generalized derivatives as in 
Theorem 4, the existence of (10) for r = gq implies its existence® for r = q — 2i, 
0 < 2% < gq. The existence almost everywhere, when d*~'f(x)/dz" is of 
bounded variation, of (10) with the a’s chosen as in Theorem 3 can be made to 
depend on Plessner’s proof” for the case r = 1, p = 0. 


THE UNIVERSITY OF OREGON. 


8 Prasad, loc. cit., discusses the relationship between the cases r = 0, p = 0 andr = 0, 
p=1. See p. 178. 

® Moursund, first citation, Theorem 9.2. A proof of the existence of the limit in (10) 
with p = 0 at a point where the (r + 1)-th generalized derivative exists is given in the paper 
here cited. 

% Plessner, loc. cit. See Moursund, first paper cited, sections 7 and 9; second paper 
cited, section 5. Footnote 7 of the second paper corrects an error which appears in the 
first paper. 











A CONSTRUCTION FOR PRIME IDEALS AS ABSOLUTE VALUES 
OF AN ALGEBRAIC FIELD 


By SAUNDERS MacLANE 


1. Introduction. The difficulties of actually constructing the prime ideal 
factors of a rational prime p in an algebraic field have had a considerable influ- 
ence upon the development of ideal theory. One of the most practical of the 
methods for this construction consists of three successive “approximations” to 
the prime factors of p in terms of certain Newton Polygons, similar to the 
polygons used in the expansion of algebraic functions. This method, due to 
Ore,' is directly applicable in all but certain exceptional cases. The present 
paper extends the method to all cases by making not three but any number 
of successive approximations. To formulate this extension simply, it is neces- 
sary to replace the prime ideals by certain corresponding “absolute values”’, 
which succinctly express the essential properties of the Newton polygons. In 
terms of these values, the successive approximations are a natural application 
of a method of finding possible ‘‘absolute values” for polynomials. 

To introduce these absolute values, consider the ring 0 of all algebraic inte- 
gers of an algebraic number field, and let p be a prime ideal in 0. Since every 
integer a of the field can be written in the form (a) = p”-b6, where 6 is an 
ideal prime to p, we can write the exact exponent m to which p divides @ as a 
function Wa = m. Because of the unique decomposition theorem, 


(1) W(a-8) = Wa + WB, W(a + 8B) 2 min (Wa, WB). 


Any function Va which has these two properties is called a non-archimedean 
ooo 


value or a “Bewertung’” of the ring 0, while the particular function W obtained 
from p may be called a p-adic value. Every value V of 0 is a constant multiple® 
of some p-adic value W. Hence absolute values can replace prime ideals. 


Received February 10, 1936; presented to the American Mathematical Society, Decem- 
ber 31, 1935. 

10. Ore, Zur Theorie der algebraischen Kérper, Acta Math., vol. 44 (1924), pp. 219-314; 
O. Ore, Newtonsche Polygone in der Theorie der algebraischen Kérper, Math. Annalen, 
vol. 99 (1928), pp. 84-117. These papers will be cited as Ore I and Ore II, respectively. 

2W. Krull, /dealtheorie, Ergebnisse der Mathematik und ihrer Grenzgebiete, Bd. 4, 
Heft 3. This text, cited henceforth as Krull I, contains further references on absolute 
values. 

3E. Artin, Ueber die Bewertungen algebraischer Zahlkérper, Jour. fiir Math., vol. 167 
(1932), pp. 157-159. The theorem may be proved thus: Given V, first show that any 
rational integer n = 1 + 1 + --- + 1 has a non-negative value and then from (1) that 
every algebraic integer has a non-negative value. If the value of an ideal b be defined 
as the minimum of Va for @ e 6, then one and only one prime ideal p can have a positive 
value, and V must be p-adic. A similar theorem holds when 0 is an abstract ring in which 
the usual prime-ideal decomposition holds. (B. L. van der Waerden, Moderne Algebra, 
vol. 2, §100.) 
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In the same way every non-archimedean value V» of the rational integers 
is a “p-adic’’ value for some rational prime p; that is, for any integer a, Voa 
is mé, where m is the highest power of p dividing a and 4 is a constant >0. If 
yp is a prime ideal factor of p in an algebraic field, every p-adic value W, con- 
sidered only as a value of the rational integers, coincides with one of the p-adic 
values Vo. Thus W is an “extension” of Vo. 

The equivalence of prime ideals to values enables us to state the problem 
of constructing the prime ideal factors of a rational prime in the following gen- 
eralized form (with a notation to be used throughout the paper): Given a field K 
and a separable extension K(@) generated by a root @ of the irreducible polynomial 
G(x); given also a “‘discrete”’ (see §2) value Vo of K, to construct all extensions W 
of Vo to K(@). 

This problem will first be reduced in §2 to that of constructing for the ring 
of polynomials with coefficients in K those values V which are extensions of Vo 
and which assign the defining equation G(x) the value +. All values of this 
polynomial ring can be constructed* by successive approximations, which con- 
sist essentially in determining first the values of the polynomials of lowest 
degree (in x and in p). The salient features of this method are summarized 
in §2. Those approximations which can ultimately give G the desired value 
+c we call “approximants” to @ (see §3). Each such approximant is itself 
a value V, of the polynomial ring and can be constructed from a previous 
approximant V;,_, by using a unique “equivalence” decomposition of G(x) 
(see §4) and a ‘“‘Newton polygon” of G(x) with respect to V._; (see §5). After 
a finite number of steps (§8) we obtain a set of approximants corresponding to 
the desired values or prime ideals of K(@). The proof of this fact uses the 
integers of K(§7) and the exponents of prime ideals (§6). The computation 
of the degrees of prime ideals in §9 yields a constructive proof of the usual 
relation between degrees and exponents. Finally, the theorems of §10 sum- 
marize the results. A comparison with previous methods is also made. We 
note that some of the concepts resemble those used by Ostrowski> and by 
Deuring and Krull® in the (non-constructive) theory of Galois fields with 
absolute values. 


2. Non-finite values of polynomial rings. A non-archimedean exponential 
absolute value of a ring S is a function V, such that, for a in S, Va is a uniquely 
defined real number or + ~, with the properties 
(1) V(ab) = Va + Vb, V(a + b) 2 min (Va, Vb) 


4S. MacLane, A construction for absolute values in polynomial rings, to appear in the 
Trans. Amer. Math. Soc. Cited henceforth as M. All theorems from M required in the 
sequel will be explicitly stated, so that we refer to M only for certain proofs. 

5A. Ostrowski, Untersuchungen zur arithmetischen Theorie der Kérper (Die Theorie 
der Teilbarkeit in allgemeinen Koérpern), Math. Zeit., vol. 39 (1934), pp. 269-404. 

®M. Deuring, Verzweigungstheorie bewerteter Kérper, Math. Ann., vol. 105 (1931), pp. 
277-307. 

W. Krull, Galoissche Theorie bewerteter Kérper, 8S. B. Miinchen Akad. Wiss., 1930, pp. 
225-238. 
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for alla and bin S. These properties are called the “product” and “triangle’”’ 
Jaws respectively. If we exclude the trivial cases when Va = 0 for all a or 
Va = ~ for all a, the laws (1) imply that V(1) = V(—1) = 0, that the equality 
in the triangle law of (1) must hold whenever Va # Vb, and that V(0) = + ~<. 
Contrary to previous usage, our definition allows elements not 0 to have the 
value +. However, if Va # ~ for alla # 0, we shall call V a finite value. 
Since V(a~') = —V (a), every value V of a field must be finite. A value V is 
discrete if every Va is an integral multiple of some fixed 6 > 0. The original 
value Vo of K is discrete by assumption. 

Two elements a and b of S are equivalent in V if and only if either V(a — 6) > 
Va = Vb or Va = Vb = «. We write a ~ b for this equivalence. It is a 
reflexive, symmetric and transitive relation. An element a is equivalence- 
divisible by b in V if and only if there is ac in S such that a ~ chin V. For 
this divisibility we write b/a (in V). 

A value V of a ring S is an extension of a value Vo of a subring of S if Va 
and Voa are identical for all a in the subring. Our original problem can now 
be reduced to one concerning the polynomial ring A[z], which consists of all 
polynomials in x with coefficients in K. 

THEOREM 2.1. There is a one-to-one correspondence between the values W of 
K(@) and those values V of K[x] for which VG(x) = ~. Corresponding values V 
and W are extensions of identical values of K. 

The proof depends on the homomorphism of K[z] to K(@). If the value V 
with VG(z) = = is given, two polynomials congruent mod G(x) must have 
the same value, so that the value W for any f(@) can be defined by Wf(@) = 
Vf(xz). The same equation serves to define V when W is given. 

The method of the paper M for constructing finite values of K[z] applies 
without essential change for non-finite values. It consists fundamentally in 
the formation of a sequence of simple values 


(2) i ee, Ae ee 


To obtain any V;, in (2) from the preceding V,_1, we assign a new value yy 
to a suitable polynomial ¢, = ¢;(x). The following conditions’ must hold: 

2.21. dy is equivalence-irreducible in V,x_,; that is, ox/(f(x)-g(x)) in Vir 
always implies ¢;/f(x) or ¢:/g(x) in Vi-n ; 

2.22. o, is minimal in Vy_,; that is, d%/g(x) in V,_; always implies that 
deg o. < deg g(x); 

2.23. o, has the leading coefficient 1 and deg ¢, > 0; 

2.24. Mk > Vin di. 

When these are true, we call ¢, a key polynomial and yu, a key value of o; over 


7 Functions f(x), g(x) or simply f and g, etc., will always represent polynomials in K[z], 
while deg f(x) stands for the degree of f(z). If f = 0, deg f is meaningless, and state- 
ments about deg f are taken to be vacuously true. 
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V1. Given such “key” quantities the new value V, of any polynomial f(z) 
is determined from V,_, by first finding the expansion of f(x) 


(3) f(x) = Sn@@)OE + faa(Z)OT™ + --- + holz), deg f(z) < deg dx 
in powers of ¢(z) with coefficients of degree less than that of ¢, then setting 
(4) Vi f(z) = min (Vi-ifm + mue, Vi-rfn—i + (m — Ime, --- , Ve-rfol. 


The so-defined function V; is always a value of K[z]. We say that V;, is obtained 
by augmenting V;._,, and write 


(5) Vi = (Vi-r, Vibe = uel. 


To apply the condition 2.22 it is convenient to note (M, Theorem 9.3): 

2.3. The polynomial f(x) with the expansion (3) is minimal in V;, if and only 
if f,,(z) is a constant from K and V,f(x) = V,(f,,(2)¢7). In particular, the 
product of two minimal polynomials is itself minimal. 

The construction of any value V of K[z] starts with a “first stage’ value V; 
which is defined as in equation (4), except that the first key polynomial ¢; 
is now taken to be z itself and yw; is arbitrary, while the value V,_, used for the 
coefficients f;, which are now constants, is simply the originally given value 
V, for K. Given such a V,, new values can now, be defined by repeatedly 
augmenting V;. A sequence (2) in which each V; arises by augmenting V ;_; 
with a pair of keys ¢; and uw; from V;_; is called an augmented sequence. Each 
V, of such a sequence is an inductive value, and may be symbolized as 


(6) Vi = (Vo, Viz = wi, Vode = me, -++ » Vibe = mel. 

We assume in addition the conditions (M, Definition 6.1) 
2.41. deg ¢; = deg ¢:-1 (Gi = 2,3, ---); 
2.42. oi ™~ Oi-1 in Vi-n is false (7 = 2, 3, eee ). 


The last key value wu; may be + ~, but then there is no key over V;, satisfying 
these conditions, so that no further augmented value is possible. An infinite 
augmented sequence (2) also gives a limit value, defined by 


(7) Vif(z) = lim Vif (x) (for all f(x)). 


We will consider only those inductive or limit values which are extensions of 
the originally given Vo. 

To put the values of A[z] in a normal form, we first choose in K a complete 
set of “representatives” with respect to Vo, such that each element of K is 
equivalent in V» to one and only one representative. If next the coefficients 
of the expansion (3) are expanded repeatedly with respect to @x-1, dx-2, --- 
then f(x) is expressed uniquely in the form 


(8) f(z) = > a;o)"' b>” vee QM (a; eK). 


i 
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The exponent m,; is always less than (deg ¢;,1)/(deg ¢,), for 7 = 1,---,k — 1 
(see M, §16). If all terms in (8) have the same value in V;,, and if each a; 
is one of the previously specified representatives, then f(x) is in a sense homo- 
geneous in Vy. Any polynomial is equivalent in V, to a homogeneous poly- 
nomial. Henceforth we require in any inductive or limit value (6) that each 
¢@; be homogeneous in the previously constructed V;_;. Then, since the given 
Vo is discrete, every extension of Vo to K{x| can be uniquely represented as an 
inductive or limit value (M, §8, §16). 


3. Approximants to non-finite values. Our program requires the construc- 
tion of values V of A[x] for which VG(x7) = x. Any such V can be obtained 
from a sequence of suitable inductive values V,. A V, which might be so used 
to construct a V with VG = ~ will be called an “approximant’”’, in an explicit 
sense now to be given. This involves the way in which V;G@ increases in a 
sequence of inductive values V;, 7 = 1, --- , 4. This increase is described by 
M, Theorems 5.1, 6.4, and 6.5, for any f(x) and any i # k: 


3.11. Vis = Vif; 

3.12. Vif > Vf if and only if ¢:,:/f (in Vi); 

3.13. Vidi = Vidi, and Vif = Vif whenever deg f < deg $,,:. 
Further analysis uses the expansion of G(x) in ¢,: 


(1) G(x) = gulx)Oe + gmarlaoe + --- + golz). 


Among the exponents j for which V,G = V,(g;¢%), let a be the largest and 8 
the smallest. The difference a — 8, which depends on both V, and G, will 
be called the projection of V; (symbol: proj V,). One application is 

Lemma 3.2. Jf proj V. = 0, then no V with VG > VG can be obtained by 
augmenting V,. 

Proof. The value of each term in (1) is by 3.13 the same in any V as in V;. 
By hypothesis there is but one term of minimum value, so that the triangle 
law (§2, (1)) proves VG = V(g.¢{) = VG. Since we want only those values 
V, leading to VG = =, we are led to 

DEFINITION 3.3. A k-th approximant to G(x) over Vo is a k-th stage homogeneous 
finite inductive value of K{x| which is an extension of Vo and which has a positive 
projection. 

Lemma 3.4. If Vi, given as in §2, (6), is a k-th approximant to G(x), then so 
is V; fori = 1,---,k —1. Furthermore $;/G(x) in Vi_; and 


V.G(@) > VarG(x) > --- > ViG(2). 


First note that in the expansion (1) of G(z) 


(2) ViuG = min [ViaiQgnd?), Vi-rQGmarde), --- » Va-rgol, 
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much as in the definition of V;.. For were V,_,G@ to exceed the indicated mini- 
mum, then by the triangle law V,_:(g,¢;.) would equal this minimum for at 
least two 7’s. Were y the largest such 7, then 


919) ~9adk' + ++ +9 Fags. 


Then ¢} would be an equivalence-divisor of the polynomial on the right, which 
is of smaller degree than ¢7 , a contradiction because ¢ and hence ¢} is minimal 
(see §2, 2.3). 

By hypothesis proj V; > 0, so that there is an a > 0 with ViG = V,(g,¢%). 
As Vi-ids < Vidz, we have by (2) and 3.13 


VinG S VisGadt) < ViGgadi) = ViG. 


Hence by 3.12 ¢;/G in V,_,, and the remaining conclusions follow by Lemma 
3.2. Another useful fact is 

Lemma 3.5. Let a(x) be a minimal polynomial in V;., and r(x) the remainder 
of the division of a polynomial f(x) by a(x). Then Vir > Vif if and only if 
a(x) / f(x) in Vy. 


The proof is exactly like that of M, Lemma 4.3. 


4. Unique equivalence-decomposition. The construction of an approximant 
Visi from a given approximant V; must by Lemma 3.4 use a key polynomial 
¢x41 Which is an equivalence factor of G(x). These factors can be found from 
the unique equivalence-decomposition of G(x), the existence of which will now 
be established by a modified euclidean algorithm.’ We first introduce for any 
V; an “effective degree’”’ thus: if f(x) is any polynomial, expanded in powers 
of ¢ as in §2, (3), the largest exponent 7 for which V;.f = V;.(f,¢;) is the effective 
degree of f in ¢, and is denoted by D,f. Equivalent polynomials have the 
same effective degree. The proof of the product law (§2, (1)) for any inductive 
V, (see M, §4, end) shows that 


(1) Ds(fq) = Def + Dog. 


If we call a polynomial of effective degree zero an equivalence-unit, then e(x) 
is an equivalence unit if and only if there is an “equivalence-reciprocal” h(x) 
such that e(x)-h(x) ~ 1 (in V,). For if e(x) has such a reciprocal, then (1) 
proves that D,e = 0. Conversely, if Dse = 0, then, by definition of D,, e(x) 
is equivalent to the last term e(x) in the expansion of e in powers of @. As 
deg eo < deg ¢, 0 is prime to ¢, so that there are polynomials g(x) and 
h(x) with g(x)o, + A(x)eo(x) = 1. Using the minimal property of ¢; , we then 
conclude that A(x)e(x) ~ 1. 

Lemma 4.1. Any polynomial f(x) can be _ represented in the form 
f(x) ~ e(x)-a(x), where e(x) is a unit and a(x) is minimal and has the first 
coefficient 1. In addition, f(x) and a(x) have the same equivalence-divisors. 


8A similar algorithm has been used by A. Fraenkel, Ueber einfache Erweiterungen 
zerlegbarer Ringe, Jour. fiir Math., vol. 151 (1920), pp. 120-166. Compare Ore I, Theorem 6. 
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Proof. Expand f(x) as in §2, (3), pick out the first term f,(2)¢% of minimum 
value, and find the equivalence-reciprocal h(x) for the equivalence-unit f,(2). 
Then expand the polynomial h(x)-f(2) and drop out all terms not of minimum 
value. There remains an equivalent polynomial a(x), with an expansion 
beginning with ¢f. This a(x) is minimal, and we have f(r) ~ f,(2)-a(z), 
as required. 

To carry out the euclidean algorithm for two polynomials f(x) and g(x) with 
Def 2 Deg, write g(x) ~ e:(x)ai(x) in accordance with Lemma 4.1 and divide 
f(x) by a,(x), getting 


(2) S(x) = q(x)-ai(x) + re(x) Deore < Dgay. 


If Virz > Vif, a: and hence g is an equivalence-divisor of f. Otherwise, since 
a, is minimal, Virz = Vif and all three terms in (2) have the same value. 
Repeat this process with a;(z) and re(x) ~ e2(x)a2(x), ete., until a remainder 
exceeding the dividend in value is obtained. The preceding remainder d(x) is 
the greatest common equivalence-divisor of f(z) and g(x). As usual, 


(3) d(x) ~ s(x)f(x) + t(x)g(z) (in Vx) 


for suitable s(x) and t(x). To establish (3), it is convenient to note that, unless 
g(x) / f(x), all the terms in (3) must be of the same value in V;. 

The properties of equivalence-irreducible polynomials are now obtained as 
usual from (3). A decomposition of any f(z) into such irreducible factors must 
exist (because of D,). If we factor out a suitable unit, these irreducible factors 
can as in Lemma 4.1 be made minimal and hence key polynomials (§2, Condi- 
tions 2.21—2.23). 

THEOREM 4.2. Jn an inductive value V;, every polynomial f(x) has a decom- 
position 


(4) S(z) ~ ea ilazalz) --- ve(z) (in Vi), 


where e(x) is a unit and each y,(x) is a key polynomial. This decomposition is 
unique, except for the order of the factors and except that e(x) may be replaced by 
any equivalent unit and y;(x) by any equivalent key. 

If we require the factors y;(x) to be homogeneous in V;, (see §2, (8)), they 
are then unique. Note also that ¢, itself may occur as a factor, by 

Lemma 4.3. In an inductive Vi, is a key polynomial. 

Proof. Since ¢% is a key in V;_;, it has the first coefficient 1. Furthermore 
Ded. = 1; hence in any factorization of ¢, one factor is a unit, so that ¢, is 
equivalence-irreducible. Finally, ¢; is minimal in V;. 

In many cases the construction of the unique equivalence-decomposition (4) 
for a given polynomial f(x) in a given V;, can be carried out in a finite number 
of steps. 

THEOREM 4.4. The decomposition (4) is constructive when K is the field of 
rationals. 


The original value V> is then associated with a rational prime p, so that 
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every rational number is equivalent in Vo to one of the numbers c-p”™, ¢ = 0, 
1,---, p — 1; Vop = 1. Hence the complete set of representatives for Vo 
(see §2, end) includes but a finite number of representatives of each possible 
value® m. 

There are but a finite number of minimal homogeneous polynomials b(7) 
of a given degree d and with first coefficient 1. For any such b(x7) may be 
expanded in powers of x, 2, --- , @% as in §2, (8) with a highest coefficient 1 
of value 0. Because of the homogeneity, this determines the value of every 
other non-zero coefficient in the expansion. Since these coefficients are repre- 
sentatives, there is but a finite number of possibilities for each coefficient, and 
hence but a finite number of polynomials b(z). 

If f(z) is to be decomposed, write f(z) ~ e(x)a(x) by Lemma 4.2, find all 
minimal homogeneous polynomials b(x) of degree less than that of a(x) as 
above and by trial find which products, if any, are equivalent to a(z). 

The decomposition (4) can often be constructed by first decomposing the 
residue-class of f(x) (ef. §9 and M, part II). We can assume that all factors 
¢, if any, have already been removed from f. Then V:f(zx) will be in the 
previous value-group IT’... (M, Lemma 9.2), so that there is a unit polynomial 
R(z) such that V,(Rf) = 0. In the value V; the residue-class of any polynomial 
g(x) is denoted by Hig and is itself a polynomial in a new variable y (M, 
Theorem 12.1). In particular, H,(Rf) is a polynomial with a decomposition 


(5) Hi(Rf) = oaly) arly) +++ aly) 


into irreducible polynomials a;(y). But there is essentially just one key poly- 
nomial ¥;(y) in V; with the residue-class H,(y¥;R;) = a, for a suitable unit 
R;(M, Theorem 13.1). Since the residue-class of a product is the product of 
the residue-classes 


H.(Rf) = Hi (Riv Reve wake Ri¥.), 


and since polynomials in the same residue-class are congruent, 
Rf = RiR2 +++ Refaya ++ ve (in V;). 


If we multiply by an equivalence-reciprocal of R, we get the decomposition (4). 
Consequently, (4) can be constructed in this way whenever (5) can be found; 
that is, whenever polynomials can be constructively factored in the residue- 
class field of Vo in K (see §9). In particular, this method applies when K is 
the field of rationals. 


5. The construction of approximants. If 


(1) G(x) = agz™ + Qy_y2""! + --- + a, 


® Theorem 4.4 is true for any K and V» with this property. 
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the key ; of any first approximant V; = [V>, Viz = w| must by Definition 3.3 
be so chosen that, for suitable a > 8, 


(2) oui + Voda = Bur + Voas S iui + Voa; (i = 0,---,n), 


where the inequality holds fori > aor 8 > it. To interpret this, plot the points 
P; = (n — i, Voa;) in a cartesian plane. Then (2) states that the line P, Ps 
has slope yw; and that all the points P; are either above this line or on the 
line between P, and P;. The line segments P, P3 with this property for some 
u, form a convex broken line stretching from P, to Py. This broken line seg- 
ment is called the Newton polygon of the points P; ; it may be characterized 
as a convex polygon such that each corner is one of the points P;, while none 
of the points lie below the polygon. We have shown that each first approxi- 


mant V, corresponds to a side of this polygon of slope u4,; = Vir and of hori- 
zontal projection equal to the “projection” of V,. Hence 
(3) > proj Vi = deg G, 
the sum being taken over all first approximants V;. 
Next, given any (k — 1)-th approximant V,_; we wish to construct all k-th 


approximants V, which can be obtained by augmenting V,_,. Consider first 
the “terminating case” when G(x) is a homogeneous key polynomial” over 
V..1. Then by Lemma 3.4 the key polynomial ¢, must be an equivalence- 


divisor of the equivalence-irreducible G(x), whence ¢, = G. We obtain no 
finite approximants, but only the non-finite value V; = [Vi_1, ViG(xz) = ~], 


which by Theorem 2.1 corresponds to a value of K(@). 
Suppose instead that G(x) is not a homogeneous key polynomial over Vy_1. 
PI g yY poly 
Then by Theorem 4.2 and Lemma 4.3 


(4) G(x) ~ e(x)oea(x)" Yala)” «+» Yelar)™ (in Via), 


where the ¥;(z) are homogeneous keys over V;_,, all different and different 
from G(x) and $1, while the exponents n; are all positive, except perhaps 
for no. An augmented V; must have a key ¢, with ¢; /G@(x) in Vi, (Lemma 
3.4) and @% # dx: (§2, Condition 2.42). Hence ¢, is one of yi, --- , we. 

If one of these factors ¥; has been selected as ¢;, then G(x) has as in §3, (1) 
an expansion with coefficients g(x). To determine the new value uw, = Vi de 
to be assigned to @,, we again use a point Q; = (m — 7, Vx_igi(x)) for each 
term in the expansion and construct the Newton polygon for these points. The 
requirement that proj V; > 0 again means that yu; must be the slope of some 
side of this polygon. An inductive value requires also that uw, > Vids, so 
that we use only the principal part'' cf the polygon, composed of those sides 
of slope ph > Vy_ide. 


10 For convenience, we assume henceforth that the first coefficient in (1) is a, = 1. 

1 In special cases, this has been called a “‘Hauptpolygon’”’ by Ore (Ore I, p. 229; Ore II, 
p. 88) and a “verkiirztes Polygon’’ by Rella, Ordnungsbestimmungen in Integritdtsbereichen 
und Newtonsche Polygone, Jour. fiir Math., vol. 158 (1927), pp. 33-48. 
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THeoreM 5.1. Jf Vi_; is a (k — 1)-th approximant in which G(x) is not a 
homogeneous key, then the k-th approximants which can be derived by augmenting 
Vi-1 are all values Vi = (Vir, Vibe = wel in which o% # o.-; is any one of 
the keys in the decomposition (4) of G(x), while, for given ., ux is the slope of 
any side of the principal Newton polygon of G(x) with respect to o, and Vy. 
Furthermore 


(5) : (proj V,)-(deg @(V.)) = (proj Vy_,)- (deg @(Vix_.)), 


where the sum is taken over all such augmented V;, and where o(V) represents 
the last key of V. Hence there is at least one approximant V,. from Vi, . 

It remains to prove (5). On the left side of (5) suppose first that @, is the 
factor y; in (4), and consider the power n = n, to which ¢, divides G. Since 
¢, and hence ¢? is minimal in V,_;, the remainder 


a— 


r(x) - In—1 Px , + In20i + nae + Jo 


obtained on dividing G by ¢; must by Lemma 3.5 have V.yr > VuyuG. Caleu- 
lation of V,_yr as in §8, (2) gives 


(6) min [ViaQn1f), «<5 Viagel > Vir@ndi) = VinG, 


with the equality because n is the largest exponent with @! /@in Vy_,. If we 
set v = V,_1¢, and use §3, (2), this becomes 


Vi-rgn + nv = Ving; + jv Qj n+ 1, coe, m) 
< Vi-ngi + iv (@ = 0,---,n — 1). 


Geometrically, this means that the line L of slope v through the point Q,, lies 
above none of the points Q; and lies below Q,_1, --- , Qo. The convex Newton 
polygon is hence above or on L, so that the principal polygon, containing those 
sides of slope exceeding v, consists of the sides joining Q, to Qo. The horizontal 
projection of the principal polygon for ¢ = y; is therefore n = ny. 

However, proj V, is by definition (§3) the projection of the corresponding 
side of the principal polygon. Hence a sum taken over those Vy with y, as 
the last key gives }> proj Vi = m:. Similar equations for all y, yield 


(7) > (proj V,)-(deg ¢,) = n, deg y, + --- +n, deg y, = deg (yt --- wy"). 


But y}'---y7! is minimal, so that its effective and actual degrees in @ = @,—; 
must agree. Thus 


(8) deg (Vi! «+. WE) = Doi «++ ¥P)- (deg b.1). 
Because of (4) the effective degree is 
(9) Dot --- ¥i) = DG — Do, = D.G — ng. 


If the expansion of G(x) is }> h,(x)¢)_,, then D,G is by definition the exponent 
of the first term of minimum value, while no, the highest power with ¢2°,/G 
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in Vy_1, is by the argument used in (6) simply the exponent of the last term 
of minimum value in the expansion of G(x). By the definition of the projection, 


(10) DsG — no = proj Vir. 


The last four equations combine to give the result (5). By induction on k 
we obtain from (3) and (5) the following result. 

THEOREM 5.2. If the “terminating” case does not occur by the k-th stage, there 
is a finite number of k-th approximants, such that * 


(11) d (proj V.)- (deg o(Vx)) = deg G, 


the sum being taken over all k-th approximants V;.. 

THeEeoreM 5.3. (Terminating case.) Jf there is a non-finite homogeneous 
inductive value V, with ViG = =, then fori < k the value V; from which V;, 
is obtained is the only i-th approximant 

Proof. By Lemma 3.2, Vx_1, and hence by Lemma 3.4 each Vj, is an approxi- 
mant. Since V.G = ~ and G is irreducible, G must be the last key of V;, 
whence G is minimal in V;_; (see §2, 2.3): 


G(x) = or 1 + Gu1Or—1 + itr odin + Jol). 


Since G is minimal and (§2, 2.42) cannot be equivalence-divisible by 4—:, 
the first and last terms here take on the minimum value V,_,G, so that 
proj) Vis = m. Thus 


deg G = m(deg 1) = (proj Vy_-1)- (deg dx—1), 


and by (11) Vy_, is the only (k — 1)-th approximant. Hence each V; is the 
only i-th approximant. 


6. Exponents for values. To estimate the growth of yu, we need “value- 
groups”. If in an algebraic number field the prime ideal p is a factor of the 
rational prime p, and if the corresponding p-adic value W is an extension of 
the p-adic value Vo, then the highest power e to which p divides p is character- 
ized by Vop = e(Wp). Hence the group of all numbers used as p-adic values 
is a subgroup of index e in the group of p-adic values. For any value V of 
a ring S, the additive group I which contains all real numbers Vb — Ve for 6 
and ¢ in S is called the value group of V. This group is cyclic if and only if 
the value V is discrete (§2). If V is an extension of Vo to K[z] or to K(@), 
the value group I’) of V» must be a subgroup of the value group T of V. The 
order of the factor group I'/T, is called the exponent'*, exp (V). 

Now compute this exponent for an inductive value V;, with a value-group 
r,. The definition of §2, (4) indicates that every number in T, has the form 
vy + n-ue, Where n is an integer and y isin Ty_,. If we consider only the case 
when yu, is commensurable with [',.; (by M, Theorem 6.7, this is true when- 


12 An invariant interpretation of (11) will be given in §9. 
13 Similarly defined in Deuring, op. cit., p. 281 and Ostrowski, op. cit., p. 322. 
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ever V, can be augmented to some V,,,), there is a unique smallest positive 
integer 7, with the property that r.u,¢« Tx-,. By group theory 


(1) order (1./lx-1) = re, 

(2) exp (Vi) = ri-T2 +++ Te, 

where 7; fori = 1, --- , k is similarly defined. The assumption that yy, is 
commensurable also proves [, discrete. If u, = ~*, the formulas still hold 


if we take 7, = 1. 

In the course of §8 we shall need an estimate for exp (V,). Since each key 
polynomial ¢,,; is homogeneous (§2) in V;, any two terms in the expansion 
of $4: in powers of ¢; must be of equal value, so that this expansion appears 
as a polynomial in ¢j* (M, §11). Consequently deg $i: 2 ri(deg ¢;). 
Combining these inequalities for all 7, we find 


(3) deg @, = T1T2 --- re = exp (Vy_)). 


7. Integral key polynomials. It is often convenient to use keys with “‘inte- 
gral” coefficients. Here an integer“ with respect to Vo is an element ae K 
with Voa 2 0. All such integers form a ring, and every element of K is a 
quotient of two such integers. After the usual transformations we can assume 
that G(x) has Vo-integers as coefficients and the first coefficient 1. The Newton 
polygon of the first stage then must give a wu; 2 0, so that V,2 2 0 for every 
approximant. 

THEOREM 7.1. In a homogeneous V;,, with Vi.s;x 2 0, we have 


(1) OS wi < we < +--+ < we < wes, 


and the keys $; are all polynomials in x with Vo-integers as coefficients. 
The last key $,,; is minimal (2.3), so has a leading term ¢/* and a homogene- 
ous expansion as in §2, (8): 


(2) dis = Or* + Diaeiv ge” --- or (a;e K, mej < Uy), 


where, if n; stands for deg ¢;, the degrees m;; are limited by 

(3) Miz < Niga/Ni (all j,i = 1,2, ---,k — 1). 
Since ¢,,; is homogeneous, all terms in (2) have the same value. Hence 

(4) Mesa > Vidbrur = Urwn = (Mepime)/me. 


Since wu; = 0, (4) for every k gives (1). We next estimate the terms of (2). 
Lemma 7.2. In any V, with V;.x2 = 0, a term 


T = ¢3'¢3' --- $,", (mz < nisa/n; for all i) 
has a value V;.T S Vi-dx. 


4 (Cf. Ostrowski, op. cit., p. 288, or the “Bewertungsring”’ in Krull, /dealtheorie, p. 101. 


= ee nen 
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This inequality can also be written as 


mMypy + es + Mei me-ia S me. 
It is true for k = 1 or 2, by hypothesis and (4). If we assume it for & — 1, 
then, since n,/n,_, is integral, 
k-1 k-2 ‘te 
YS mii = mearwea+ DO mini S (mat Dua S et < be 
i=l i=l k-1 


Theorem 7.1 now follows by induction. It is true fork = 1. If all the keys 
of V, have V>-integral coefficients, all terms in the expansion (2) of ¢,., have 


the same value. But @{" -.-. @7 = T-o7*"' has by the lemma a value not 
exceeding V,¢;"'*' = V,@;". Hence the coefficient a; has a non-negative 


value, and a; is V>»-integral. 

Note. If K is the field of rational numbers, G(x) with leading coefficient 1 
can be so chosen that all its coefficients are ordinary integers (with non-negative 
value in every Vo). The same proof then shows that all ¢, have ordinary 
integers as coefficients, provided only that the representatives (§2) for each 
p-adic value Vy are chosen as the numbers c-p”, ¢ = 0, ---, p — 1. Similar 
results hold when K is an algebraic number field. 


8. The finiteness theorem. Each k-th approximant may give rise to one or 
more (kK + 1)-th approximants, so that the number of k-th approximants can 
increase with k. Ultimately, the number of approximants stops increasing, 
but for a finite construction we must be able to tell how soon this is the case: 

THEOREM 8.1. One can find an integer k’ so large that each k'-th approximant 
has the projection 1. As a result, only one (k + 1)-th approximant can be ob- 
tained by augmenting any given k-th approximant, for any k = k’. 

The second conclusion follows from the first, because in §5, (5), deg o;, can- 
not decrease (§2, Condition 2.41). To establish the first conclusion, we will 
show that a projection not 1 gives G a multiple factor “mod u,”’, in the sense 
in which A(z) is a common factor “mod v” in 

Lemma 8.2. If, in any homogeneous V; with Vix 2 0, f(x) and g(x) are 
polynomials with Vo-integral coefficients and a resultant R(f, g), if there are poly- 
nomials h(x), a(x), and b(x) with 


Vi(f — ha) 2 », Vig — hb) 2 v (v real), 


and if h(x) is not a unit in V,, then V,.{[R(f, g)|] = v. 
Proof. Since R(f, g) = 0 would imply V.R = ~, we can assume R(f, g) + 0, 
so that there exist e(x) and d(x), with Vo-integral coefficients, such that 


e(x)f(x) + d(x)g(x) = Rf, g). 
(van der Waerden, Moderne Algebra, vol. 2, p. 4). Hence 
R(f, g) = (ca + db)h + c(f — ha) + d(g — hb). 
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Since V.2 = O and therefore Vc 2 0 and V;d 2 0, the last two terms here 
have values not less than v. Were V;R < »v, we should have 


R(f, g) ~ (ca + db)h (in V,). 


Since R is a constant, this makes h a unit (see §4), contrary to hypothesis. 
To apply this lemma when R is a discriminant, use 

Lema 8.3. In any homogeneous V;, with Vix 2 0 and Vid, = yx the deriva- 
tive f’(x) of any polynomial f(x) has a value V;.f'(x) 2 Vi f(x) — ux. 

For k = 1 the result follows readily, since the value of a natural integer 
1 + --- + 1 is never negative. If the lemma is true for V,_,, and if f(x) 
has the expansion > S(x)oi as in §2, (3), then 


f(z) = Tf wei + ¥ iweie.(2). 


The value of the first sum exceeds V;f — yu, because of the induction assumption 
and because nu. > wr1. The value of the second sum is 2 Vif — ux, since 
V.j = 0 and V,¢, = 0, the latter because ¢; has V>-integral coefficients by 
Theorem 7.1. 

To establish Theorem 8.1, consider a V, with a projection a — 8 > 1. The 
expansion of §8, (1), used to define this projection gives 


(1) Vi-rga + ome S Ve-igi + tur (i = 0, ---,m). 
Division of G(x) by $f yields, in terms of this expansion, 

a—l 
(2) G(x) = g(x)ei + r(x), r(z) = QE gi(x)ei. 

1=0 


For this remainder r(x) the triangle law (§2, (1)) and (1) show 

Vinurd= min [Vingi + 7-Vind:] = min [Vi-rga + (a — t)ux + ¢-Vi-rgel, 
where 7 ranges from 0 to a — 1. Since uw, > Vyu_1¢,%, the minimum is at 7 = 
a—1: 
(3) Vir 2 Vi-rga + we + (a — 1)Vidx. 
As the divisor ¢f has V>-integral coefficients and first coefficient 1, the quotient 
and ga(x) likewise have integral coefficients, whence V;._:g2 2 0, since Vy_,x 2 0. 


Further, (4) of §7 proves Viid, 2 wx. 1, while a 2 proj V, was assumed to 
exceed 1, so that (3) becomes 


(4) Veur 2 we + wea. 
Differentiation of (2), with Lemma 8.3, now proves 
Vil@ — (agg, + 9'¢,)67") = mei ViulG — get] => my. 


Thus G and G’ have a “common factor” ¢{"', with a — 1 > 0. This factor 
is not a unit because ¢, is minimal in V,_;. Thus Lemma 8.2 with §7, (1) 
gives 

(5) VialR(G, G’)|) 2 we 2 wer (k > 1). 
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For large k this is impossible. For if T',~,, the cyelic value group of Vy_; 
has the generator 6,., > 0, while the group Io for Vo» is generated by 4) > 0, 
then, because of §6, (3), and §5, (11), 


(6) 60/6... = exp Va_r S deg d& S (deg G)/(proj V;,). 


Hence 6,_; is bounded below by 69/deg G. But the sequence yu; fori < k — 1 
lies in T'y_; and is increasing (§7, (1)), so that it increases by steps of at least 
5:1. Therefore n»1— © withk. But the field K(@) was assumed separable, 
so that G has no multiple roots, whence R(G, G’) # 0 and V,.4[R] = VoR is 
finite. Thus the inequality (5) is impossible for large k, and the assumption 
proj V; > 1 is untenable for large k. 

This proof can be used to estimate how soon proj V, becomes 1. 

If one combines (5) and (6) as indicated above, then 


Vis [R(G, G’)] = [((k — 2) 50-proj Vil/(deg G). 


This gives an upper bound for any k with proj V, > 1. If we use the worst 
value, proj V; = 2, in this bound and compute k’ as the next larger integer, 
we find that the integer k’ of Theorem 8.1 may be taken as 


1» _ | pn ‘ 
(7) k’ = B + 3, 


where n is the degree of G(x) and p the integer determined by Vo|R(G, G’)| = pio. 

Several improvements in this estimate are possible: (i), the term uw. — wii, 
neglected in (5), can be estimated as not less than 69/n; (ii), if n is odd and 
proj) V, = 2, the last inequality of (6) can be improved, while the remaining 


eases of proj V, = 3 or n even, proj V; = 2 can be treated by the original 
method. If this is carried out, one finds 

. n 
(8) k’ = (| + 2. 


The whole argument can now be repeated with proj V, replaced by the pro- 
jection of the principal polygon for ¢,. This shows that once ¢, is chosen for 
k 2 k’, the principal polygon has only one side, so that yu, is completely de- 
termined. In other words, only the first half of the k’-th stage is needed for 
Theorem 8.1. 

In the algebraic number case, p is the power to which the prime p under 
consideration divides the discriminant of G. If p = 0, then two stages suffice. 
This is essentially a part of the result of Dedekind, that under these conditions 
the prime ideal factors of p correspond to the irreducible factors ¢2(x) of G(x) 
modulo" p. Presumably the estimate (8) could be improved by introducing 
the index (involving the non-essential discriminant divisors) of the original 
equation. 


18 R. Dedekind, Ueber den Zusammenhang zwischen der Theorie der Ideale und der Theorie 
der hiheren Kongruenzen, Gesammelte Werke, vol. I, pp. 202-233. 
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9. The degree of a value. To interpret the relation (11) of §5 we need the 
notion of the “degree’’ of an absolute value. In an algebraic number field, 
the “inertial’’ degree of a prime ideal factor p of a rational prime p is just the 
degree of the residue-class field of p over the field of the integers mod p. To 
generalize to any value V of a ring S, use the ring of all “integers’”’ a « S with 
Va = 0, and call two integers a and b congruent mod V if V(a — b) > 0. The 
set of residue-classes of the integers with respect to this congruence forms as 
usual a ring, the residue-class ring S/V. If S is a field, so is S/V. If W is 
any extension of our original value V» to K(@), the usual arguments show that 
the residue-class field K(@)/W contains a subfield Fo isomorphic to K/V» and 
that K(6)/W is algebraic over this Fy. The degree of W is defined to be the 
degree, deg W, of K(0)/W over Fo. 

To compute the degree, we use the results of M, part II, which show that 
for a sequence of discrete inductive values V,, V2, ---, Vx the residue-class 
ring of each V; has the form of a polynomial ring F j{y], where F; is an algebraic 
extension of Fy = K/V>. Furthermore (M, Theorem 12.1) F; = Fo, while, 
fori # 0, F;,; is an algebraic extension of F; of a degree which is exactly the 
degree of ¢;,; considered as a polynomial in ¢7*. In other words (M, Theorem 
12.1), 


degree (Fi.4: F;) = deg o41/(ri-deg $3) (¢ = 1,---,k — 1). 


These formulas, combined with the interpretation of 7; in §6, (2), give 


(1) degree (F.: Fo) = ae... as <5 : 
TT. *** Tea exp (Via) 





These results can be extended to non-finite inductive values thus": 

THEOREM 9.1. For a non-finite value Vi = (Via, Vidi = ©] the residue-class 
ring K[x]/V, is isomorphic to a field F;,, which is an algebraic extension of Fy. 
of a degree determined as in (1), where Fy_s[y] ts the residue-class field of Vi4. 

Proof. Exactly as in the proof M, Theorem 12.1, F; is defined as the set 
of all residue-classes modulo V; which contain a polynomial f(x) with Vif = 0. 
But if a polynomial g(x) in any residue-class is divided by ¢, giving 


g(x) = g(x)oe + r(x), 


then the term g¢ has value «©, so that g and r belong to the same residue- 
class, while Vir = Vir => 0. Hence F;, includes all residue-classes and is the 
residue-class ring. Its degree is found as in M, Theorem 12.1. 

THEOREM 9.2. If W, an extension of Vo to K(6), corresponds as in Theorem 2.1 
to an inductive value V;,. with V.G(r) = ~, then 


(2) (exp W)-(deg W) = deg @&. 


© Theorem 9.1, as well as the last paragraph of §4, was revised July 15, 1936. 











508 SAUNDERS MacLANE 


The correspondence of W to V, yields an isomorphism between the residue- 
class rings K(@)/W and K[zr]/V;. Hence by (1) and the definition of the 
degree of W, 

deg W = degree (Fy: Fo) = (deg ¢,)/exp Vi. 


But since any V;f is either + * or some value from V,_,;, the value-groups of 
V, and V,_, are identical, and V,_,, V;., and W have the same exponent. 
Therefore (2) results. 


A similar interpretation holds for a limit-value V, = lim V,. We first 
prove as in M, Theorem 14.1, that, as soon as deg @, = deg dk41 = --- , We 
have Fy, = Fy,, = --- , and that this constant F;, is the residue-class ring 


Klx|/V,,. As before, this F; is then also the residue-class field of the corre- 
sponding value W of K(@). Consequently, using (1) again, we get 

THEOREM 9.3. Jf W is an extension of Vo to K(@) which corresponds as in 
Theorem 2.1 to a limit-value V, = lim V,, with V,,G(x) = ~, then 


(3) (exp W)-(deg W) = lim deg ¢,, 
kn 


and the limit on the right is actually attained for large k. 


10. The totality of values. The existence theorem is 

THEOREM 10.1. There are only a finite number of extensions W’,W”, --- ,W® 
of a given discrete value Vo of K to the separable field K(@), where 0 is a root of 
G(x) = 0. Furthermore, 


(1) (exp W’)-(deg W’)+ --- +(exp W™).(deg W®) = deg G(z). 


The relation (1) is a generalization of a well-known property of prime ideals. 
We first show that all W come from approximants. Every value W of K(@) 
corresponds by Theorem 2.1 to a value of A[z], which must be either an induc- 
tive value V, or a limit-value V,. In the latter case, V,, is the limit of a 
sequence V,;, V2, --- , in which each V, is by Lemma 3.2 an approximant. In 
the former case, V.G(r) = ~* and V,_; is by §2 and Lemma 3.2 a finite ap- 
proximant. Since V;, is not finite, Vio, = we = *&. Then only the multiples 
of ¢, have non-finite values, so that the last key ¢, must be G(x) itself. This 
is the “terminating case” of Theorem 5.3. In this case there is only one se- 
quence of approximants and hence only one value W of K(@). The equation 
(2) of §9 thus gives the relation (1) above. 

In the non-terminating case, we can construct one or more sequences of ap- 
proximants V,, V2, V3, ---. We must show that each such sequence gives 
a value W of K(@). By Lemma 3.4 


(2) ViG(x) < VoG(x) < V3sG(z) < ---, 


while ultimately proj V.; = 1 and deg ¢, is constant (Theorem 8.1 and §5, 
(5)). The index 7, of each value-group [,-; in the succeeding Ty is thus 
eventually unity (§6, (1) and (3) ). Therefore all the values in (2) lie in some 
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one discrete group Ty, so that V.G must approach x. The limit-value V, 
then has V,G = ~, so that V,, corresponds to a value W of K(@). The re- 
lation (1) for all these values follows from Theorems 5.2 and 9.3 because 
proj V;, = 1. 

The complete limit-value V, cannot be written down, but its essential prop- 
erties can be calculated. 

THEOREM 10.2. Each value W of Theorem 10.1 is uniquely determined by 
an “approximant” inductive value V\'’ of K|x), for some k = k’. If it is possible 
to construct the irreducible factors of polynomials with coefficients in the residue- 
class field K/V., the approximants V‘!’ can be computed in a finite number of 
steps by finding certain slopes u'!! of the Newton polygons of G(x) and certain key 
polynomials ¢\'? as the irreducible factors of G(x) in various equivalence-decomposi- 
tions. In this case one finds, in a finite number of steps, (i) the number s of ex- 
tensions of Vo to K(@); (ii) the exponent and degree of cach such W“; (iii) the 
values Wa for any previously given a in K(@). 

This is a restatement of previous results, except for the last assertion, which 
gives a construction of the “prime ideal’ decomposition of any a If a = 
g(0) # 0, then we need only compute V,g(2) for each limit value V, involved. 
If for every k, Vig > Vi_ig, the argument following (2) proves Vg = ~ and 
a = 0. Otherwise Vig = Vi_ig for some k, so that V, is not an approximant 
to g(x) in the sense of Definition 3.3 and Vg = Vig as in Lemma 3.2. Hence 
Wea can be computed in k stages. 

In the algebraic number case (K = the field of rationals) the construction 
of a prime ideal with inductive values can be extended to give a representation 
of the prime ideal as the greatest common divisor of integers. It can then also 
be proved that the “terminating case”’ of the construction arises whenever the 
prime p in question has only one prime ideal factor. The proof depends on 
the fact that every rational integer can be expressed as a sum of a finite number 
of terms cp”, with c = 0, 1, --- , p — 1. Thence it can be argued that any 
approximant V, with deg @, = deg G must ultimately lead to the terminating 
case. 

It remains to connect our results with previous investigations on this topic. 
Ore" developed (Ore I) a construction for prime ideals in algebraic fields which 
for this special case is equivalent to the first 2} stages of our method, which 
involve the approximants V2 and the key polynomials ¢;. This part of the con- 
struction does not suffice’® for all equations G(x). In a subsequent paper 
(Ore II, especially Kap. 2, §5) Ore made an extension equivalent to one more 
stage of our method, coupled with successive transformations of the defining 
equation G(x), which have the effect of reducing several stages of our method 


17 Ore uses uw, = 0, which is possible because @ is assumed integral. 

18Q. Ore, Weitere Untersuchungen zur Theorie der algebraischen Kérper, Acta Math., 
vol. 45 (1925), pp. 145-160. Here it is proved that for every p and every algebraic field 
there ‘‘exists’’ a regular defining equation for which the second stage is sufficient. How- 
ever, the existence proof is not constructive. 
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to one stage. This method is constructive and applies in all cases, but is justi- 
fied only by appeal to another, more elaborate construction'’® of prime ideals 
in terms of congruences mod p*. Berwick has developed® approximations 
equivalent to 2} stages of our method, and mentions the possibility of a third 
stage. The investigations of Wilson,*' although they are formulated in terms 
of group-bases for ideals, are closely related to the first two stages of our method. 
However, if the method of successive approximations is to be universally ap- 
plicable, it must be formulated in terms of an arbitrary number of steps; for, 
given an integer k and a prime p, an irreducible polynomial G(x) can always 
be constructed so that the decomposition of p in the field defined by G(z) will 
require more than k stages. 

Our construction can also be employed to give a simple form to a number of 
irreducibility criteria,” to prove one of the fundamental theorems relating 
Hensel’s p-adic numbers to prime ideals and to constructively establish the 
unique decomposition theorem in terms of the ‘““Hauptordnungen”’ of Krull.” 
I plan to discuss some of these topics in a later paper. 


HARVARD UNIVERSITY. 


19Q. Ore, Ueber den Zusammenhang zwischen den definierenden Gleichungen und der 
Idealtheorie in algebraischen Kérpern, Math. Ann., vol. 96 (1926), pp. 313-352; vol. 97 
(1927), pp. 569-598. 

20 W. E. H. Berwick, /ntegral Bases, Cambridge Tracts in Mathematics and Mathe- 
matical Physics, No. 22. 

21N. R. Wilson, On finding ideals, Annals of Math., vol. 30 (1928-29), pp. 411-428. 

22 §. MacLane, Abstract absolute values which give new irreducibility criteria, Proc. Nat. 
Acad. Sci., vol. 21 (1935), pp. 472-474; The ideal-decomposition of rational primes in terms 
of absolute values, Proc. Nat. Acad. Sci., vol. 21 (1935), pp. 663-667. 

23 W. Krull, [dealtheorie, p. 104. 




















ON THE CLOSURE OF {e**7} 
By Norman LEVINSON 


1. A set {e®»*} is said to be closed L?(—z, x) if for any f(x) « L»(—7, r) 


(1.0) f(xzje® dx = 0 
implies that f(x) is equivalent to zero. 
Here we will concern ourselves with the closure properties of the set 


fen} (-_2x <n < aw), 
where 


n 


(1.1) lim — =], 
that is, the A, are positive for sufficiently large n > 0 and have density 1,' 
and a corresponding result holds for n < 0. 

The question of the closure of such sets was first investigated by Wiener and 
Paley,? who considered closure in L?(—z, 7) of even sets (A_, = —X,). They 
made a special study of the set {1, e*®»7}, n > 0, where 


(1.2) |~A —n| 3s B, n> 0, 


and showed that if B < 3, the set is closed L?(—7, 7). Here it will be shown 
that for closure L*(—7, 7) it suffices that B < }. 

First we shall obtain a general closure criterion. We shall use this criterion 
to get results under conditions of the type (1.2) and we shall show that these 
results are the best possible. 

Our basic criterion is given by 

TuHeorEM I. Let {X,,} satisfy (1.1). Let A(u) be the number of |», | Su. If 


(1.3) A(u) du > 2v — &=5 logy — C 
1 u Pp 
for some constant C, the set {e®*}, —2» <n < ~, is closed L*(—7, r), p 2 1. 
A corollary of Theorem I is 
TueoreM Il. Jf 


(1.4) lA. — n| sin+?o- (-o<n<o), 


Received June 4, 1936. The author is National Research Fellow. 

1 If the density is different from 1 (and not zero or infinite), the problem is reducible 
to this one by making a change of scale. In case densities do not exist, see Levinson, Proc. 
Camb. Phil. Soc., vol. 31 (1935), pp. 335-346. 

2? Wiener and Paley, Fourier Transforms in the Complex Domain, Am. Math. Soc. Coll. 
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the set {e®»*|, if it ts not closed, becomes closed on adjoining to it at most any N 
terms e'™, 1 Sn Nn. 
In particular, then, in order that a set be closed L"(—7, +), p = 1, it suffices that 


. «oie es. 
- ae 


THeoreM III. Jf we replace (1.4) by 


(1.5) An — stresS = 2, 
2p 


r é, 
where 6 > O, there exist sets je%"| satisfying (1.5) which do not become closed 
when N terms are adjoined to them. Thus (1.4) is a best possible result. 

In connection with Theorems II and III the following result is of interest, 
although the proof is quite trivial. Note that it holds with no restrictions 
whatsoever on the }A,!. 

TueoreM IV. [f the set {e%"| is closed L"(—x, x), p = 1, it remains closed 
if we replace any X,, by some other number. 

Obviously this result is equivalent to the one obtained by replacing “closed”’ 
by “unclosed” in the above statement, for if an unclosed set becomes closed, 
we apply Theorem IV to the closed set and obtain a contradiction at once. 


2. Proofs of the theorems will now be given. 
Proof of Theorem 1. Let us suppose the theorem is not true. There exists 
an f(x) « L*(—7, 7) such that 


(2.0) H(w) = | f(x) e* dx 


has* zeros atw = A,, —* <n < «. Let us assume that none of the X, is zero. 
(How to proceed if one of them is zero will be obvious.) 


Let 
(2.1) F(w) = I] (1 _ *) ite, 


Since H(w) vanishes at \,, — ~« <n < &, 
o(w) = H(w)/F(w) 


is an entire function. Denote the number of zeros of H(w) not exceeding r in 
magnitude by n(r), and those of ¢(w) by m(r). Then clearly 


(2.2) n(r) = n(r) — A(r). 
It follows from Jensen’s theorem that 
r 2r 
(2.3) / n(u) du s I / log*+ | H(re®) | d@ + A, 
1 u 2r Jo 


3’ The use of an entire function H(w) in connection with the closure of trigonometric 
functions is due to Sziisz, Math. Annalen, vol. 77 (1916), pp. 482-496. 
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where A will be used throughout to represent various constants. By (2.0) 
H (re**) _ O(e™" sin @ ). 
Using this in (2.3) we have 


/ n(u) du S 2r+A. 
1 u 
Using this and (1.3) in (2.2) we have 


m(u) dus P Loge + A. 
1 








u 


Since (p — 1)/p < 1, it is clear that m(r) = 0, or in other words that $(w) 
has no zeros, that is, the zeros of H(w) coincide with those of F(w). By the 
Hadamard factorization theorem it follows therefore that H(w) = aeF(w), 
and in particular that 


(2.4) | H(iv) | = e* | aF (iv) | 


i? 
where a, b and ¢ are constants. 
From (2.0) we have for all sufficiently small « > 0, 


| H(iv) | < |S +[" +f Je | fla) | ae 


Using Hélder’s inequality we have 


| H(iv) | s | /  eneenlte-D ax |" fc | f(x) |” ax)” 
r eed (p-1)/p —rte Lf Up 
+ [2 et |vip/(p-1) dy ( }- ) | f(x) |p az 
$s ei" | 9 |[-@-0lp {ae : Lf | f(x) |? ax)" 
—~“FT+e r / 
+a[(f0" +f" )iserieae]”} 


For any 6 > 0 we can choose an ¢€ so that 


a + [.) (sa) rae |” = 6 


| H(iv) | 


Thus 


IIA 


Ae™!"! | py |-@-vip (e~* I"! 4. 8), 


' 


Or 


log | Hie) | 3 «|| —2— 





log | v | + log (eS '"' + 8) + A. 


P 
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From this it is clear, since we can choose 6 arbitrarily small, that 


tim (ioe | H(iv) | — xlv| + pat log |) = —o, 
Or using (2.4), we get 
(2.5) lim (ioe | F(iv) | + ev — lv) + P = i log | v ) — s 


| o|—+20 


On the other hand, by (2.1), 


log | F(iv) | = | log (1 + ") dA(u) - | A(w) ‘got 
0 u 0 uw “ 
= Qu? u " A(y) 
a) pon’ ou dy. 
» (e+ ap f y * 


If we use (1.3), we have 


ie Ph Qe u p-1l 
, a nniiitaeicinssdiamaee == = 
log | F(iv)| = | Ge + (2. : log u A) du 








ee p—-1f[" vu 
= lv| — ~ o@ pee log udu — A 
= Tv -— Pp 2 l log v | | aa du — A 
= rlv =n CF eee te — A. 

Pp 


But this contradicts (2.5). 








Proof of Theorem II. Let us consider A(u) for u > N + 1. From (1.4) it 


follows that 





A(u) >1+4+ 2[u — 4N — (p — 1)/2pl, u>N+1, 
or 
’ v 1A’ 
I A(u) 4, = | 1 + 2[u — 4N — (p — 1)/2p) ,, 
V+1 u N+1 u“ 
= 2 | BNP 
V+1 u 





. y , ny ~ 
a 2[ u — 4N — (p — 1)/2p — ot Bo — 1)/2p\ — 3 yy 
V+1 


Since u — [u] — } is periodic, we get 


[ A(u) du > 2 | u — 3N — (p — 1)/2p te a 
v+1 U N 


N+1 u 


> 2 — N log v — P 


of. log v — A. 
p 
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Now let us add N terms e“™, 1 < n < N, to fe} and denote the new set by 
j\e*"*|. Then clearly for sufficiently large u, u(u) = A(u) + N, where u(u) 
is the number of | wn | <u. Thus 


i u(u) du > 2v — E=2 log v — A. 
: = p 


By Theorem I it follows at once that {e'*™} is closed L°(—7, r). 

Before proving Theorem III we shall prove Theorem IV. 

Proof of Theorem IV. Let {e'***} be closed and the set {e'*"*}, n # 0, and 
e'*7, a # Xo, not be closed. There exists an f(x) « L»(—7, +) not equivalent 
to zero such that 


il f(x)e*™ dx = 0, n #0, 


f(x)e'*? dx 


II 
S 


Let us consider 
g(x) = f(x) + (ho — ale? | f(y)ei dy. 
Clearly g(x) « L?(—7, x). Moreover, 


[E otaree de = [ peeresedtz + ida — a) [* eto-omd fone dy, 


or 


(2.6) [ g(x)e™ dx = —— me f S(x)e dz. 
7 uU—aQa J—r 





It follows at once from this, on setting u = A,, that for any n 


i] g(xje* dx = 0. 


But je} is closed L*(—7x, r), and therefore g(x) is equivalent to zero. 
But by (2.6) this means that 


il " Hae dx = 0. 


If we set u = 0, +1, +2, --- , this implies that f(x) must also be equivalent to 
zero, contrary to our assumption. 
Proof of Theorem III. First let us take the case when N is odd. We take 


An = n+ 4N +564 (p —1)/2p (n > 0), 
(2.7) 1. = —h, (n > 0), 


ho = 4N + 6 + (p — 1)/2p. 
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By Theorem IV it does not of course matter where we take A> (or any other 
finite number of X,,). Let us set 

2+6+ (p — 1)/2p =. 


Then clearly cos*-* 3x ¢ L?(—z, 7). Moreover, for n = 0 
2 ’ ’ ’ 


Tr 
| ei(ntOz egs2t-2 dy dx 


Yr 


rr 


r 
_ pase f eintDz(] 4. cz) 2-2 dx 


- 
lim gave | et(ntDz(] + reiz)2t-2 dx 
0 = 


r 


lim 2-2+2 rk 2i—2 eilntktDz dy = 0). 
k s 


r—1—0 k=0 ® 


r—1— 


A similar result holds with e~**t®, n = 0. Thus cos**-* 42 is orthogonal 
to et#("+0, n > 0. But the set |+(n + t)}, n = O, contains the set {A,} 
defined in (2.7) and N additional terms. This proves Theorem III if N is odd. 

If N is even, we proceed similarly but we now use sin 32 cos*~'}xr, where 
t = 6 + (p — 1)/2p. In this case {1, e**"*97}, n > 1, is orthogonal to 
sin $x cos*—'32, 
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REPRESENTATION OF POSITIVE HARMONIC FUNCTIONS 
By A.LFreD J. Maria AND Rosert S. Martin 


We are concerned with the problem of representing the positive harmonic 
functions in a given region, and are primarily interested here in pointing out 
the relevance to this problem of a number of other problems, some of which 
have been discussed in the literature. The representation, by means of the 
Poisson-Stieltjes integral, of the positive harmonic functions in a sphere is an 
instance of the type of representation with which we are concerned. The 
analytical technique customarily employed in establishing the Poisson-Stieltjes 
representation or one of its generalizations requires relatively stringent smooth- 
ness conditions (e.g., bounded curvature) upon the boundary of the region.! 
It is true that the criteria we here cite as sufficient for a solution of the repre- 
sentation problem are less explicitly connected with the nature of the boundary 
than are the usual conditions just referred to, and it does not seem a trivial 
problem to characterize intrinsically the regions for which these criteria are 
satisfied. Nevertheless, as we shall show elsewhere, our criteria are satisfied 
by classes of regions considerably broader than those to which the customary 
technique applies. This would seem to make it clear that the representation 
problem does not depend essentially on smoothness conditions, even in three 
or more dimensions where conformal mapping no longer serves as a deus ex 
machina. 

In the present note we shaJl point out the criteria and give one two-dimensional 
application: a direct representation—that is, a representation not depending 
upon the intervention of conformal mapping—-of the positive harmonic func- 
tions in a finitely multiply connected Jordan region. For simplicity we shall 
restrict the discussion to bounded regions and shall use two-dimensional lan- 
guage, but it is to be emphasized that, except in the application at the end, 
the argument is independent of the number of dimensions. 

The representation in question is of the form 


(1) u(P) = J S(S, P)dyles), 


where u(P) is a non-negative harmonic function in a bounded region A, where 
A* is the frontier of A, where f(S, P) is a certain function which depends only 
upon the region A and which is defined for S « A*, P ¢ A, and where u(e) is a 
finite, non-negative, completely additive function of Borel sets which vanishes 

Received June 5, 1936. 

‘de la Vallée Poussin, Propriétés des fonctions harmoniques dans un domaine ouvert 
limité par des surfaces 4 courbure bornée, Annali della R. Seuola Normale Superiore di Pisa, 
f2), vol. 2 (1933), pp. 167-197; George A. Garrett, Necessary and sufficient conditions for 
potentials of single and double layers, Am. Jour. of Math., vol. 58 (1936), pp. 95-129. 
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in the complement of A*. The integral is to be taken in the sense of Stieltjes- 
Radon.? The representation theorem is to the effect that for suitably chosen 
f(S, P) equation (1) sets up a one-to-one correspondence between the non- 
negative harmonic u(P) in A and the u(e) which vanish in — A*. 

We shall denote by W the entire finite plane. If M C W,then M, M*, — M 
will be respectively the closure, frontier, and complement of M. The symbols 
u(P), v(P), --- , ete., will always denote non-negative harmonic functions 
having as domain some bounded region (non-void, open, connected set). 

We recall certain notions and results of which we shall make frequent use.* 

For any bounded region A, let m,(e, P) = m(e, P) denote the mass distri- 
bution obtained by sweeping out a unit mass located at P « A onto A*. For 
a fixed P « A, m(e, P) is a non-negative completely additive function of sets e, 
measurable Borel; the total mass is 1 and is located upon A*. For a fixed 
e, m(e, P) is a non-negative harmonic function of P « A. If every boundary 
point of A is regular, the solution of the continuous Dirichlet problem for 
boundary values U(S) on A* is given by 


(2) u(P) = [. U(S)dm(es, P). 


The harmonic character in P of m(e, P) shows that for any two fixed points 
P, Po, m(e, P) and m(e, Po) are as set functions each absolutely continuous 
with respect to the other, for they vanish on exactly the same sets e. In particu- 
lar, this says that for any fixed Po, a derivative function [dm(e, P)]/[dm(e, Po)] 
exists. We shall eventually exhibit this derivative as a suitable choice for 
J(S, P). 

Suppose that the domain of u(P) contains or is a region A. We shall denote 
by Elu(P), A] = E(u, A) the set of all points S¢«A* for which lim u(P) > 0. 

P—S,PEA 
We may call E(u, A) the exceptional set for u(P) relative to A; it is the set of 
boundary points of A at which u(P) does not take on continuously the boundary 
value 0. It is clear that if E(u, A) is void, then u(P) vanishes identically 
throughout A. Furthermore, if the boundary points of A are all regular, then 
for every Borel set e we have® 


(3) E|m(e, P), A] C é. 


The non-negative harmonic functions defined in a region A form a normal 
family; in particular, any collection of them which is bounded at some particular 


2J. Radon, Theorie und Anwendungen der absolut additiven Mengenfunktionen, Wiener 
Sitzungsber., (1913), pp. 1295 ff. 

ide la Vallée Poussin, Extension de la méthode du balayage de Poincaré et problime 
de Dirichlet, Annales de Institut H. Poinearé, vol. 2 (1932), pp. 169 ff. The results are 
more special than those needed here but the general result is valid and is in fact contained 
implicitly in results of N. Wiener, Certain notions in potential theory, Journal of Math. 
and Phys. (M.I.T.), vol. 3 (1924), pp. 24 ff. 

* Radon, loc. cit., p. 1351. 

“de la Vallée Poussin, loc. cit. 
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point of A—and thus at each point of A—is compact in the sense that every 
infinite subcollection contains a pointwise convergent sequence. More gener- 
ally, if we speak of a family of subregions of A as eventually covering A whenever 
each closed subset of A is contained in all except possibly a finite number of 
these subregions, then again any infinite family of u(P), whose domains even- 
tually cover A and whose values are bounded at some point of A, contains a 
sequence pointwise convergent throughout A. In both cases the limit function 
is non-negative harmonic, and the convergence is uniform in every closed 
subregion of A.® 

We shall say that a region A is approximated by a nested family of subregions 
A,, Ao, --- , if (i) A, C Angi, (ii) lim A, = A. It is clear that such a family 


eventually covers A. 

Now let us turn to the representation problem. Consider the following 
conditions upon a bounded region A. 
(a) The classical continuous Dirichlet problem is solvable for A; that is, every 
boundary point of A is regular.’ 
(8) Every boundary point of A admits the so-called ‘principle of Picard’. That 
is to say, if So ¢ A* and E(u, A) = E(v, A) = {So}, then u(P) = c-v(P), where 
¢ is a positive constant. 
(y) For any closed set B C A*, the condition E(u, A) C B is a closed property 
of u(P), that is to say, any limit element of a set of u(P) having the property 
also has it. 

Let us first observe that for regions A satisfying (a), the condition (y) is 
equivalent to each of the following conditions: 
(y’) For any closed B C A*, the functions of a compact family of u(P) with 
E(u, A) © B are uniformly bounded near every S C A* — B. 
(y’’) For any closed B C A*, the functions of a compact family of u(?) with 
E(U, A) C B take on uniformly the boundary value 0 at every S « A* — B. 

It is evident that (y’) implies (y). (@) and (y’) imply (y’’). For (@) says 
that a barrier® Vs(P) exists at every S « A*, and (y’) says that if S « A* — B, 
we can majorize near S the u(P) of any compact family of u(P) for which 
E(U, A) C B by a suitable positive multiple of Vs(P). Finally, (y) implies 
(y’). If (y’) were not satisfied, there would be a convergent sequence of 
u(P) [E(u, A) C B] unbounded near some So « A* — B; thus a sequence of 
points P,, — So(P,, « A) and a convergent sequence of functions u,(P) [E(u, A) 
C B] such that u,(P,) > 2". It can readily be verified that the functions 


" 
v,(P) = > 2-u,.(P) would form a compact, increasing, therefore convergent 
m= 


sequence, and that they together with their limit function would violate the 


*A well known consequence of the Harnack inequality and the theorem of Ascoli. 
See for example P. Montel, Lecons sur les Familles Normales de Fonctions Analytiques, 
Paris, 1927, pp. 39 ff. 

70. D. Kellogg, Foundations of Potential Theory, Berlin, 1929, p. 328. 

* Kellogg, loc. cit., p. 327. 
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closure condition (vy). This completely establishes the equivalence of (y), 

(y’), (v’) under the assumption of (a). 

(e) f(S, P) = [am(e, P)]/[dm(e, Po)] can be defined for S « A*, P € A in such 

a way that it is continuous in S for fixed P and for fixed S is positive harmonic 

in P and equals unity at P = P». 

(¢) If So « A*, then lim f(So, P) = 0 uniformly for all S outside a neighbor- 
Ps 


hood of So. 

THeoreM 1. If A satisfies (a), (8) and (y), then (€) and (¢) are satisfied. 

We separate the conclusion of the theorem into these two parts for technical 
reasons that will soon be clear. 

In order to establish the theorem, let us consider a function f(S, P) defined 
for S « A*, P « A which (i) is positive harmonic in P and 1 at P = P» ; (ii) 
satisfies E{f(S, P), A} = {S}. If such a function exists, it is by (8) clearly 
unique; by (i), (ii) and (y”’) it must satisfy (¢); further, it must be continuous 
in S. To see the continuity, assume that S, — So (So, S, « A*). Let v(P) be 
any accumulation element of the sequence {f(S,, P)}. For n 2 m we have 


~ 
E(f(S,,P), A] CB, = :™ {Sx} + {So}. Application of (y) yields E(v, A) C B,, 
k=m 

for all m. But II B,, = {So}. Therefore v(Po) = 1 and E(v, A) = {So}. 
From (8) we get f(So, P) = v(P). This says that for any sequence S, — So 
there is a subsequence S, — Sp such that f(S., P) + f(So, P), which fact clearly 
implies continuity in S. 

All we have, therefore, to show is that a derivative [dm(e, P)|/[dm(e, P»)| 
exists satisfying (i) and (ii) above. 

It is convenient to compute the derivative over a net. Select in W rectangu- 
lar cartesian coOrdinates z, y. Form the quadratie net consisting of all half- 
open squares ¢* | 


. eset, rs» te 
2" ;= Qn 1 Qn = = Qn 1 
where p,q = 0, +1, +2,--- ;n = 1,2,---. Forany S ¢ W denote by e,(S) 


that ¢% , which contains S. Form the sum ¢ of alle? , 
Obviously m(eo, Po) = 0, and —eg C A*. 

Now —éo is dense in A*. Otherwise e9 would contain a subset ¢; relatively 
open in A*. There would then be a continuous V(S) defined in A*, positive 
in €;, and zero in A* — e,. The v(P) determined from the boundary values 


for which m(e",, Py) = 0. 


V(S) by means of (2) would vanish at P» and thus identically. This would 


contradict (a@). 

Now for any particular S ¢ —e, form the sequence of g,(S, P) = [m(e,(S), P)|/ 
Im(e,(S), Py)\. ga(S, P) is positive harmonic in P, 1 at P = Po, and Elg,(S, 
P), A| CeS). Let gS, P) be any accumulation element of the g,(S, P). 
Applying (7) and the facet that the ¢,(S) form a descending sequence of sets, 
we get Elg(S, ?), A} Ce,(S) for all n. Thus, since IT e,0S) = {S}, g(S, P) 
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satisfies (i) and (ii) when those conditions are restricted to points S «€ —e- 
Furthermore, g(S, P) is by its construction a derivative function® [dm(e, P)]/ 
[dm(e, Po)], and this property will not be destroyed by an arbitrary extension 
of the domain of g(S, P) to include points of the null set ¢. Extend g(S, P) 
to all of A* as follows. For any S ¢ A*eo, choose a sequence S, — S, where 
the S, lie in the dense set —eo. Define g(S, P) as some accumulation element 
of the sequence {g(S,, P)}. By an almost exact reproduction of the argument 
at the first part of the proof we get E[g(S, P), A] = {S}. Thus the extended 
g(S, P) satisfies (i) and (ii). The proof is therefore completed by taking 
S(S, P) = g(S, P). 

We may now deduce a number of immediate consequences of (€) and (¢). 

THEOREM 2. Suppose A satisfies (e). If B is Borel and CA*, and ule) is 
non-negative, completely additive, then 


(4) u(P) = | f(S, P)dules) 


represents a non-negative harmonic function in A. Further, if A satisfies (¢) and 
B is closed, then E(u, A) C B. 

These statements follow from the approximation to the integral on the right 
of (4) by Riemann sums. 

If A satisfies (€) and u(P) is representable by (1), where the f(S, P) of (1) 
is that of Theorem 1, we call u(P) representable. 

THeoreM 3. If A satisfies (€), the totality of representable u(P) form a closed 
class which contains every u(P) taking on continuous boundary values over A*. 

Suppose u,(P) — u(P), where 


(5) u(P) = [ I(S, P ey, (es). 


Since f(S, Po) = 1, we get from (5) u,(Po) = | du, (es) = w,(A*). Thus the 
° 


u»(e) have uniformly bounded total mass all contained in the compact set A*. 
A subsequence {u,,(e)} therefore has a weak limit u(e)," and we get 


u(P) = lim u,(P) = lim | S(S, P)du, es) 
(6) aes cial 
= lim | S(S, P) du. (es) = S(S, P)dules). 
no Ja® ° 


“A 


"For a net such as we have used the extension of the Vitali covering theorem to com- 
pletely additive set functions is readily established. From this one proceeds as with 
Lebesgue integrals. 

'© Radon, loc. cit., p. 1337. 
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This shows that the class of representable u(P) is closed. If u(P) takes on 
continuous boundary values U(S), we have 


u(P) = / U(S)dm(es, P) = J visa f(T, P)dm(er, 9) 


(7) - | U(S)f(S, P)dm(es, Po) = / f(S, P)d / U(T)dm(er, Pp) 
ae _\ es 
= / f(S, P)dyles), 
where we have put ule) = U(T)dm(er, Po). 


THeoreM 4. Jf A satisfies (€) and (¢) and if u(P) is representable, then the 
corresponding ule) is uniquely determined by u(P). 

It is sufficient to prove that the value of u(e) is determined for any closed 
subset @ of A*. Let eo be such a set. 

Let A,, Az, --- be a nested family of approximating regions for A, which 
have, say, analytic boundaries. Or at least let them all satisfy condition (a). 
Such a family always exists.'' We may assume that P» is in all A,. Let 
m,(e, P) = ma,(e, P). 

Define M(P) = lL.u.b. f(S, P). For a positive e define D, as the set of all 


S€ eo 


P ¢ A for which M(P) > «. D, is an open set. (¢) shows that D, cannot 
have a frontier point in A* — é ;1.e., that DA* Ceo. 
Now for S ¢ A* define 


(8) h,. (S) = [. f(S, P)dm,(er, Po). 
1D. 
We have 
(9) 0<h,.(S) < / S(S, P)dm,(er, Po) = f(S, Po) = 1. 
ad 

We now show that 

hk.(S) =l—e« (S €€), 
(10) lim hn.(S) < € (S«A* — é). 


For S ee, and Pe A* — A*D,, we have f(S, P) s M(P) s «. Thusif Seeo, 


h,.AS) = (| -| Jats, P)dm,(epr, Po) 
_ A*—A*D, 


>1 -| edm,(ep, Po) 21 —«, 
A A*D, 


and thus the first of relations (10) is true. 


" Kellogg, loc. cit., p. 319. 
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If S ¢ A* — eo, the set Cs,, of all P ¢ A for which f(S, P) > ¢€ can have only 
one frontier point in A*, namely, the point S. Thus Cs,, has points in common 
with only a finite number of the sets A*D,; otherwise S would be a limit 
point of D,. Hence if S « A* — eo, 


lim Ay,(S) < lim / f(S, P)dm,(ep, Po) 
A* Dg Cg 


no n--2 ) 


~~ Be 


| edm,(ep, Po) = «€. 


IA 


Thus (10) is established. 
Now form 


(11) Fas = / hy. (S)dules). 
Z 
We have 


lim T,.. < | lim hy.<(S)du(es) 


n-70 A* n—o@ 
e 


(12) -(f + | ) i aaa) 


< wleo) + eu(A* — e), 


and similarly 


(13) lim T,,, = / lim h,,.(S)du(es) = (1 — e)uleo). 
n—-2 4* n--+2 

(12) and (13) together show that 

14 ») = li lim T,..). 

” Hes) = tn (Tis on) 


Now, by Fubini’s theorem 


/ hy. (S)dules) = i || I(S, P)dm,(ep, ») | tute 
is ‘* "Dp, 
/ | I(s, Pye) Jamar, Po) 
“Dp, ‘* 
= / u(P)dm, (ep, Po). 
‘"p, 


The last expression does not explicitly involve u(e). Therefore this, together 
with (14), show that the value of u(eo) is determined by u(P). 

So far we have shown that in a region A satisfying (e) the u(/) representable 
by (1) form a closed class which contains all u(P) with continuous boundary 


Rice 


(15) 


values. If A also satisfies (¢), the representation of a u(P), if it exists, is unique. 
A fortiori these remarks are valid if A satisfies (a), (8), and (y). For a com- 
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pleted representation theory we should need the representability of every u(P). 
We shall not here investigate what minimum of conditions beyond (a), (8), (y) 
are sufficient to secure this result, but rather shall add to them a fourth con- 
dition (6) which we shall formulate presently. This procedure might appear 
somewhat unsatisfactory in view of the fact that, as will later turn out, the 
completed representation theorem under a very natural restriction implies (a) 
(8) and (y), whereas there would at best be considerable difficulty in showing 
that it also implies the condition (6). However, there is a very good technical 
justification for the condition (6); namely, a sound technique, designed to estab- 
lish (y) for a class of regions, quite frequently yields (6) as well, when suitably 
modified. 

We now formulate the condition (6). 

(6) There exists for A a nested family of approximating regions A;, Ae, --- , 
each satisfying (a), (8), (y), and their totality together with A fulfills the fol- 
lowing condition: Pp» being a fixed point of A, and B being any closed set, the 
u(P) which have as domain some A, , which are less than some fixed bound 
at P = Po», and which have their exceptional sets contained in B, are uni- 
formly bounded near any point not in B. 

It is clear that this condition states a kind of uniformity of the way in which 
the A, satisfy the condition (y) [in its equivalent form (y’)]. An application 
of the barrier condition analogous to that in the discussion of (y), (y’) shows 
that when A satisfies (a) and (6) and when there is given any convergent 
sequence of u(P) whose domains are among the A, , and eventually cover A, 
and whose exceptional sets are contained in B (closed), then the limit function 
of this sequence also has its exceptional set contained in B. 

Now let A satisfy (@), (8), (vy), and let Ay, Az, --- be a nested family of 
approximating regions satisfying (a), (8), (y). Select a fixed Po ¢ Ay. Form 
for A relative to P> the function f(S, P) of Theorem 1. Similarly form relative 
to P, for each A, the corresponding function f,(S, P). Form the set H = A* + 


> A*. His closed. Define 


n=1 
S(S, P) (Se A*, Pe A), 
(16) F(S, P) = 5 
f.AS, P) (Se A*, PeA,). 


THeoreM 5. If A satisfies (a), (8), (y) and (6), then: () there exists for A a 
nested family of approximating regions satisfying (a), (8), (y), such that the 
function F(S, P) defined above is for fixed P continuous in S ¢ H. 

To prove this, observe that as each A®* is at a positive distance from the rest 
of H, the only possible discontinuities (in S) of FCS, P) would be at points of 
A*, and these, if they existed, would be effective discontinuities only when 
approached over H — A*. Thus if F(S, P) were for some P; ¢ A discontinuous 
at Sy « A*, there would be a sequence S, —> So with S, « H — A* and lim F(S,, 


nn 


P,) A F(So, Pi). We could without loss of generality assume that P; € Ai 
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and that S, « A*. We could then let v(P) be an accumulation element of the 

sequence {f,(S,, P)}. By an argument like that used in the proof of Theorem 1 

we should have v(Po) = 1, E(v, A) = {So}, and thus o(P) = f(So, P). Hence 
lim F(S,, P,) = lim fASa, P,) = f(So, P,) = F(So, P,). 


ns n-—?2 


This would contradict lim F(S,, Pi) # F(So, P:). 


n—20 
It is clear that () implies (€) not only for A but for each of the approxi- 
mating regions A,.. 
THEOREM 6. (») implies the representability of every u(P) in A. 
For a fixed P « A; , F(S, P) is continuous in S over the compact set H. Sup- 
pose u(P) any non-negative harmonic function in A. Applying Theorem 3 to 
u(P), relative to the region A,, we get 


u(P) x [. SAS, P) du, (es), 


where the total mass of u,(e) is located upon A*, and »,(A*) = u(Po). In 
particular, for P ¢ A d we have 


u(P) = [Fs P) dun(es). 


The u,(e) have uniformly bounded total mass all contained in the compact 
set H. Hence a subsequence {y,,(e)} has a weak limit u(e). As each closed 
subset of A has points in common with only a finite number of A*, the total 
mass of u(e) must all be located upon A*. Therefore, for P € Ai, 


u(P) 


lim | F(S, P) ditn,(es) 
_ H 


k—+20 


[ F, P)dp(es) = / S(S, P) dples). 
H - 


Since, however, both sides of this equation represent harmonic functions through- 
out A, the equation must hold for all P ¢ A. 

We thus see that (») implies the representability of every u(P) by means 
of (1), where f(S, P) is taken as [dm(e, P)|/[dm(e, Po)|. (€) and (¢), 4 fortiori 
(nm) and (¢), guarantee that this representation is one-to-one. Under the as- 
sumption of (») and (¢) the representation has the further property, a conse- 
quence of Theorem 2, that a u(P) takes on continuously the boundary value 0 
at every point of A* where the corresponding yu(e) has no mass. (A mass 
function is said to have no mass at a given point if the point has a neighbor- 
hood of zero mass.) We may call a representation having this last property 
decomposable. The condition for decomposability can readily be put in the 
form: if B is closed, and u(e) is such that u(eB) = ule) identically in ¢, then 
for the corresponding u(P) we have E(u, A) C B. 

Tureorem 7. Jf A is such that an f(S, P) = [dm(e, P)|/[dm(e, Po)| represents 
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by (1) in a one-to-one way all positive harmonic functions in A, and if this repre- 
sentation is decomposable, then A satisfies (a), (8), and (y). 

Under these hypotheses f(S, P) is for each S ¢ A* a positive harmonic function 
of P. In fact, f(S, P) is the function corresponding to a u(e) due to a point 
mass located at S. 

Furthermore, the one-to-one character of the representation implies a certain 
converse to the condition of decomposability; namely, if u(P) is such that 
E(u, A) C B (closed) and y(e) is the corresponding mass function, then (eB) = 
u(e) identically ine. For assume B closed and CA*. Let wo(P) be such that 
E(u, A) C B, and Jet pole) be the corresponding mass function. Let e; be 
any closed subset of A* — B. Definé ui(e) = wo(ee), and let u,(P) correspond 
to wile). Clearly ui(ee:) = wile); hence by the decomposability E(u; , A) Ce. 
But since u;(e) S pole), we have u,(P) S uo(P), and therefore E(u, , A) C 
E(u, A) C B. Thus E(u, A) C Be,, which is void, and hence u,(P) = 0. 
Therefore yi(e) = 0. In particular, wo(e:) = wilei) = 0. As e; is any closed 
subset of A* — B, uo(A* — B) = 0. Hence pole) = po(eA*) = wole(A* — B)] + 
uoleB) = puol(eB). Now turn to the condition (a). Let e; be relatively open 


in A*. Put eg = A*—e;. Take uo(P) = m(e2, P) = / I(S, P)dm(es, Po) = 


| f(S, P)dm(ege2, Po) - | S(S, P)duoles). This wo(e) satisfies yo(e-e2) = pole). 
a* A* 


Hence E(up, A) C eg, and from this fact follows the solvability of the continuous 
Dirichlet problem.” 

For (8), observe simply that if E(u, A) = {So}, then the corresponding 
ule) satisfies u(e-{So}) = ule). In other words, this u(e) is a point mass at So. 
But two such mass distributions for the same point So are clearly multiples 
of one another, and the same must therefore be true of the corresponding 
harmonic functions. 

Finally, for (vy), assume that u,(P) — u(P) in A and E(u,, A) C B (closed). 
Let u,(e) correspond to u,(P). As u,.(A*) = u,(Po), the u,(e) have uniformly 
bounded total mass, and therefore a subsequence {y,,(e)} has a weak limit u(e). 
We therefore have 


u(P) = lim u,(P) = lim u,,(P) = im | S(S, P) dun,(es) 
n-ne k--e kn a* 


-| fis, P) dyes). 


By what we proved above, u,(eB) = u,(e). By the properties of a weak limit 
u(eB) = ule). Thus from the decomposability, E(u, A) C B. This con- 
cludes the proof, 

As a corollary to this result, we see that (») and (¢) imply (a), (8), and (y). 


'?de la Vallée Poussin, loe. cit., p. 208. The conclusions of the theorem in §44 are a 
special case of the statement above, and these conclusions are sufficient to give the solu- 


tion of the continuous boundary value problem (loc. cit., p. 205, §45) 
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We now apply these results to finitely multiply connected Jordan regions. 

First, however, consider a special case. Consider a circular region C of 
center Po and radius R. The swept out mass m(e, P) here has a continuous 
non-vanishing density with respect to are length on C*. Hence the derivative 
f(S, P) is given by forming the quotient of these densities for the two functions 
m(e, P), m(e, Po). It turns out that this quotient is R? — PP}/PS*, which 
is precisely the Poisson kernel. From this explicit expression it follows at once 
that C satisfies (e€) and (¢). Furthermore, if we choose as approximating 
regions concentric circles interior to C, then (») follows, again from the explicit 
expression. Thus there results a complete representation theory for the circle." 

We wish now to establish the conditions (a), (8), (y), and (6) for finitely 
multiply connected Jordan regions. This could be done quite readily by map- 
ping such a region conformally upon a region whose boundaries are all circles. 
In these circumstances the conditions (a), (8), (vy), and (6) would be invariant 
under the conformal mapping. For a region bounded by circles the swept out 
mass could be written down explicitly and use made of an argument similar 
to that above for the interior of a single circle. The approximating regions 
would be regions bounded by circles concentric with those of the original region. 
Since the approximating regions enter through (6) but disappear in the final 
representation theory, it is irrelevant that they are restricted. However, it is 
of interest to observe that (6) may be secured with much less stringent condi- 
tions on the approximating regions. It is for this reason that we choose a seem- 
ingly less obvious argument. 

Let A be a finitely multiply connected Jordan region. Let F,, --- , Fy be 
the components of A*. 

It follows from known results that such an A satisfies (@)." 

The above results show that (8) holds for a circular region, and a conformal 
mapping extends this to any simple Jordan region. A further extension to the 
present case may be argued as follows. Let So be a boundary point of A, 
say a point of F,. Let J be a simple Jordan are joining two points of F, in A 
and separating So from Ff, 4+ --- + Fxin A. There results a simple Jordan 
region A» having as boundary J plus a piece of F; containing So. 

Now let u(P) and v(P) be two harmonic functions in A with E(u, A) = 
E(v, A) = {So}. Denote by uo(P) the harmonic function in A whose boundary 
values agree with u(P) along J and are zero along A}F. Similarly, define 
vo(P) in terms of v(P).  uo(P) and vo(P) are bounded, whereas u(P) and v(P) 
cannot be.” Therefore E(u — uo, Ao) = E(w — v9, Ao) = {So}. Now (8) 

8 Cf. G. Herglotz, Ober Potenzreihen mit positivem reellem Teil im Einheitskreis, Ber. 
der Ges. Wiss. Leipzig, vol. 63 (1911), p. 501; G. C. Evans and H. E. Bray, La formule de 
Poisson et le probléme de Dirichlet, Comptes Rendus, vol. 176 (1923), p. 1868; G. C. Evans, 
Sur Uintégrale de Poisson, Comptes Rendus, vol. 177 (1923), p. 241. 

4 For example, for simple Jordan regions it follows from a conformal mapping on a 
circle. The fact that regularity is a local property extends this to the present case 

% A harmonic funetion in A whose lLu.b. is Wo < +0 has a superior limit > ent 
a set of positive capacity in A*. Cf. Wellogg, loe. cit., p. 3385 
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for Ag yields u(P) — w(P) = c-[v(P) — vo(P)]. Hence u(P) — c-v(P) = 
uP) — c-vo(P). The right side of the last equation represents a bounded 
harmonic function in Ao, the left side a harmonic function which takes on zero 
boundary values, except possibly at So. Therefore in Ao, and hence through- 
out A, u(P) — c-v(P) = 

As condition (y) will follow from an obvious modification—in fact, simplifi- 
cation—of the method used for (4), we establish (6). 

Let P —» P’ = ¢(P) be any topological mapping of A upon a closed region 
whose a consists of a finite number of circles."* As a convention, if 
M CA, then M’ will be ¢(M); if M’ C ¢(A), then M will be g"(M’). 


We oa that (6) holds when the approximating regions Ai, As, +++ are 
chosen as the (inverse) images of a nested family of regions A,, A5, --+ ap- 
proximating A’, their boundaries consisting of circles concentric with those 
of A’*. We make such a choice of Aj, Ao, ---. 


Let B be a closed subset of A, So a point of A* — A*B, Py a point of Ay. 
If (6) were false, we could assume without loss of generality the existence of a 
sequence {u,(P)}, with u,(P) defined in A,, u,(P) = 1, E(u,z, An) C B, and 
of a sequence |P,}, with P, € A,, un(P,) > n, Pn — So. 

Now the set of points P « A,, where u,(P) > n, will have a maximal con- 
nected open subset O, containing P,. For every P ¢ A,,O* we must have u,(P) 
= n; otherwise O, could be enlarged. Hence not all frontier points of O, can 
lie in A,, for we should then have u,(P) = n throughout O,. Obviously no 
point of A* — A*B could be a frontier point of O,. Therefore O*B cannot 
be void. This shows that there must be a point Q, ¢ O, distant < n~' from B, 
and a simple Jordan are J, joining P, and Q, in O,. Along J, we have u,(P) 
> n. Thus no closed subset of A can have points in common with more than 
a finite number of J,. 

Suppose F; to be that component of A* which contains So. We now say: 
there exists a simple Jordan are K joining two points of F; in A in such a way 
that (i) K, together with a piece of F,, bounds a simple Jordan region A» ; (ii) 
A,(B + So) = 0; (iii) there is a point R € Ao such that for infinitely many n 
it is true that a piece L, of J, divides Ao into two simple Jordan regions, one 
of which, D,, contains R and is included in A,. The frontier of D, thus 
consists of L, and a piece of K. 

The existence of such a K is readily established by carrying out the construc- 
tion in the image region—taking there, for instance, K’ (the eventual image of K) 
as a small are of circle about a point of F{. In fact, if we take K, and Ky as 
two sufficiently small ares of circle with respective centers on either side of S, 
in a connected relatively open patch of Ff; free of B’, then one or the other of 
K,, Ks can be used as K’. 


% B. Kerékjdrté, Topologie, Berlin, 1923, p. 121. 














REPRESENTATION OF POSITIVE HARMONIC FUNCTIONS 529 


For infinitely many n we therefore have 


u,»(R) -[ un(P)dmp, (ep, R) = | un(P)dmp,(ep, R) 
p* Ln 


(17) 
> / ndmp,(ep, R) = nmp,(La, R). 
Lan 
But 
(18) Mp,(Ln, R) = ma (AgF;, R) > 0. 


The last inequality is a consequence of the fact that m,,(e, R) may be computed 
by first sweeping out a unit mass at R onto D*, and then sweeping out the 
resulting mass on L, onto Aj. All the mass on A}F, is contributed by the 
second sweeping out. As total mass is conserved in sweeping out, the first 
part of the inequality follows. Furthermore, since A}F;, is of positive capacity, 
the second part of the inequality holds. 

Combining (17) and (18), we see that the u,(P) are not bounded at the in- 
terior point R. This contradiction establishes (6). 

If in the above argument we choose the A, all equal to A, and carry through 
the argument mutatis mutandis, (y) follows. 

Thus we have a completed representation theory for finitely multiply con- 
nected Jordan regions." 


INSTITUTE FOR ADVANCED Srupy. 


17 de la Vallée Poussin has proved a direct representation theorem for regions subject 
to the requirement that the irregular boundary points (i.e., points where the boundary 
curve is not of bounded curvature) constitute a perfect non-dense set in A*. Cf. Pro- 
priétés des fonctions harmoniques de deux variables dans une aire ouverte limitée par des 
lignes particulitres, Comptes Rendus, vol. 195 (1932), p. 92 ff. 











FUCHSIAN GROUPS AND TRANSITIVE HOROCYCLES: 
By Gustav A. HEDLUND 


Let U denote the unit circle in the complex z-plane and let © be its interior. 
The metric 


4| dz |? 


(0.1) be poke 
, * (1 = 2a)? 


zz<l1 

defines a hyperbolic geometry in ¥, the geodesics or hyperbolic lines of which 
are ares of circles orthogonal to U’. These hyperbolic lines will be designated 
as H-lines. The hyperbolic distance between two points of W is defined as 
J ds, where ds is given by (0.1) and the path of integration is the H-line 
segment joining these two points. The hyperbolic distance between P; and P,» 
will be denoted by H(P,, P:). The metric (0.1) is invariant under linear 
fractional transformations taking U into itself and W into itself and these 
transformations transform hyperbolic lines into hyperbolic lines. Hence hyper- 
bolic distance is invariant under all such transformations and these are the 
rigid motions of the geometry under consideration. 

The curves in ¥ of constant geodesic curvature fall into four groups according 
to their geometrical properties (see e.g. Carathéodory,' pp. 22-25). If we 
denote geodesic curvature by g., these classes are as follows: 

Class 1. g- = 0. These are the H-lines and are ares of circles orthogonal 
to U. 

Class2. 0<g. <1. These are the hypercycles. They are ares of euclidean 
circles each of which meets U in two distinct points. The angle at which these 
curves meet U is uniquely determined by g, and assumes all values between 
QO and 3x. The hypercycles are equidistant curves. That is, all the points of 
any given one are equidistant, in the hyperbolic sense, from the H-line which 
has the same end points on U. 

Class 3. g- = 1. These are the horocycles (oricycles). They are euclidean 
circles which are internally tangent to U’. 

Class4. g.->1. These are the hyperbolic circles and lie entirely interior to U. 
All points of any given one are at the same //-distance from a fixed point in W. 
These hyperbolic circles are also euclidean circles. 

Let F be a fuchsian group with U as principal circle. If points congruent 
under F are considered identical, a two-dimensional manifold M, of constant 
negative curvature, is defined. The combinatorial topological properties of M 
are determined by F. If, in particular, F has a fundamental region lying, 
together with its boundary, in ¥ and F contains no elliptic transformations, 


Received March 31, 1936. 
' The references are to the bibliography at the end of the paper. 
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M is a closed orientable Riemannian manifold of genus greater than one and 
without singularities. 

A curve of class C’ on M is transitive if its elements are everywhere dense 
among the totality of elements on M. The question of the existence of transi- 
tive H-lines on M has been treated extensively. It is known (Koebe, p. 349) 
that there exist transitive H-lines if F is a fuchsian group of the first kind 
(Ford, p. 68). These are the groups which cease to be discontinuous at all 
points of U. 

The H-lines are curves of constant geodesic curvature zero. What can be 
said with regard to the transitivity of curves of constant geodesic curvature 
not zero? This question is readily answered for the Classes 2 and 4 in the 
above classification. Class 4 is disposed of at once, for the curves in this class 
are closed, of finite length, and cannot be transitive. Any curve of Class 2, 
or hypercycle, is equidistant from a hyperbolic line and it is not difficult to 
show that a hypercycle is or is not transitive according as the H-line from 
which it is equidistant is or is not transitive. 

The question of the transitivity or intransitivity of the curves of Class 3 
or horocyeles is not answered so readily. Jt is the object of this paper to study 
the behavior of the horocycles, particularly with regard to transitivity. 

These results concerning the transitivity of the horocycles admit two 
applications. In the first place, it is possible to prove a mixture property of 
the flow defined by the hyperbolic lines on M. Secondly, there are derived 
properties concerned with the behavior of automorphic functions on circles 
internally tangent to U’. 


1. The existence of transitive horocycles. A horocycle is a euclidean circle 
which is internally tangent to U. It is completely determined by its euclidean 
radius and its point of contact with U. This point will be called the point at in- 
finity on the given horocycle. The horocycle with euclidean radius r, 0 <r < 1, 
and point at infinity Q will be denoted by C(Q, r). 

Two sets of points, within or on U, are congruent if there is a transformation 
of F taking one of these sets into the other. Either will be said to be a copy 
of the other. 

Let E denote the set (x, y, ¢), where 2? + y2 < land 0 < ¢ < 2x. Any such 
point determines a point P(z, y) of VW and a direction ¢ at this point, where 
directions at a point of V are measured in the counterclockwise sense from the 
direction through the point parallel to the positive axis of reals. Conversely, 
a point in VW and a direction at this point determine a point in EF. A point in ¥ 
and a direction at this point will be called an element. Thus E is the space of 
elements. To define neighborhoods in E, let pi(ai, y:, ¢:) be an arbitrary 
point of E and 6 an arbitrary positive number. Let N,, be the set of points 
(x, y, ¢) of B which satisfy the inequalities 


H(P, P,) < é, ifgf~—- vi f 2nr | _ é, 
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for some integral n, where P is the point (x, y) and P, is the point (2, y;). 
The set N,, defines a neighborhood of pi(xi, y:, ¢:). It is easily seen that E, 
with neighborhoods thus defined, is a Hausdorff space. 

Let C(Q, r) be a directed horocycle. It has been defined as transitive if its 
elements are everywhere dense among the totality of elements on M, the two- 
dimensional manifold defined by identifying congruent points. This is equiva- 
lent to the following definition of transitivity. 

DEFINITION 1.1. A directed horocycle C(Q, r) is transitive if the totality of 
elements on C(Q, r) and all its copies form a set which is everywhere dense in E. 

Let P be any point other than Q of the horocycle C(Q, r). The points P 
and Q divide C(Q, r) into two parts, both of which will be termed semihoro- 
cycles. A point P of ¥ and a point Q of U determine two semihorocycles on 
both of which Q is the point at infinity. To distinguish between these, let C 
be a small circle with P as center and let A be the point in which C intersects 
the H-ray PQ. If C is traced out in the counterclockwise sense beginning at A, 
it intersects one of the semihorocycles determined by P and Q first. This one 
will be called a right semihorocycle and will be denoted by SC,(P, Q), the other a 
left semihorocycle and denoted by SC,(P, Q). 

DEFINITION 1.2. <A directed semthorocycle is transitive if the totality of elements 
on it and on all its copies forms a set which is everywhere dense in E. 

It is evident that if a directed semihorocycle is transitive, the directed semi- 
horocycle obtained by the removal of any finite segment of the given one is 
also transitive. If a directed semihorocycle is transitive, the directed horo- 
cycle of which it is a part is also transitive. 

The remainder of this section is devoted to a proof of Theorem 1.1, which 
concerns the existence of transitive semihorocycles. A lemma which is of aid 
in the proof of this theorem is first derived. This lemma gives a criterion for 
determining when the transitivity of a set of semihorocycles implies the existence 
of an individual transitive semihorocycle. A set of semihorocycles is transitive 
if the totality of elements on all copies of all members of the set is everywhere 
dense in E. This, of course, does not necessarily imply that any single member 
of the set is transitive. 

Lemma 1.1. Let P be a point of ¥ and Q,Q2 an interval of U. Let QQ: be a 
subinterval of Q:Q. and SC,[P, (Q{Q:)] the set of directed left semihorocycles with P 
as initial point and with points at infinity in QiQ:. If the set SC,[P, (Q{Q3)] is 
transitive for every subinterval Q{ Qs of Q,Q2, there exists an infinite set of transitive 
left semihorocycles with P as initial point and with points at infinity everywhere 
dense in QiQe. The same result holds if left semihorocycles are replaced throughout 
by right semihorocycles. 

The neighborhoods N,, in E obtained by restricting the cobrdinates (2, y1, ¢1) 
of p; to rational values and likewise for 6 form a denumerable set of neighbor- 
hoods, N,, Ne,---. If a set of points in Z has points in each member of 
this denumerable set of neighborhoods, it is evidently an everywhere dense set. 

To prove the lemma, let Q{Q) be an arbitrary subinterval of Q,Q2. Since 
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it is assumed that the set SC,[P, (Q/Q:)] is transitive, there is in this set a 
directed semihorocycle with an element on it such that either this element or 
some copy of this element lies in N,. Let Q” be the point at infinity of this 
semihorocycle. Since the neighborhoods are open, there exists a closed interval 
Q7Q7 of U containing Q” and such that each semihorocycle of the set 


SC.IP, (Q7Q3)] 


either has on it an element of N; or an element with a copy in V,. But the 
set SC,[P, (Q7Q2)] is transitive, so that the argument can be repeated with Ny 
replaced by Ne. By repetition of this procedure, a sequence of closed intervals 
I,,n = 1, 2,---, of U is obtained with J;,, contained in J;, 7 = 1, 2,---, 
and each of the directed semihorocycles of the set SC,[P, /,,| either has on it 
elements or there are copies of its elements which lie in the neighborhoods 
Ni, Ne,---,N». This sequence of intervals has at least one point Q in 
common. But then the elements on the directed semihorocycle SC,[P, Q] and 
its copies form a set which has a point in each of the neighborhoods N, , Ne , 
This implies that this directed semihorocycle SC,[P, Q] is transitive. Since 
the interval Q'Q: was an arbitrary subinterval of Q:Q2, the points Q with the 
property that SC,[P, Q] is transitive form an everywhere dense set in Q;Qe. 
This is the statement of the lemma. 

If right semihorocycles are considered in place of left) semihorocycles, the 
proof is similar. 

There are fuchsian groups with Ul’ as principal circle for which none of the 
horocycles or semihorocycles is transitive. This is evidently true in the case 
of all fuchsian groups of the second kind (Ford, p. 68). These are the groups 
with limit points nowhere dense on U. 

But in the case of fuchsian groups of the first kind, that is, groups with limit 
points everywhere dense on U, it is possible to prove the existence of transitive 
semihorocycles, and hence of transitive horocycles. In this case certain results 
are known which will be useful in the following. The transformations of any 
fuchsian group F with principal cirele U’ are either hyperbolic with fixed points 
on U, parabolic with fixed point on U, or elliptic with fixed points inverse to U. 
If A and B are the fixed points on U of a hyperbolic transformation of F, the 
hyperbolic line AB is called the aris of the transformation. It will also be 
called a periodic hyperbolic line. It is known (Koebe, p. 349) that if F is of the 
first kind the periodic H-lines are everywhere dense among the totality of 
H-lines. This means that if /; and J; are arbitrary intervals of U’, there is an 
axis of a transformation of F having one end point in /; and the other end 
point in 7». 

THeoreM 1.1. Jf the group F is a fuchsian group of the first kind with principal 
cirele U', P is an arbitrary point of ¥ and Q,Qs is an arbitrary interval of U, there 
exist points C and D of Q,Qs such that SC,(P, C) and SCx(P, D) are both transitive. 

If it can be shown that, QQ. being an arbitrary interval of U’, the sets 
SC LLP, (QiQ2)] and SC,LP, (Q:Q2)| are both transitive, the theorem will follow 











534 GUSTAV A. HEDLUND 


from Lemma 1.1. It is sufficient to prove that the set SC,[P, (Q,Q2)] is transitive. 
The proof is similar for the set SCx[P, (Q,Q2)]. 

Let hag be an arbitrary periodic directed H-line with A as initial point at 
infinity and B as terminal point at infinity. There exists a hyperbolic trans- 
formation of the group F with one of its fixed points interior to Q:Q. and the 
other fixed point neither at A nor B. A properly chosen power of this trans- 
formation transforms hy, into hy» with A’ and B’ both interior to Q:Q.. 

Let Ro be a point of hy» and s the hyperbolic are length on hy» , s being 
measured from ) and taken as positive in the positive sense of hay. Each 
point of Ay-y is then uniquely specified by a coérdinate s, and R, will denote 
the point determined by s. If s is sufficiently large in numerical value, R, 
does not coincide with P and P and R, determine a unique directed left semi- 
horocycle SH,(P, Q,), with P as initial point, with R, on it and with Q, as 
point at infinity. As s becomes infinite in numerical value, Q, approaches 
either A’ or B’, so that for s sufficiently large, Q, lies in the interval Q:Q . 

It will be convenient to denote by {¢}, where ¢ is a real number, that unique 
number which satisfies the two conditions |g} = ¢, mod 27, and0 < {gy} < 2r. 

Let ¢, be the direction of hy, at R, and ¢! that of SH,(P, Q,) at the same 
point. There are two possibilities with regard to the behavior of {g! — ¢,} 


as s becomes infinite, depending on the order of A’ and B’ on Ul’. Either 
, . , e , e 
Casel. lim !¢, —¢,} = 32, lim {¢, — ¢,} = 31, or 
s+ s—>—o 
’ . , P . , 
Casell. lim {¢, —¢,} = 37, lim {g, — ¢,} = $n. 
x sO 


In either case, given 6, there exists an § such that Q; lies in Q:Q» and 
‘ d ’ 1 S 
‘Os — O35 — FF < 6. 


Let (2a, Ya, Ga) be an arbitrary element of hy». Since hy» is periodic, there 
exists an w such that all the elements (2aimu, Yasme, Gaim), mM = 0, +1,---, 
are congruent, and hence copies of (7., Ya, ¢a). From the preceding, given 
6 > O, there exists an m such that Qesico lies in Q,Qe and | {essins — Parme} — 
kr <6. Thisimplies that for some integer n, | grjinu — {asim + 3} + 2nr | 
<6. The elements (Zasmo, Yarme, {[Garme + 37}), m = 0, +1,---, are all 
copies Of (ta, Ya, {va + 32}); hence given any element (7a, Ya, ¢a) Of hag and 
a neighborhood N of the element (Za, Ya, {ga + 37}), there exists an element 
in N and either on one of the set SC,[P, (Q:Q2)] or on a copy of this set. But 
the element (7., Ya, ¢a) Was an arbitrary element on a copy of an arbitrary 
periodic H-line and the preceding result can be stated as follows. Given an 
element (z, y, ¢) of an arbitrary periodic hyperbolic line and an arbitrary neigh- 
borhood, N, of the element (2, y, {¢ + 37}), there is an element on a copy of a 
member of the set SC,[P, (Q,:Q.)| and in N. But the elements (zx, y, ¢) on the 
periodic H-lines are everywhere dense in E and the same must be true of the 
corresponding elements (x, y, |g + }3}). Thus the elements on the copies 
of the set SC,[P, (Q:Q2)] must be everywhere dense in EF. The proof of Theorem 


1.1 is complete. 




















FUCHSIAN GROUPS AND TRANSITIVE HOROCYCLES 535 


The following theorem is an immediate consequence of Theorem 1.1. 

THEOREM 1.2. Jf F is a fuchsian group of the first kind, there exists an infinite 
set of transitive directed horocycles through any point of V. The points at infinity 
on these transitive horocycles form an everywhere dense set on U. 


2. The number of transitive horocycles. The horocycle C(Q, r) is deter- 
mined by its point at infinity, Q, and its euclidean radius r. A directed horo- 
cycle will be called a right horocycle, Cx(Q, r), if the sense of rotation on it is 
clockwise. If counterclockwise, it will be designated as a left horocycle, C ,(Q, r). 
If a right horocycle is transitive, the left horocycle, which coincides with it 
except for sense, is transitive, and conversely. 

The following theorem shows that the transitivity or intransitivity of a 
directed horocycle is determined by its point at infinity and is independent of 
its euclidean radius. 

THEOREM 2.1. If one directed horocycle with Q as point at infinity is transitive, 
all the directed horocycles with Q as point at infinity are transitive. 

It is sufficient to restrict the discussion to right horocycles and to show that 
the transitivity of Ce(Q, 7:) implies that of Cx(Q, m2). Since the method of 
proof when 7; > 72: closely resembles that in the case r; < rz, the proof will be 
given only with the assumption that r; < 7. 

The two horocyeles Cx(Q, r:) and Cx(Q, rz) cut off equal hyperbolic lengths 
Ly on the set of H-lines with Q as point at infinity. Let po(re, ye, g) be an 
arbitrary point of E. There is a unique right horocycle C,(Q’, r;) having Pe 
as an element of it. Let Cx(Q’, r{), r, <7rs, be the right horocycle such that 
C(Q’, ri) and C(Q’, r3) cut off equal hyperbolic lengths Ly. on the hyperbolic 
lines with Q’ as point at infinity. Let pi(x., y1, ¢1) be the element of Cx(Q’, r;) 
at the point P;(2,, y:) where it intersects the hyperbolic line determined by 
P2(x2, yo) and Q’. Since Cx(Q, 7:) is assumed to be transitive, there exists a 


sequence of elements e}, n = 1, 2,---, which are copies of elements e,, 
n = 1, 2,---, of Ce(Q, ) and are such that lim e} = pi(m, mi, o). If T 
n--2 

denotes the transformation of F taking e, into e+, n = 1, 2, --- , the sequence 
TACK(Q, m1)] = Ce(Qn, rin), n = 1, 2,---, evidently has the properties 
. . , + *,¢ . 

lim Q, = Q’ and lim nr, = r,. Under such conditions the right horocycle 
na n--2o 


U . . ° —e ° . . 

C,(Q’, r,) is said to be the limiting right horocycle of the sequence, and this is 
. . . , 7“ ° = 

written lim Cx(Q,, rin) = C(Q, 7r,). Since the transformations of F preserve 


n-?o 

hyperbolic distances, the sequence 7',[C x(Q, r2)] must have Cx(Q’, rs) as limiting 
right horocycle. This implies the existence of a sequence of copies of elements 
of Cx(Q, 72) having the element po(r2, ye, ¢2) as limit element. But since p» 
was an arbitrary element in FE, Cx(Q, rz) is transitive. 

Theorem 2.1 suggests a classification of the points of U. The point Q of U 
is h-transitive if all the directed horocycles with Q as point at infinity are transi- 
tive. The point Q of U is h-intransitive if none of the directed horocycles with Q 
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as point at infinity is transitive. From Theorem 2.1, all points of U are con- 
tained in these two categories. 

As to the number of h-transitive points of U, so far it is only known that 
they form an infinite set. As a step in specifying precisely which points of U 
are h-transitive, the following theorem is derived. 

THEOREM 2.2. If F is a fuchsian group of the first kind, the end points of all 
axes of (hyperbolic) transformations of F are h-transitive. 

It is again sufficient to restrict the discussion entirely to right horocycles. 

Let hy» be the axis of a hyperbolic transformation Tz of F. If P is a finite 
point of hy» and P’ is the point T,(P), the hyperbolic distance w between P 
and P’ does not depend on how P is chosen on h4g. Under the transforma- 


tions T,, n = 0, +1, ---, the right horocycle Cg(A, 7) is transformed into 
a set Cy(A, r,), n = 0, +1, ---, all with A as point at infinity. The pairs of 


right horocycles Cx(A, r,) and Ce(A, rays), n = O, £1,---, cut off equal 
hyperbolic lengths » on all hyperbolic lines with A as point at infinity, hence, 
in particular, on the H-lme through A and the origin. If a denotes the 
euclidean distance from the origin of a point at H-distance w from the origin, 
there is a Cy(A, r) in the set C,(A, r,), n = 0, +1, --- , such that 3(1 — a) < 
rs 3. If A’ is a point of U which is a copy of A, the transformation of F 
taking A into A’ transforms the set C,(A, r,), n = 0, +1, --- , into an infinite 
set of right horocycles with A’ as point at infinity. Using the same argument 
on the new set, there is at least one member C,(A’, 7’) of it such that $(1 — a) < 
r’ Ss 3, and C,(A’, r’) is a copy of Cx(A, 7). 

From Theorem 1.2, there exists an h-transitive point Q of U. Since F is a 
fuchsian group of the first kind, there are hyperbolic transformations of F' with 
fixed points arbitrarily close to Q, and hence there are copies of A arbitrarily 
close to Q. Let Ag = A, Ai, Ao, --- be a sequence of copies of A such that 
lim A, = Q. If Ce(A, 7) is an arbitrary right horocycle with A as point at 


infinity, it has been shown that there exists a copy C,(A,, r,), n = 0,1, ---, 
such that 4(1 — a) <r’ < }. The set of numbers r,, n = 0, 1, ---, has at 
least one cluster value, 7, (1 — a) S # S 3; hence there exists a subsequence 
CLA. rads i = 1, 2,---, of the set C,(A,, r,), n = 0, 1, ---, such that 


lim A,, = Q, and lim re, = 7. The sequence C,(A,,, ray i=1,2,.---, has 
the right horocycle C2(Q, 7) as limiting right horocycle. The elements on the 
set C,(A,,, 7.,), 1 = 1, 2,---, have among their limit elements all elements 


of C,(Q, 7). But the elements on C,(Q, 7) and its copies are everywhere dense 
in E, and hence the same must be true of the elements on the set Ce(A,;, Pr), 
7 = 1, 2,---, and on the copies of the members of this set. All such copies 
are copies of Cy(A, ro) and C,(A, rm) must be transitive. This is the statement 
of Theorem 2.2. 


Theorem 2.2 does not yield further information as to the number of h-transi- 
tive points of VU. The end points of the axes form a denumerable set every- 
where dense on (’, but this set might coincide with the denumerable everywhere 
dense set of h-transitive points previously known to exist. 
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However, with the aid of Theorem 2.2 it is possible to give criteria for 
h-transitivity which immediately yield extensive results. 

THEOREM 2.3. If F is a fuchsian group of the first kind and there are copies 
of the horocycle C(Q, r) with radii arbitrarily close to 1, Q is h-transitive. 

Again, the proof can be given with consideration of only right horocycles. 
From a sequence of copies of Ce(Q, r) with radii approaching 1, a subsequence 
Cr(Q,, Tn), n = 1, 2, --- , can be chosen such that lim Q, = Q and lim r, = 1, 


n+ n> 
where Q is a point of U. Let has be an axis of a transformation of F such that 
A #Qand B # Q. The fact that F is of the first kind implies the existence 
of such an axis. For all values of n sufficiently great, Ce(Q,, 7.) intersects his 
in two points and the angle of intersection of Cx(Q,, rn.) and A,» at either of 
these points approaches }z as n becomes infinite. But if we use w as previously 
defined, any point of hy» is seen to have a copy in a fixed interval of hys of 
hyperbolic length w. Hence there exists a sequence of copies of Cx(Q, r), each 
of which intersects h4s in a fixed interval and such that the angle of intersection 
approaches 37. This last implies that the points at infinity of the members 
of this sequence must either approach A or approach B. A subsequence 
CA(Q;, r’), n = 1, 2,---, can be so chosen that lim r, = r* > 0 and either 


no 


° , . , . a he) 
lim Q,, = AorlimQ, = B. In either case the sequence has, from Theorem 2.2, 


a transitive right horocycle as limiting right horocycle and by the reasoning 
used in the proof of Theorem 2.2, Ce(Q, 7) must be transitive. This is identical 
with the statement that Q is h-transitive, thus proving Theorem 2.3. 
THEOREM 2.4. Let F be a fuchsian group of the first kind, Q a point of U and hog 
the hyperbolic ray with the origin O as initial point and with Q as point at infinity. 
If there exists on hog a sequence of points Op, O,, --- , such that lim H(O, 0,) = 


nn? 
+c and such that O, has a copy O.,n = 0,1,.-.-- , with H(O, 01) bounded, 
n arbitrary, Q is h-transitive. 

Consider the horocyele C(Q, 3), which passes through O and contains hog. 
Given L > 0, arbitrarily large, there exists a K such that the H-distance 
between Ox and any point of C(Q, 3) exceeds L. Let Tx denote the trans- 
formation of F taking O, into O; and let H’ be an upper bound of the distances 
H(O, O01). Assuming that L has been chosen greater than H’, the H-distance 
from the origin to any point of the horocycle T,,{[C(Q, })] is not less than L — H’. 
But since L can be chosen arbitrarily large, this implies that there are copies 
of C(Q, 3) with radii arbitrarily close to 1. From Theorem 2.3, Q is h-transitive. 

With the aid of the criterion of Theorem 2.4, the A-transitivity of a large 
class of points of U is readily shown. 

THeoreM 2.5. If F has a fundamental region Ro which, together with its 
boundary, lies entirely interior to U, all points of U are h-transitive. 

Under the hypothesis of the theorem, F must be of the first kind. This 
theorem is then an immediate consequence of Theorem 2.4, for any point of ¥ 
has a copy in Ry, hence at an H-distance from the origin less than a fixed con- 
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stant. Thus if F has a fundamental region which is bounded, in the hyperbolic 
sense, all horocycles are transitive. 

What can happen if F is not so restricted, but is still of the first kind? It is 
easily seen that all points of LU’ are no longer necessarily h-transitive. For F 
may contain parabolic transformations, and if Q is a fixed point of a parabolic 
transformation, and hence on U, all the horocycles with Q as point at infinity 
are periodic and cannot be transitive. The periodicity follows from the fact 
that a parabolic transformation with F as fixed point transforms each C(Q, r) 
into itself. 

But are these fixed points of parabolic transformations of F the only points 
of ( which are not h-transitive? The answer can be shown to be in the affirma- 
tive in those cases where the fundamental region, Ry , has as boundary points 
on l only parabolic points. 

THEOREM 2.6. Jf.F is of the first kind and if the only boundary points of R 
on ( are parabolic points, all points of LU’, with the exception of those which are 
fixed points of parabolic transformations of F, are h-transitive. 

From the hypothesis of the theorem, F has a finite set of generators and Ry 
a finite set of sides (Ford, p. 75) and thus the boundary points of Ry on U' must 


form a finite set P;,---,P,. If the radii r,;, 7 = 1, ---,m, of the horo- 
cycles C(P;, 7), 7 = 1, --- , m, are chosen sufficiently near 1, it is geometrically 
evident that any point of Ry or its boundary and interior to U’ will be interior 
to some one of the set C(P;, rj, 7 = 1, 2,---, m. Denoting by C the set 


of horocycles consisting of C(P;, r:), 7 = 1, --- , m, and all copies of these, any 
point in ¥ is interior to some member of the set C. 

There exists a parabolic transformation T; of F, with fixed point P;, 
7 = 1,---, m. Hence each point of C(P;, r,), with the exception of P;, 
has a copy within H-distance D; of the origin O, where D; depends on C(P; , 73) 
and not on the chosen point on it. Let D be a constant as great as any D;, 
7 = 1,---,m. Any point of the set C which is not a point of U has a copy 
within H-distance D of the origin. 

Now let Q be any point of U which does not belong to the set Sp consisting 
of Py, ---, P,, and the copies of these points. If Q’ is any point other than Q 
of the H-ray OQ, it lies interior to one of the horocycles of the set C. But the 
ray Q’Q cannot lie entirely in any one member of the set C, for this would 
imply that Q@ belonged to the set Sp. Hence Q’Q must intersect one of this 
set of horocycles and has on it a point Q” with a copy within //-distance D of 
the origin. From Theorem 2.4, the point Q is h-transitive. 

If F is of the first kind, but has an infinite set of generators, Theorem 2.4 
can be applied to prove the existence of at least a non-denumerable infinity of 
h-transitive points on ©. For if Q is a point of U such that the geodesic rays 
with Q as point at infinity are transitive, the conditions of Theorem 2.4 are 
satisfied and Q is h-transitive. It is not difficult to show that a non-denumerable 
eet of points of U have this property with regard to the geodesic rays. The 


precise analysis of h-transitivity in these cases requires further study, however. 
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3. Asymptotic transitivity. With the aid of the derived theorems concern- 
ing the transitivity of the horocycles, an interesting property of the flow 
defined by the hyperbolic lines on the manifold M can be shown to hold. To 
define the flow, we again consider the space E of elements (x, y, ¢). Two such 
elements, (x, y, ¢) and (2’, y’, ¢’), are congruent if there is a transformation of F 
taking P(x, y) into P’(2’, y’) and the direction ¢ at P into the direction ¢’ at P’. 
Let E be the space obtained from E by considering congruent elements identical. 
Neighborhoods are defined in E by the definition of neighborhoods in EF (Seifert- 
Threlfall, pp. 31-35). The space E is essentially the space of elements on M. 

Let A(X, Y, &) be a point of EZ. This point is a set of elements and let 
(x, y, ¢) be an arbitrary element of the set. The element (x, y, ¢) defines a 
directed H-ray, r;, , namely, that one with (2, y) as initial point and with direction 
¢ at that point. On r, let (x,, y.) be the point at hyperbolic distance s from 
(x, y) and let ¢, be the direction of 7, at (a, y,). The element (2,, y., ¢.) 
and those congruent to it define a point A,(X,, ¥Y,, ®,) of FE. The transforma- 
tion or flow A — A, of E into itself is evidently one-to-one and continuous, 
and depends continuously on the parameter s. If N denotes a point set of F, 
the transformation A — A, transforms N into a set N,. 

DEFINITION 3.1. The flow A — A, is asymptotically transitive (O) if, N and N* 
being arbitrary open sets of E, there exists an S* such that the set N,-N*,s > S*, 
is not empty. 

THeoreEM 3.1. Jf F is of the first kind, the flow A — A, ts asymptotically 
transitive (O). 

It will be convenient to use the notation ¢ to denote the member of the 
set |g + 2nr |, n = 0, +1, --- , which has the least numerical value. 

Let ACY, Y, #) be an arbitrary point of N. This point determines an 
infinite set of points of E and let (#, J, 2) be one of these. Since N is open, 
there exists a 6 > 0 such that all the elements (%, 9, ¢), ¢ — @ < 4, deter- 
mine points of N. The elements (%., 9, ¢.) are defined as before, ¢ being 
restricted to the set || ¢ — @/|) <6. The points (%, , j,) thus defined form, for 
fixed s, an are C, of a hyperbolic circle with (%, 9) as (non-euclidean) center. 
Let C, be directed by directing the hyperbolic cirele of which it is a part in 
the clockwise sense. The elements of C, are given by the set (2,9. , |e. — de! ), 

¢—@|| <4. 

Let (X, ¥, & — $x) be the point of E defined by the set (24, y, te — 44}), 
where (2, y, ¢) is the set of B defining (XY, Y, ®). Let N* be the open set of E 
obtained from the set N* by replacing the points (XY, ¥, &) of N* by the corre- 
sponding set (Y, ¥, @® — $9). If it can be shown that there exists an S* such 
that all C,, s > S*, determine sets of E with a point lying in N*, the desired 
theorem is proved. For then the elements (2, 9, ¢., 8 > S*, determine a set 
of F with a point lying in N*. 

‘To complete the proof, let Q) be the point at infinity on the //-ray determined 
by the element (2, 9, }@ — 4}) and let Qe be that on the //-ray determined by 
(@, J, }@ + 4f). Let QiQe be the interval of U consisting of the points at 
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infinity on the H-rays determined by the elements (%, 9, ¢), | ¢ — @) < 4. 
Since F is of the first kind, there exists an axis hy» of a transformation 7 of F 
with B and D both interior to Q,Q2. Let T be such that it moves points away 
from B and towards D. Let P; be a finite point of hy» and P2 the point T(P,). 
Any finite point of hyp has a copy in the interval P; P2, this copy being obtained 
by applying a properly chosen power of T. 

For all s sufficiently large, C, intersects hyo in two points B! and D/, the 
notation being so chosen that B! lies between D! and B on hgp. As s becomes 
infinite, the angle of intersection of C, and hgp at both B, and D‘ approaches 37. 

If n is chosen properly, the transformation 7” transforms B/ into a point BY 
in the interval P,P: of hap. As s becomes infinite, the power of T required 
to transform B¢ into B” also becomes infinite and under these increasing powers 
of T the ends of C, are transformed into points which approach D, in the 
euclidean sense. Thus, given « > 0, there exists an S* such that for s > S* 
there is a copy, C’,, of C, intersecting the segment P,P: of hg» in BY, at an angle 
which differs from 32 by less than ¢ and with the end points of this C’” within 
euclidean distance « of D. This copy is, of course, an are of a euclidean, as 
well as a hyperbolic, circle. 

Now consider the right horocycle Ce(B”’, D), B’ a point of the interval 
P,P: of hen. By Theorem 2.2, Ce(B”, D) is transitive, so that it has on it an 
element which determines a point of E in the set N*. Let C” be an are of a 
euclidean circle directed in the counterclockwise sense, C’’ lying in V, passing 
through B” and having its end points near D. If ¢€ is chosen sufficiently small 
and two conditions are fulfilled, namely, that the end points of C” are within 
euclidean distance « of D and the angle at which C”’ intersects hgp differs from 3x 
by less than ¢, the are C” will have an element determining a point of E lying 
in N*. This is evident because C” then approximates closely a large segment 
of C,(B”, D) and N* is open. If C” satisfies the same conditions, with the 
exception that it no longer necessarily intersects P;P2: in B”, but in a suffi- 
ciently small interval containing B’’, it will still have an element determining 
a point of E lying in N*. By the Heine-Borel theorem, there exists an ¢ such 
that each such directed are C”’, with end points within distance « of D, inter- 
secting hg» in any point of PP, and with angle of intersection with hy» differing 
from 3x by less than ¢, will have an element determining a point of E in N*. 

But from the preceding, there exists an S* such that every C,, s > S*, has 
a copy C”’ satisfying these conditions on C’”, hence C,, s > S*, determines a 
set of points of EF containing a point in N*. This completes the proof of 
Theorem 3.1. 


4. Double /-transitivity and automorphic functions. Before taking into 
consideration the behavior of a funetion, automorphic with respect to F, on 
circular ares internally tangent to U’, it is desirable to extend the notion of 
h-transitivity. 

Derinition 4.1. The directed horocycle C(Q, r) is doubly h-transitive if both 
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of the directed semihorocycles SCx(P, Q) and SC,(P, Q) into which a point P 
of C(Q, r) divides C(Q, r) are both transitive. 

TueoreM 4.1. If one directed horocycle with Q as point at infinity is doubly 
h-transitive, all the directed horocycles with Q as point at infinity are doubly 
h-transitive. 

The proof of this theorem parallels that of Theorem 2.1 so closely that it 
does not seem necessary to give the details. It is a rather obvious consequence 
of the fact that two right (left) semihorocycles with the same point at infinity 
are, except possibly for a finite segment of one, equidistant curves, and if the 
elements of one determine a set which is everywhere dense in FE, the same must 
be true of the other. 

Dertnition 4.2. The point Q of U is doubly h-transitive if all the directed 
horocycles with Q as point at infinity are doubly h-transitive. 

This definition has significance because of Theorem 4.1. 

Theorem 2.4 can be extended to double h-transitivity. 

THEoreM 4.2. Let F be a fuchsian group of the first kind, Q a point of U 
and hpg a hyperbolic ray with P, a point of V, as initial point, and with Q as point 
at infinity. If there exists on hpe a sequence of points P,, P2,--- such that 
lim H(P, P,.) = +, and such that each of these points P,, has a copy P|, with 


n~?e 
H(O, P!) bounded, n arbitrary, Q is doubly h-transitive. 

The proof will be given considering only directed left semihorocycles, since 
the proof for right semihorocycles is entirely similar. It will be sufficient to 
show that the directed left semihorocycle SC,(P, Q) with initial point P is 
transitive, for then every directed left semihorocycle with Q as point at infinity 
is transitive. 

Since P,, has as copy P_ , the directed H-ray PQ has a copy passing through 
P!. Let e, be the element of this copy at the point P;, and let 7, be the trans- 
formation of F taking P, into P!. Since H(O, P!) is bounded, a subsequence 
ce” n = 1,2,---, of the sequence ¢,, n = 1, 2, --- , ean be chosen such that 
lim ¢, = e, where ¢ is an element of the set E. Let Q, be the initial point at 


infinity of the directed hyperbolic line determined by ¢ and let Q2 be its terminal 
point at infinity. Then lim 7,(P) = Q; and lim T,(Q) = Q:, both in the 


n~*o LT ed) 
euclidean sense. 

Let Q:Qo(~*) be the segment of U traced out by a point starting at Q,, 
tracing (’ in the counterclockwise sense, and terminating at Q.. Since F is 
of the first kind, there is an axis hy, of a transformation of F, with both A and B 
in Q:Q(—*). For n sufficiently large, T,[SC.(P, Q)] intersects has in two 
points and as n becomes infinite, the angle of intersection at both of these points 
approaches }r. Denoting by 7 the transformation of F of which hy» is an 
axis, let D be a point in ¥ and on hy, and let D = T(D). By applying « proper 
power of 7, there exists a copy of any curve in ¥ and intersecting hys such that 
the copy intersects hy» at some point of the interval DD. Hence, as we see 
by choosing n sufficiently large, there is a copy of SC;,(P, Q) intersecting the 
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interval DD of hy, at an angle nearly 47 and with the end points of this copy 
near B, in the euclidean sense. A sequence of these copies can be chosen 
such that the points of intersection of these copies with DD approach a point D’ 
of DD as limit point, such that the angle of intersection approaches }2 and such 
that the end points of these copies approach B. But then there are copies of 
elements of SC,(P, Q) with any given element of the left horocycle C,(D’, B) 
as limit element. Since C,(D’, B) is transitive, by Theorem 2.2, SCi(P, Q) 
is also transitive. 

By proofs entirely analogous to those of Theorems 2.5 and 2.6, the following 
two theorems are obtained. 

THroreM 4.3. Jf F has a fundamental region Ry which, together with its 
boundary, lies entirely interior to U, all points of U are doubly h-transitive. 

TueoreM 4.4. If F is of the first kind and Ro has only parabolic points on U, 
all points of U, with the exception of those which are fixed points of parabolic trans- 
formations of F, are doubly h-transitive. 

These theorems can be interpreted in terms of automorphic, or, in the case 
under consideration, fuchsian functions (Ford, p. 87). 

Tueorem 4.5. Let F(z) be a fuchsian function with group F of the first kind. 
Let C be a circle internally tangent to U at P and PQ any arc of C with Q as one 
end point. If F has a fundamental region Ry which, together with its boundary, 
lies entirely interior to U, the values which f(z) assumes on PQ are everywhere 
dense among the totality of values assumed by f(z) in Vv. If F is of the first kind 
and Ry has only parabolic points on U, the same property of f(z) holds on any 
such are PQ, provided Q is not a fixed point of a parabolic transformation. 
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STUDIES IN THE SUMMABILITY OF FOURIER SERIES BY 
NORLUND MEANS 


By Max AsTrRACHAN 
I. Preliminary remarks and formulas 


1. Introduction. In a paper published in 1932, E. Hille and J. D. Tamarkin 
[3]' discussed the application of Nérlund means to the summation of Fourier 
series and certain associated series. They gave conditions for the effectiveness 
of the method in certain senses. 

It is our purpose to consider further effectiveness problems of the theory for 
this method of summability. In particular, we shall consider its effectiveness 
for the summation of the Fourier series and conjugate Fourier series at points 
of “(C, a) continuity”, and for the summation of the r-th derived series of 
both of these series. We shall also consider the strong summability of the two 
series when the partial sums are replaced by their Nérlund transforms. 


2. Noérlund means. For a sequence {x,}, the generalized Nérlund limit 
(if it exists) is defined as 
(2.01) (N, p,)-lim x, = lim P7'(p,.% + Pyi% + ++: + Porn); 
n--o 
where {p,} is a sequence of complex numbers such that P,, = po + pr + <--> + 
pn # 0. The conditions of regularity are 


(2.02) > |m| <CiP, |, p,/P,, > 9, 
k=0 
where C is a fixed positive constant. 

N. E. Nérlund [8] proved some properties of these means assuming p, > 0 
and p,/P, — 0. Such a definition of limitation, however, had already been 
given by G. F. Woronoi [17], who assumed that p, > 0 and that n-*P, is bounded 
for some value of a. We shall use the symbol (N, p,) to denote the Nérlund 
method of summation defined by the sequence {p,{. If 


— e+n—1 p,~(*t") 
Pr n ’ n n ’ 


the corresponding method (N, p,) reduces to the Cesaro method (C, ¢). 


3. Notation. We shall consider functions f(x) integrable in the sense of 
Lebesgue and periodic of period 27. If 


a, = | S(® cos nt dt, b, = . f(0 sin nt dt, 
v © 


T 
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then f(x) generates the Fourier-Lebesgue series 


a 
(3.01) kag + >> (a, cos nz + by sin nz), 
n=1 
whose conjugate series is 
aa 
(3.02) (—b, cos nr + a, sin nz). 
n=1 


We designate by s,(x) and &,(x) the n-th partial sums of the series (3.01) 
and (3.02), respectively. The corresponding Noérlund transforms will be de- 
noted by N,[f(x), p,] and N,,[f(x), p,]; and those of the r-th derivatives of the 
partial sums by N\’[f(x), p,] and NS’’[f(x), p,]. Further, we set 


(3.03) ot) = f(x +0 + f(x — 0d) — f(x); 
1 t 
(3.04) gi) = = (t — u)* ou) du (a > 0); 
P(e) Jo 
(3.05) V(t) =f(lr +0 — f(a — 0); 
(3.06) ¥At) = c. | (t— u)™'Y(u) du (a > 0); 
l(a) Jo 
re . t « ibs 
(3.07) j(x) = —5- lim | Y(t) cot 3 dt; 
“<7 e 7 e 
(3.08) $(t) =f(la+0+ (-D*Yf(e -— 0; 
M, si — COS 2 
(3.09) H(t) = COs ot veooke n+ at. 
sin 3¢ 
: l sin (k + $)t 
(: 3 0 N nf = a n—/, . = 
ee 9 2rP,, >» , sin 3¢ 
(3.11) N,(t) = > pr H(t) = = 5 cot 4t + N,(0. 


ae k=0 


Finally, we shall denote by N.’’(t), N‘\"'(t), and N‘S’(t), the results obtained by 
differentiating N,(t), V(t), and N, (0), , seupectivell, r times with respect to ¢t. 


4. The Norlund transforms. It is well known that 


(4.01) s,(zr) = ‘ | f(t) - sin (n + s(t — xr) dt 
T -r 


sin A(t — 2) 


and 


(4.02) &,(x) = Le | SOOH Aa — 0 dt. 
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Integrating (4.01) by parts and using (3.03), (3.04) with @ = 1, and (3.10), 
we have by (2.01) that 


Nalsle), pal — ste) = [onal at = 2D F yy a1) 


ail) ; % ne (be i 
~ 2eP, Jo sin Y p> kpn« cos (k + $)tdt 


1 ie gilt) . 1 
— n—k COS (Kk + $)td 
InP, i sin pe Pn—« cos (k + $)tdt 


+ 3 | gi(t) cot 3¢.N,,(t) dt 
0 


(4.03) =], a Le — L3 + Ly. 
Similarly, from (4.02) we have 
, vo 


2r =P, |i «0 ; sin te k=0 


+5 - - ee 7? kp, sin (k + 4)tdt 
Tin Je 
1 


N,Lf(z), p.] — se Pn—« Cos (k + 4)e 





) sin 4 k=0 


0) } Pn—« sin (k + $)tdt 


+ 4rP,, 0 sin ht 


— }lim " W(t) cot 4t.N,,(t) dt 


(4.04) = Ly + | P + Ls aa | ‘ 


Differentiating (4.01) r times with respect to 2, we have by (2.01) with the 
notation of (3.10) that 


N\ (f(x), p] = I f(t) i N,(t — x) dt 

Ox’ 
(4.05) 
= (—1) [ [fla + 0 + (-—1)' fla — ONS at. 


Similarly, from (4.02), 


(4.06) NY Uf(x), p.| = (pf & (tN (0) dt. 
0 


5. The (N, p,)-C,; method. This method of limitation is obtained by super- 
imposing the method (N, p,) on the Cesiro means of order one. It is well 
known that if {o,(7){ is the sequence of arithmetic means of the sequence 
is,(r)}, then 


1 ‘. sin 4(n + 1)? 
nn I = P _ a in . ; 
onlr) — fiz) 4(n + 1)r I (| sin }f 
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Integration by parts gives 


_——— v 1 [* e(t) sin (nm + 4)t 
Me «T= eager e+ hse | at ay 
1 








1 ay wet ay] SO FD HF 
Y ten + 5 | ar(t) cot u| sin ¥ dt 
(5.01) Cy(n) + C2(n) 4 C;( n). 


Hence if we denote by N,,-Cilf(x), p,| the Nérlund transform of ¢,(2), it follows 
from (2.01) that 


, 


YS poolCilk) + Cx(k) + Cx(h)] 
nk=0 


C1+ 24+ Cs. 


Nu-Cilf(x), p.| — f(x) 


(5.02) 


6. Previous results. For the convenience of the reader and for the sake of 
completeness, we state here certain results of Hille and Tamarkin. Their 
main theorem is the following: 

THEOREM A. A regular Nérlund method of summation (N, p,) is Fourter- 
effective if the generating sequence |p,\ satisfies the following conditions: 


(6.01) n|pa| <C|Pal, 
(6.02) 7. k Pr — Pr-i < C P.. 9 
k=1 
. — | Py ie 
(6.03) > = < C|Pal; 

k=1 


where C is a fixed positive constant independent of n. 
They also prove the following 
Lemma 6.1. Jf {p,} satisfies the condition 


n 


(6.04) is | Pi — fr-i1| = oA(P,.), 


then 
> pe* = o( P,) 
k=0 
uniformly int for0O <6S t Sf. 
It is also shown in their paper that (6.04) holds if (N, p,) satisfies, for example, 
conditions (6.02) and (6.03). 


II. Summability at points of (C, a) continuity 


7. Definitions and results. The Lebesgue sets of points associated with 
g(t) and y(t) are those for which 


t t 
I g(u)| du = o(t), y(u)| du = o(t), 
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and f(x), f(z) exist and are finite, respectively. At such points the conditions 
of Theorem A are sufficient in order that the method (N, p,) sum the corre- 
sponding series to its correct value. We shall consider here summability at 
sets of points wider than the Lebesgue sets. 

For a > 0, the a-th integral of a Lebesgue integrable function F(z) is de- 
fined as 

t 
F(t) = 25 I (t — u)*"F(u) du. 

I'(a) 0 
It is well known that F,(¢) exists for almost all ¢ and is integrable; and that if 
F(t) exists at t = to, so does F(t) for all 8B > a If F.(t) = o(t*) ast > 0, 
we say that F(¢) is continuous (C, a) at t = 0. It is well known that if F(¢) 
is continuous (C, a), it is also continuous (C, 8) for all B > a. 

DEFINITION 7.1. A point x for which f(x) has a definite value is said to be 

(i) A, regular if y(t) is (C, a)-continuous; 

(ii) K. regular if y(t) is (C, a)-continuous and f(x) exists and is finite. 

DeFInitIon 7.2. A summation method is said to be 

(i) K, effective if it sums the Fourier series of f(z) to the correct value at 
all the K, regular points; 

(ii) K. effective if it sums the conjugate Fourier series of f(x) to f(x) at the 
K,, regular points. 

We shall prove the following 

TueoreM I. A regular Nérlund method of summation (N, p,) is K, and K, 
effective? (0 < a & 1), if the generating sequence {p,} satisfies the following con- 
ditions: 





(7.01) n|pa| <C| Pal, 
(7.02) > kl px — mal < C|P,l, 
k=1 
(7.03) >. k(n — k) | pe — Qua + pe-a| < C| Pal, 
k=1 
- — | Px | |Pa| 
(7.04) >> os <co, 


where p_, = 0 and C is a positive constant independent of n. 


8. Preliminary lemmas. In this and the next two sections we shall assume 
that unless otherwise stated f(x), f(x), and the generating sequence {p,} satisfy 
the conditions of Theorem I. 

Lemma 8.1. Jf {p,} satisfies conditions (7.03) and (7.04), then 


(8.01) > (n—k)| 42m | = O(P./n) = o(P.), 


where Mp, = Ap, — Apes = Pe — Zea + Pe-o2, with p_y = p_s = YO. 


2? K, effectiveness for the case a > | will be discussed in §20. 
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Putting 
U, = i k(n — k)| Mp | = O(P,), (>) = 0, 
k=1 


we have 


: ; se . U’, U 
( a k) A° I = ({ L— T - == -s ool 
2, pel 2p Us Ud = Bey tag’ 
whence, by use of (7.04), the conclusion follows immediately. 
Lemma 8.2. Jf |p,} satisfies conditions (7.02) and (7.04), then 


(8.02) > | pe — pea! = O(P,/n) = o(P,). 
k=1 
The proof is similar to that of the previous lemma. 
Lema 8.3. If |p,| satisfies (7.04), then 


n > 
P, <C|P.|. 


This result follows at once from the hypothesis. 

These lemmas show that a regular Nérlund method of summation (N, p,) 
which satisfies the conditions of Theorem I is automatically Fourier-effective. 
Therefore, since at the K, and K, regular points in the case 0 < a < 1, we 
have ¢i(t) cot(3t) = o0(1), ¥:(t) cot(4t) = o(1), respectively, it foliows that at 
all such points x, L, of (4.03) and ZL, of (4.04) are each 0(1) as n — «. Further, 
by the well known Riemann-Lebesgue theorem and the regularity of the sum- 
mation method, 3 and J are also o(1). 

Lemma 8.4. Asn— «,L,—O0and L, > 0. 

From (4.03), by using Lemma 6.1, we get 


tee n n 
lL, = aE gil) >» pie <= o(1). 


For 1, , we have from (4.04) that 


( ; ( 
A. p SC lim Wile) | _ 0 


=r 
Li <= lim oy 
« «+9 SIN 9 


.o 28in $e rP, (% 
at the K, regular points. 

The remaining integrals, Le and Le, are essentially of the same type, for 
gi(t)/sin 3¢ and y,(t)/sin 3¢ are functions of 2 and t which at the K, and K, 
regular points, respectively, approach zero with ¢; and the remaining parts of 
each integrand form the real and imaginary parts, respectively, of the complex- 
valued function 


I 
2rP,, 


(8.03) WM) = e~t(nt bt > p(n — k)e**, 
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Our theorem will therefore be proved if we show that 


(8.04) im f g(t) M,,(t) dt = 0, 

where g(t) = o(1) ast > 0. 
9. Estimates of the kernel. We proceed to estimate the kernel (8.03), 
Lemna 9.1. We have 

(9.01) M0) = O(n). 
From the definition (8.03), 


; Pi (an —k) <= =: Yo pe, = O(n), 


s 
2r|P, | k=0 2r| P,| k=0 


Me. (d) 


in virtue of (2.02). 
Lemma 9.2. Jf {p,} satisfies (6.04), (8.01) and (8.02), then 


(9.02) y ¥ pi(n — k)e* = o(P,) 
k=0 
uniformly in t forO <6 S(t, S a, 6 being fixed. 
Let us set q, = pi(n — k) so that 


Aq: =<d. - qi: i= (n _— k) Ape om Pr-1 5 


(9.03) 
A*q: — Aq: — Aqi-1 = (n — k)A* px _ 2Ape-1, qn = 0, AY, = —Dnr-1. 
Then, putting 
k : 1 — ellk+bet 
(9.04) Sin = Det = a S, = 1, So = 0, 
v=0 —e' 


and applying Abel's partial summation method twice, we have 


" 


ma Uk eikt a ) (yy, i- qe) Sx + Gund 


0 i 0 


= (1-— 7 p> (Agnde PP as inci] 
kh 


o 


II 


(1 — e*) [> (A? gy )e* — (Agno — g,(1 — esperar], 
ko 


where g-) = q-2 = 0. Using (9.03), we get 


> p(n — k)e™ = (1 — e*) fe (n — k)(A* py, oe 
k=O 


k=O 


n-l 
=a 2 >. (Ap, Jeter Pe + Dn nnn 


k=O 
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Hence under the assumptions of the lemma, 


| > p(n — ke#t| < | 1 — ett b> in +001 ite 


k=0 | k=0 
+ 2 ym | Ap | + | Pn-1 | = O(t)-o(P,), 
k=0 


66,99 


where the “O” refers to t — 0 and is independent of n, and the “‘o” refers to 
n — « and is independent of t. This completes the proof. 

It remains to find suitable estimates for the kernel M,(é) in the interval 
(n-, 6). To do this we first consider the sum in (8.93). 

Put 
(9.05) Pn| = Tn; R,=rottit-:-- +n; 
and introduce the step-functions 
(9.06) r(u) = rw; Riu) = Ruy, 


where [u] as usual denotes the largest integer Su. Finally we put 


n 


Vo = 0, V, = yh | Pk — Pk-1 |; V(u) = Vins 


(9.07) _ =" 
WwW, = > (n —k)| A? px |, Wu) = Wha. 
k=0 
Consider 
(9.08) i» pln — k)e* = YX, + Le (t > 0). 


k=O 
where in 2; k ranges over the integers < + = [1/t], and in Y»2 over the integers 
>-7but <n. Then 


T 


(9.09) zis > pr (n—-—k) <n > r. = nR, = nR(t-). 


k=0 k=0 


Further, with our previous notation, 


Ye = —Ge Seq. + >» (qa — We) Se + GnSnyr 
; r+ 
= (1 — et) | > (Aq e™ + g,etrtpt — incioso | 
k=r+l1 








os > (A? q,.) e* (Aq, ert! sie (Aqnet™tt 4 gretintnt os gneintne 
(1 — e)? (1 — eit)? : ; i , a 


tg [i « ett)? 


fn rT) per t ‘ (n — t)(p, a Pr-retrtne _ Pr—yetr tnt + Da—eatn: 


" a 2 ike n 5 pike 
‘=? oe += ae 
k=r+l => } 


sone (1 — et)? * 
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Hence if A denotes a generic constant independent of » and ¢, 
|22| Ac] nt\ pe + n| py — Pra| + | Pral + | pra 


+ > (n —k)| A°p,| + 2 > |p| 


k=r+1 k=r+l 


and using (9.05), (9.06) and (9.07), we get 
\Z2| S At? [nér(t) + n| Ap, | + r(t — 1) + r(m — 1) 
+ W(n) — W(t’) + 2Vin — 1) — 2Vi(e" — 1)). 


Referring now to formulas (8.03), (9.08) and (9.09), together with the above, 
we have in virtue of (2.02) that 








| M(t) | s xP | > pln — k) este | <A > M,,(t), 
where 
Ma(t) = RG Ra RO: M(t) = RG Ra)" ), Mys(t) = FRI) Ap, |, 
Mult) = 5 ts aE - 0 ~ Se 
M(t) = Ae eR@ | (n)- W(t)), = Mar(t) = »RG ny lV (n — 1) — Vie" — 1)). 


10. Proof of the theorem. The truth of (8.04) under the conditions of the 
theorem now follows from the following lemmas. 
Lemma 10.1. Asn — &, we have 


[ g(t)M,.(t) dt = o(1). 
0 
By (9.01) and Definition 7.1, we have at once 


r g()M,(t) dt = o( [7 nit) = (1). 


Lemma 10.2. For fired 6 > 0, we have asn — & that 
i g()Mn(t) dt = o(1). 
3 


This follows at once from Lemma 9.2 and formula (8.03). 
In order to show finally that as 6 — 0 


F 
I g(OM,(t) dt = o(1) (1/n < 8) 


nn! 
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at the points under consideration, it suffices to prove that 
t) 
(10.01) / M(t) dt| < 


That (10.01) is satisfied under the assumptions of the theorem follows from 
Lemma 10.3. If |p,} satisfies the conditions of Theorem I, then 


“3 
| M,,t)dt = M (j= 1,2,---, 7). 
For j = 1, we have 


~——. : —1 i " RCs) | 
[stato dt _— R(n) / Rt )dt = R@) J; — 


n 


This is bounded since ” ye Ri is bounded by (7.04). 


R, k=1 k? 


For j = 2, 


t= a dt_ _n_ [*r(s) 4, 
[ M,,2(t) dt a MG ; = Rim [ . Is. 


This is bounded since ry p> : 
For j = 3, 


= 1 — Dr— 
I M,,3(t) dt = a n) I. p Pra 


is bounded by (7.04). 


dt n [ , d 
== . Dis) — als 8 
2 Rn) Jeo Pts) ~ Peel 
n - 
< -— Pra. 
= R(n) p> Pr ly 1 
This is bounded by (8.02). 
For j = 4, we have simply 


e M(t) dt . i (7 1) . [ox )ds <1 
Ma = —— r _ ———— "(s)ds é 
<< Rin) J, 2” Rn) Jes = 


For j = 5, in virtue of (7.01), 


’ (n — 1) nr, 
M.s(t) dt = 7 = O 4) oO 
[ Ins Rin) [3 (* =) ) 
For 7 = 6, 


é 1 1 , 
¢ — == ( ! ( —_ ds 
[ato It eno fm Wit 1" * aw |. [W(n) — W(s)] 


]  » 
A — W(s)|ds 
iw [W(n) W(s)]¢ 


1 fr i yi 
- sfW(s) = bs ble 
Rin) / wie) = p> dt dal 


This is bounded by (7.03). 


A 














or 
a 


SUMMABILITY OF FOURIER SERIES BY NORLUND MEANS 55: 
Finally, for 7 = 7 as for 7 = 6, we have 


Mult) dt = aay [We y-ve—-ye< l Seipn—p 

mm 5 in? 
This is bounded by (7.02). 

The proof of Theorem I is thus completed. 


11. The (N, p,)-C,; theorem. For this method of summation we have the 
following 

TuHeoreM II. The (N, p,)-Ci method is K, effective (0 < a & 1) provided 
the generating sequence {p,{ satisfies conditions (6.01), (6.02) and (6.03). 

From (5.01) it is obvious that C,(n) — 0. Further, the kernel of C3(n) is 
essentially a Fejér kernel, and at the points under consideration ¢(t) cot }¢ = 
o(1), so that by a well known theorem, C;(n) — 0 as n — «©. Hence, since 
the generating sequence {p,} is regular, C; and C; of (5.02) approach zero. 

Finally, since by hypothesis we also have ¢,(t)/sin}¢ = o(1) and the kernel 
of C2(n) is essentially a Dirichlet kernel, the result of Theorem A applies to 
show that C2; 0asn —> ~. 

A similar theorem can be proved for the (N, p,)-C; summability of the con- 
jugate Fourier series. 


III. The derived series 


12. In a paper published in 1926 M. Jacob [4] proved that the r-th derived 
series of the Fourier series generated by a Lebesgue integrable function is 
summable by the Cesaro method (C, r + 6) to the value of the r-th generalized 
derivative of the function (in the sense of de la Vallée Poussin [5]) whenever 
this derivative exists. By means of an example he showed that the restriction 
5 > 0 is necessary. 

For the r-th derived conjugate series, summability in the case r = 1 has 
been considered by Plessner [10], Sayers [12], and Takahashi [15]. In the 
general case, Plessner [11] has defined an r-th derived conjugate function and 
stated without proof a theorem on the Cesdro summability of the series. 

A. H. Smith [13] has introduced an r-th derived conjugate function to which 
he showed summability by the Bosanquet-Linfoot method of the r-th derived 
conjugate Fourier series. A. F. Moursund [6] has defined another such function 
and stated theorems for a method of summability introduced by F. Nevanlinna 
and extended by the former, as well as existence theorems for his function. He 
also stated such a theorem for the Bosanquet-Linfoot method [7]. 

We give here theorems concerning the summability of the r-th derived Fourier 
series and conjugate series for the Nérlund method (N, p,). 


13. Definitions and notation. The generalized derivative in the sense of 
de la Vallée Poussin is defined as follows: 
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DeFiniTion 13.1. If at a point z the function f(x) satisfies an equation 
of the form 


c+t —iyf«-t) VW. tri t" 
eee = ei gay te 5 
where w(z, t) — 0 as t — 0, then f(x) is the r-th generalized derivative of f(z). 
From the existence of the r-th generalized derivative follows that of the 
(r — 2j)-th, 0 < 27 < r. Whenever the ordinary 7-th derivative exists, it is 
identical with the generalized derivative f(z). 
With the notation of §3, we set 


[k(r 1)] e 
- 12 it 1-23 
A(t) = (1) —§ f(x); 
(13.01) A,(t) = ,(t) — 2 »y ponoergoreraes x) 
(13.02) A*(m) = | A,(u) a cot 4u du; 
. dur " 
[}(r—1)] ad jr-* 2 it ae 
3.0: -(n) = 2(-—1)' (r—1—2i) es et 
(13.03) B,(n) = 2(—1) p> f i) [ ——7 ae we NO 
[br]-1 [grJ—-1 2j—2i [2i+t 
ria ad 
3.( = rt pet “cot H 
(13.04) C= 2 f ) 2% goaapian tH 
[tr]-1 [}r]-1 Dp 2i-2i+1 


yy (27 — 2°41)! 


qeitt 1 1 ‘a : 
piri E cot jt — N ol , 
1)] 2j—26+1 


[ify 
(13.06 “Cr = (—])rt!? . v ase NGG ' 
3.06) Ur, n) ) > ajoaapin= Ol 


(13.05) 


In any of the above expressions, whenever the upper limit of summation is 
less than the lower limit, that sum must be interpreted as zero. Thus Bo(n) = 0 
and C, = D,(n) = 0 when r = 0,1. We shall show later that By(n) = 0. 

With this notation we define the r-th derived conjugate function f(x) at 
a point where f(z) exists by 

DEFINITION 13.2. For r = 0, 1, 2, ---, 


, (—1). [* ir 
J(2) = x - lim J A,(t) cot tdi —C,. 


The function f(z) is identical with f(z) as given by (3.07), and f(z) is 
essentially the same as f’(z) as this limit is usually defined. 

This definition was first given by Moursund. He proved* that if f‘ (x) exists, 
so does f(r), 0 < 2i < r, and that f(x) exists (i) wherever f+" (x) exists, 


‘In [6], p. 284; [7], p. 132. Note that we do not use quite the same expressions for the 


various symbols 
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and (ii) almost everywhere when the (r — 1)-th ordinary derivative is of bounded 
variation. 

DEFINITION 13.3. A point x for which f(z) has a definite value is said to be 

(i) H, regular if f(a) exists and is finite; 

(ii) H, regular if f(x) exists and is finite. 

DEFINITION 13.4. A summation method is said to be 

(i) H, effective if it sums the r-th derived Fourier series of f(x) to f(x) at 
all the H, regular points; 

(ii) H, effective if it sums the r-th derived conjugate Fourier series of f(x) 
to f(x) at all the Ai, regular points. 


14. Statement of results. We set 


(14.01) A’p, = Ap, — A pen, +--+, Ape = px — Per, Ape = pr, 


where p_, = 0,k = 1,2,---. 

We shall prove the following two theorems. 

THEOREM III. A regular Nérlund method of summation (N, p,) is H, effective 
if the generating sequence |p,} satisfies the following conditions: 


(14.02) ni| Ai", | < C| P,| (j= 1,2,---,r+1), 


(14.03) 26 Mn — Rm ate | < C| Pal 
— k=1 


(j= 1,2,---,r+2;m=0,1,---,r), 
n | P;. 


k=1 an 


(14.04) -<C 





P,|n-, 


where C is a fixed positive constant independent of n. 

THeoreM IV. A regular Nérlund method of summation (N, p,) ts H, effective 
if the generating sequence {p,\ satisfies conditions (14.02), (14.03) and (14.04), 
with r replaced by r + 1. 

In what follows we shall assume that unless otherwise stated the conditions 
of Theorem III hold if we are dealing with the r-th derived Fourier series and 
those of Theorem IV hold if we are discussing the r-th derived conjugate series. 


15. Basic formulas. In order to simplify the expressions (4.05) and (4.06) 
we shall use the following lemmas. The proofs depend on elementary computa- 
tions and are omitted. 

Lemma 15.1. 

1 " 
yo Pn—k 1 + 2(cos t + cos 2t + --- + cos kt)]; 


2nP, 


N,(t) a 


he ae 
| NQdt= 3; sar Nalt) = 0 att = 0,r. 
0 at 
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Lemma 15.2. Forr — 2i 2 0, 


‘U(r, n) (¢> 90), 
r  gr-2i 
omnes NG) a on § (—1)r 
| (r — 22)! Uolr, n) + —2 (i = 0). 
2 
Lemma 15.3. 
N,(t) = > pr—«{sin t + sin 2¢ + --- + sin kt]; 
a k=0 
fe gue di. 
a | me lti=0; . N, = 0 = Tr. 
/ qa N,,(t) dt 1 V(t) = 0 att =O0,7 


It follows from this lemma and formula (13.03) that By(m) = 0. 


Lemma 15.4. B,(n) = C, — D,(n). 
We proceed to transform the expression for N)’ [f(x), p,}. 
Definition 13.1 and Lemma 15. . we have 


dre) [" Ne wat 


+ 2(-1)" [" w(z, t) 5 N‘(t) dt 


From (4.05), 


Ni. If(z), p.| = 2(-1) 


lar} 
f(x) +2(-—1)" > Uj(r, nf 20(7) 


i=0 


II 


(15.01) a 
+ 2-9 fol, 0 FN at 
0 Pr? 


For N,' '[f(2), p,], we have from (4.06), using (13.01), (13.03), (13.04), (13.05) 
and Are together with Lemmas 15.3 and 15.4, that 


vu [f(x), p,] = (-ye [ A,(t)N\(t) dt — C, + DAn) 


0 


ee [" A(t) © cot Hat — C, + Dn) 


+ (-1)"*! | A,(t)N(t) dt 


6 3 
ceoapilf! of amano 
n i 8 


-_ r+l al r 
(15.02) a‘) [40S cot seat — 6, + Din) + Int Jat de. 
2r n-! dt’ 


16. Preliminary lemmas. For the values of j indicated in (14.02) it follows 


at once that 


(16.01) | A'p, | = O(P,). 
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Further, from (14.04), 
me Sy | Py | 
(16.02) Pu »y in O(1) (6 = 0,1,2,---,7r). 


Finally, 
(16.03) ps (n — k)™| A™*!p, | = O(P,/n) = o(P,/n) (m=0,1,2,---,7). 
This follows at once if in (14.03) we put 7 = m + 2, set 

T, = To = Do k(n— Kamp | = OP), ‘To =, 


and proceed as in Lemma 8.1, using (16.02). 
In virtue of (3.10) and (3.11) we write 


—@e~ i(n+})e n 


(16.04) N,(t) = V(t) + iN, () = InP, sin i Dd me 


k=0 


and denote by R(t) the result of differentiating N,,(¢) m times with respect to ¢. 
Lemma 16.1. For0 < ¢ S n“, we have 
tN\"'(t) = O(n). 
First, for the values of ¢ indicated and k Ss 
ef sin (k- - 3)t| ‘\ dr ; | dr ) 


a sin < tr{| — cot its kt |__ cos kt |> 
Uae sin ) | ted \lar °° 5¢ sin + dt’ CO { 


<< (r\ @& dr-* . ‘ dr 
<t | z (; 7 cot 4t ae sin kt + sin kts. cot 4t| + (kt) 


r—i 
- o(« s a kr-8 ers. = :) = Ob), 


3=0 


whence by (3.10) and (2.02), 
UN'()| = (rp } k| pn— | ‘) = O(n). 


To simplify the succeeding estimates we set 
1 


(16.05) —. = 2nP,(1 — e*) sin (jt! > 0). 
b(t) 


d* 1 
dt’ b,(t) = of P, yan) 


Lemma 16.3. Jf |p,} satisfies (16.01) and (16.03), then 


LemMa 16.2. We have 


eem’”’(t) = o( 1) (m = 0, 1,2, ---,r) 


uniformly int forO0 <6 St. S am, where 6 is fixed. 
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We first transform the sum in (16.04). Assuming n > r + 1, and applying 
Abel’s partial summation formula as in §9 s 


= m + 1 times, we have 
b pr ei* = x | a — ei(n+bt . a~ "Pn 


aes e ys 








=] qd _ ett)’ 
the differences being given by (14.01). Substituting into (16.04) and using 
the notation of (16.05) give 


1 
Pn» 
k=0 


N,(t) = —b,(t) > (A* pm emn—ket edit > b(t) A 
j= 


whence applying Lemma 16.2, 


9 (m) sai ads _ m a™ 6 | (n—k+4)t 
mo” (t) a ——" ».(t) > (AS pr) - a 


=f) 


a™ Bp ~ 
~ a _ ef y (A~' p,) 00] 


os i , 


- “1 Plata | (n — kb) | SS | AM p, |] 
= 015: piLdy eee +B eae] 





p+8ti 


(n — k)™ Api 


5 
o( Pp > (n —k)™ At, | + > Ai-'p, ) (t= 1), 
n k=0 n j=1 


- A’'p, 
10 : ——— (t< 1). 
k=0 P, tetet " 2d Fe its) 7 
Now for 1 < ¢ S 7, we have the desired result at once upon using the hy- 
potheses, and for 6 < ¢t = 1, 
mn” (t) = (pe = (n — k)™| A*p, ee Ai pa ) = o(1). 
n| k=0 P,| j=l 


This completes the proof of the lemma. 
Lemna 16.4. If {p,} satisfies (16.01) and (16.03), then 
Nn” (t) = o(1) 
uniformly int for0O <6 Ss t (Sn. 
his follows at once from the previous lemma 


It remains to give estimates for the kernel in the interval (m~', 6) 
use the notation of (9.05) and (9.06), and for 8 = 0, 1, 2, --- , r put 


(16.06) Wh = > (n—k)* | ap, |, 
be 


Consider 


(m = 0,1,2,---,7r) 


We shall 


W(u) = We). 


(16.07) > mew = 4+ 22 (t > 0), 
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where in =, k ranges over the integers < +r = [t-'], and in 2, over the integers 
>r but S n. 

The left member of (16.07) has continuous derivatives of all orders. Each 
term on the right has derivatives of all orders everywhere except at a denumer- 
able set of points, viz., those for which + = [t"'], (n“' S ¢t S 6). But upon 
multiplying the r-th derivatives of the left and right members by a bounded 
function and integrating, we find the results are equal. Hence + may be con- 
sidered a constant with respect to differentiation. 

We proceed to transform 22, applying Abel’s partial summation formula 
as in estimating the 2-2 of (9.08). Assuming n > r + l and repeating s = r+ 1 
times, we get 


2 = metres — Camtpdetorne | (arpadet 


-2 = ——— a 


j=l (1 — e#) ~~ 
where the differences are given by (14.01). Substituting this into (16.07) and 
the result into (16.04), we have 


N,(t) = —! + pre mkt bt : # (A! p, )b(the hr 


2rP,, sin at k=0 j=1 


+ >» (\’-' p,, )b,(tet* — b,(t) > (A* p, Jenin 
1 


j=l k=r+ 
(16.08) = Liys(t) + Lro(t) + Las(t) + Las(t). 


We compute now the r-th derivative of each L,,,,(¢) using the notation of 
(9.05), (9.06), and (16.06), together with Lemma 16.2, assuming that f' is 
not an integer. 

For m = 1, 


d’ -1 w/r\< ds a | 
pl La - ‘2 > > 5 eee a 
dt’ 0) 2rP, j= (;) k=0 Pt ap dt’ sin 3¢ 


= o( 5 > > (n — k)*| py |@--t = of 3 > nh(e- >> rs) 


| 8=0 k=O 3=0 =O 


| 1 
= 0. : 8 pdr 
( p. #(?) >> aii ). 


For m = 2, putting s = r + 1, we get 
d* <—[a/r ds ar-3 
Lo(t) = = Vi 1 a y—iinmore dye b. t) 
dt’ > = i P ap atr-8 ri | 


rel r . 
Ap, | (n — 7) 
O le , 
(So 


For m = 3, we have at once 


ar ort J 'P ) 
- Laat) = O oh. 
av" (x P, |e 


g@s f 


ll 
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Finally, for m = 4, 
dr? 


~ bald > (5) YS (ap) itt © a) 
-— Lult) = — 2) ae eink . 
dt . 3=0 \B/ srt - dt? dt™-# 


- ~ (n—k) Ap, ~ W*(n) — W(t) 
(> ss P,, | te+r-841 ) - A(X P,, | t¢+7-3+ ). 


8=0 j=r+1 s=0 


lI 


If we refer now to (16.04) and (16.08), these estimates imply the existence of 
a generic constant A such that 


IA 


A> »™,, (0), 


m=1 


NPM) |, I MW! < | MW 


where 
1 GS | A», | (n — 7)8 


1 : nb 
VM, = —— -~R(t), M(t) = AD laa 
meat t'R(n) 2 er ) UR(n) »y p> tbr , 


1 ‘| art, 1 ht Wn) — W(t") 


Ri =) — po 9 M(t) = 


M,3(t) = 
: t'R(n) s=% {7-8+2 


17. Proof of Theorem III. The truth of Theorem IIT now follows from the 
following lemmas. 

Lemma 17.1. Asn — ~, U,(r,n) > 0. 

This follows at once from Lemma 16.4 and formula (13.06). 

Lemma 17.2. If atu = xr, f'(u) exists, thenasn— =, 


w(x, t) NY’(Odt = o(1). 


By Definition 13.1 and Lemma 16.1, 


/ w(x, t) NY? (t)dt = O(n f | w(a, t) | it) = nf” at) = o(1). 
0 0 0 


Lemma 17.3. If {p,} satisfies (16.01) and (16.03), then for fired 5 > 0, 
/ t'w(2x, t) N“")(t)dt = o(1). 
8 


This follows from Definition 13.1 and Lemma 16.3 with m = r. 
Thus (15.01) is reduced to 


6 
~ 


MPU), pd — f(a) = 2(—1 [ole 0 NMDA + of, 
and to complete the proof of Theorem III it suffices to show that 


é 
| R(t) | dt < C, 1/n < 4, 


n 


as 6 — 0 at the H, regular points. This follows at once from 
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Lemma 17.4. If {p,} satisfies (14.02), (14.03), and (14.04), then 
3 
(17.01) / UM an(t)dt < M (m = 1, 2, 3, 4). 


Property (17.01) holds in the cases m = 2, 3, 4, if we assume only that (14.02) 
and (14.03) are satisfied. 


For m = 1 we have 
lhe ~ ne . .~ " R(u) 
"Mau(t)dt = > —— “edt = du, 
| eM i(t)at >> Raw |. Re edt >> Raw |. we 
which is bounded since 
nos Re 
R, k=1 kot 
° is bounded for 8 = 0, 1, 2, --- , r by (16.02). 
For m = 2, 
é r+il r 
l *(n — r)®| Am "ps 
UM 9(t)dt = — = Ae ll Ed ad 
/ 1 ra(t) j=1 B=0 R(n) a7 p-s+1 “ 
r+l1 r 1 n 
= a p> in) | w-F-1 (n — [u})®| A ppg | du 
r+ r 1 n 
= A= —— pe ki-8-1 (n — k)® | Ap, ) 
j=1 s=0 R,, k=1 


This is bounded by (14.03). 
For m = 3, 


é r+1 i rs r+l 2 
Ap, dt n’ | A’"p, ) 
¢M,3(t)dt = lasing —=0 peal SM EY Y 

| : nal " go R,, I on (x R,, 


This is,bounded by (14.02). 
Finally, for m = 4 we have 


é : f ] Wn) — W(t - 
[emda = % i [ Pear 


1 
= 2 Rw 








‘ ut9[W8(n) — W%(u)] du 
3-1 


- ] ‘ 3 
—_ 8 W8(n) — W(u)l d 
< 2, Rin) i ur—3[ W8(n) W(u)] du 





= 1 . . 
8+ dWA(u) 
L mwece |. . tu 


’ = (> . > kr-+(n — ke)? | Art, ') 


8=0 nk= 


This is bounded by (14.03). 
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The proof of Theorem III is thus completed. 


18. Lemmas for Theorem I\. We give here special lemmas for Theorem IV. 
Proofs are omitted, since they depend on elementary computations. 
) i” 
Lemma 18.1. ForO StS n"'andk <n, HY o/s cot 3t = O(1). 
0 rT 
LemMA 18.2. For0 St 


A 


IIA 


nandk © n, 


d tr) ad’ | a 
di E w/o cot i] = O(n). 


Lemma 18.3. Alt = n-'and forO Sk Sn, 


Cc? eos (k bf r 
CO (h + oF cot 44 = G(1). 
dt sin 4 - 


Lemma 18.4. If atu = x, f(u) exists, then as a, B > 0, 


3 . 
| A(t) “— cot st di—0. 
= dt’ 


We note also, since the restrictions on {p,} in Theorem IV are the same as 
for Theorem III except that in the former we replace the r of the latter by r + 1, 
that all the lemmas and results of the previous two sections may be used here 
with the index r increased by unity. 


19. Proof of Theorem IV. The truth of the theorem now follows from 
the following lemmas. 

Lemma 19.1. Asn — <~, D,(n) 0. 

This follows at once from the expressions (3.11), (13.05) and (16.04), together 
with Lemma 16.4. 

Lemma 19.2. Asn— ~,J,;— 0. 

Integrating by parts and using Lemmas 18.1, 18.2 and 18.4, together with 
the notation of (3.11) and (13.02), we have from (15.02) that 


(-))r4 g n 
J, = ) : Pri / A,(t) Hi '(t) dt 
2rP,, r 0 


(—1)+! < AN ABO | [° ; *(*) do HY (t) 
= -J / / 
2rP, 2 put 1:(%) dr bf ° , n} dt dr a 


cot 4 fs - cot ht 


dt’ 0 dt 


of F > Pr—k ) + of 5 > Pu | kat) = o(1), 


in virtue of (2.02). 

Lemma 19.3. Asn-> «,6 > O being fired, Jz —> 0. 

This follows at once from (15.02), since by (16.04) and Lemma 16.4 the kernel 
of J, tends to zero uniformly in t,0 <6 StS wr,asn—> @., 








om 


her 


vith 


dt 


ernel 
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Lemma 19.4. Asi—0,J2— 0. 
Integrating by parts and using the notation of (13.02), we have from (15.02) 


that 
oa NOW |¢ P d Pird/ /s : 
A ,(6) — + a A,(6) di N‘ (t) ar cot at 


= cot Bt 





J. 


in 
~ 


rs 
o(1) + | (eH RIP CO | +e | NP |) it) = o(1), 


first by using Lemmas 16.4, 18.3 and 18.4, together with (2.02), and then 
Lemma 17.4 under the hypotheses of Theorem IV. 
This completes the proof of the theorem. 


20. (C, a) continuity. We give here the theorem for A, effectiveness of 
the method (N, p,) inthe casea>1. Let rbean integer such that 1 < r—1 < 
asr. 

TueoreM V. A regular Nérlund method of summation (N, p,) is Ka effective 
(a > 1), provided that {p,} satisfies conditions (14.02), (14.03) and (14.04). 

Integrating (4.03) by parts r times, we have 


NAf(a), p.) — f(z) [ e(t)N a(t) dt 


r—l 


r-l1 ¥ 
LD (= en sale) NO), 
m=0 ae 
é x 
+(-1 | +| Jotonyro ae 
0 8 
y Aw +e lk. 
m=0 


By Lemma 16.4 and (16.04), each A“&” — 0 as n — ~, and J, — 0 for fixed 
6> 0. Finally, since a S r, ¢,(t) = o(t"), so that as n — ~ and é — 0, 


é 
|7.| = 4 [ | 90 | at) = 0(1), 


by an argument similar to that used in Lemma 17.2, and by Lemma 17.4. 


IV. Strong summability 


21. Definitions and results. Hardy and Littlewood [2] have shown that if 
forp>1 


(21.01) i lg(u) |?’ du = O(0), ¢i(t) = o(f), 
0 


the Fourier series of f(x) is strongly summable, i.e., 


(21.02) Zz. Su(r) — f(r) |* = o(n) 


m=O 
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for every positive q; and if 
(21.03) / ly(u) |? du = O(8), 
0 


then at points where f(x) has a definite value, the conjugate series is also strongly 
summable. Previously, O. G. Sutton [14] had proved that (21.02) holds if 
we replace s,(z) by its n-th Fejér mean. R. FE. A. C. Paley [9}* showed that if 


(21.04) / g(u)| flog? | o(u) |}'** = Of), g(t) = o(t*), 


where 6 > 0 and k is any positive number, then (21.02) holds. 

From the results of C. E. Winn [16], it follows that for g = 1 (21.02) holds 
if we replace s,(z) by its Cesdro transform of any positive order, assuming 
(21.01) or (21.04); and that the conjugate Fourier series is strongly summable 
for g = 1 if we replace &,(7) by its Cesaro transform of any order assuming 
(21.03). Our purpose is to give theorems on strong summability for the method 
(N, p,). We set r = g/(1 + n) for fixed positive » < 1, and denote by 
IN,{sel}, {Nal8]{ the Norlund transforms of {s;(x)}, {8:(x)}, respectively. 
Our theorems are the following: 

TueoreM VI. If g(t) satisfies (21.04), then for every positive q 


n 


Nmlse] — f(x) |* = o(n), 


m= 


provided the reqular generating sequence \p,\ satisfies the condition® 


‘ | me |” , 
(21.05) > Fos = O(m**’), 
k=0 m 
THeorem VII. The conjugate Fourier series of f(x) is strongly summable at 
all points where f(x) has a definite finite value and 


t 
(21.06) / ¥(u) | flog* | yu) |} '*? du = O(8) (6 > 0). 
Tueorem VIII. Jf P(t) satisfies (21.06), then for every positive ¢ 


> | Noli) — f(z) |* = o(n) 


m=) 
at all points where f(x) has a definite finite value, provided the regular generating 
sequence \p,\ satisfies (21.05). 


22. Proof of Theorem VI. An application of Hélders inequality shows that 
we may assume g > 2 and therefore r > 1. We assume with Paley that the 
standard simplifications have been made so that f(t) ~ 2Za,cos nt is an even 
function with zero mean value, and x = 0, f(0) = 0, so that s,(0) = s,, = 
GO; + Gy + -- + dy», . 


' All future page references are to this paper. 
* For any positive number s we define 3’ by the relation I/s 4+ 1/s’ = 1. 








— Ss — VSS 8 VS we 
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Let us put p = exp(— 6/n), where 0 < 6 < 1. Then® 


k k-1 

, bm 

= = a,p” = &p* + (1 — p) Zz 8,p” = Sep* + o( =) (m Sn), 
v=l1 v=1 


where the “O” is independent of k, m, and n. In virtue of the regularity 
condition (2.02) 


NS] — Ns p*]| s A— Ss Ab (m Sn), 


where A is a generic constant, so that if ¢« < 1 is a preassigned positive number, 
we choose 6 so small that 


(22.01) Nw [8] — Nun[s p*] | 
1 


m= 


IA 


€r. 
Now 
oo i [" u(p, 8) sin (k + 3)0 


1 . : 
Pm 2 i@= - u(p, 4) cot 36 sin kédé 
? 2 sin 46 | Pere ee ere 


(22.02) 
+ | u(p, €) cos kéde, 
T Jo 


where u(r, @) is the Poisson integral of f(t). 

By the Riemann-Lebesgue theorem the last term on the right in (22.02) 
is o(1) ask -—> «©. Hence in virtue of the regularity of the transformation, its 
Norlund limit is zero. The first term we write as 


G(i—p) "r 
+ = sa + Se, 
0 G(l—p) 


where G@ remains to be chosen. Then by Minkowski’s inequality, 
1 1 1 
ae q = . _\e = ss q 4 
(22.03) (= Nn [8] ') < (> | Nn [8ia] *) + (> Nin [Sia] ‘) a pe ), 
m= m= m=1 
Now’ | s:| S € (k S n) for any fixed G, so that by (2.02), 


(22.04) PR | Naw [seal |* S (Ce)? n. 


m= 1 


By Holder's inequality and (21.05), 


(8 


k=l 


r 


lV (Em) sah Eine 
™m k=l 


mM pm 


| m Pm k 
Pian 


kewl Pe 








Hence by a lemma due to Hardy [1], 


n m 1+ om 
> | N[ Sia] jrct-t9) < A (’ > na’) ; s A > Ske .. Ae tn 


m= I Me pm y 


* Loe. cit., p. 433, (4.2). 
? Loe. eit., p. 437. 
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by proper choice® of G. Substituting into (22.03) and using (22.04), we have 


i 
(> | Nin [3x] ) < ni {Ce a Ace + dui} = dP). 
m=1 } 


Hence, again using Minkowski’s inequality, we have finally from (22.01) 
together with the fact that p* = e"' fork < n 


> | Nmlse] [2 = o(n). 
m=1 
This completes the proof of the theorem. 


23. Proof of Theorems VII and VIII. We shall first prove Theorem VII. 
As before, we assume the standard simplifications have been made, i.e., f(z) 
is an odd function, x = 0 and f(0) = 0. The conjugate series is then —2b, 
and 3,,(0) = on = —(bi + be + --- + Dy). 

Lemma 23.1. Let r,, denote the m-th Cesdro sum of the conjugate Fourier series 
of f(x) atx = 0. Then r,, is uniformly bounded. 

From (21.06), 


t 
I Y(u)| du = O(t). 
0 


The rest of the proof is similar to that of Fejér’s theorem. 
As before, 
~ 6m 
on = — > b,p” = ogmnp™ + o() (m = n), 
v=l1 n 


so that by proper choice of 4, 
(23.01) Dd | Gm — omp™| S en. 


Now 
l a cos (m + 4)8 — cos 40 


— u(p, 0) dé 
2 sin 30 


on = 
T 


Lf u(p, 9) cos m9 cot 40d — tf u(p, 8) sin mé d@ 
wT Jo T Jo 


- Lf u(p, 0) cot 36 dé. 
Tio 


The second term on the right is 0(1) by the Riemann-Lebesgue theorem. The 
last term, as p —> 1, is f(O), which is zero by hypothesis. Hence we may write 


| Gp) © 
on = ([ -+- [ ) u(p, 0) cos m@ cot 40 dd 4 o(1) 
us a) a(i—p) 


* Loc. cit., p 436, (7.2) 
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(23.02) = Omi + Tn2 + o(1), 


where G remains to be chosen. 

We consider first the term on: and observe that it is essentially the m-th 
Fourier cosine coefficient of the even function A(t) which is zero in the interval 
[0, G(1 — p) ] and in [G(1 — p), x] is equal to u(p, theot 34. Hence applying 
the Young-Hausdorff theorem we have as in the paper of Paley® 


n 


(23.03) Dd | ome |? S en. 
m=1 
For o,1, we observe that in the interval [0, G(1 — p)] the function cos mt, 
since m < n, has no more than AG turning points. Hence this interval can 
be divided into less than AG subintervals in each of which cos mt is monotonic. 
Applying the second law of the mean to the integral over each such subinterval, 
| om | is not greater than the sum of AG terms of the form 


] =’ ! 
| u(p, @) cot 46d6)\, 
iJé 

where 0 S & < & S G(1 — p). Hence for fixed G and n — «, we have 


n 


(23.04) > 3 | Oma |? S en. 
m=1 
Returning now to (23.02), we have upon applying Minkowski’s inequality, 
and using (23.03) and (23.04), that 


1 
(> | Em “)' s ae (. + a + Li = dat 


Hence by (23.01) and again using Minkowski’s inequality, 
1 1 1 
n q n a n @ 1 
D om” |*) S( Do | amit) + Do | Gm — omp™ |?) = one). 
m=1 m=1 m=1 
Finally, then, since for m S n we have p™ = e', it follows that 


n 


p | om |* = o(n). 


m= 1 
This completes the proof of Theorem VII. 
The proof of Theorem VIII is similar to that of Theorem VI. 
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TWO SYSTEMS OF POLYNOMIALS FOR THE SOLUTION OF 
LAPLACE’S INTEGRAL EQUATION 


By H. BaTeMAN 


1. In the radiation and conduction problems' in which the integral equation 


f(z) = | e~* g(t)dt 
occurs, the variable x takes positive values, and so the function g(¢) is to be 
derived from the values of x for x > 0. In the inversion formulas given by 
Lord Kelvin? 


a 


f(isn=xr 


J0 


oo (u/2xr)F(u)du, 
sin 


COs 


ch (ut)! 
sh 


g(t) = (xt) du 


Jo Jo 


© (ut) O* (u/2x)a Mf(x)dx, 
sin sin 
the integration with respect to x does indeed run from 0 to x, but conditions 
to be satisfied by f(x) or F(u) sufficient to make one of these formulas valid 
have not yet been formulated in a useful form. A similar remark applies to 
the somewhat analogous formula of F. Sbrana.* A more complete inversion 
formula in which the integration runs from xr = 0 to xr = * has been given 
recently by R. KE. A. C. Paley and N. Wiener.¢| In Murphy’s first method® 
of solving the integral equation, zf(.) is expanded in a series of ascending powers 
of x~! and g(0) is expressed as the coefficient of r~' in f(x)e* which, by Cauchy's 
theory, may be expressed as a contour integral. This method was generalized 
by Lerch® for the case in which 2’f(x7) can be expanded in a series of powers 
of 2°! and the resulting expression can be transformed into a contour integral 
resembling that used in the well-known inversion formula of Laplace, Riemann 
and Mellin. 

Murphy also gave a method in which f(x) is expanded in a series of inverse 


Received February 3, 1936; presented to the American Mathematical Society, November 
30, 1935. 

'H. Poinearé, Jour. de Phys., vol. 11 (1912), p. 34; L. Silberstein, Phil. Mag., vol. 15 
(1932), p. 375; H. Bateman, Proe. Camb. Phil. Soe., vol. 15 (1910), pp. 423-427. 

* Lord Kelvin, Camb. Math. Jour., vol. 3 (1842), p. 170; Math. and Phys. papers, vol. 1, 
p. 10. See also H, Bateman, Messenger of Math., vol. 57 (1928), p. 145 

* PF. Sbrana, Rend. Lineei, (5), vol. 31 (1922), pp. 454 456. 

* Fourier Transforms in the Complex Domain, Chapter 3 

®*R. Murphy, Camb. Phil. Trans., vel. 4 (1883), p. 353. 

°M. Lerch, Rozpravy, vol. 2 (1893), p. 9; Fortschritte der Math., vol. 25 (1893), p. 482. 

9) 
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factorials and g(t) in a series of powers of (1 — e~‘); this method has been 
developed by Schlémilch’ and is used now in discussions of factorial series. 

In Murphy’s third method,* zf(x) is expanded in a series of powers of 1 — x7! 
and g(t) in a series of polynomials of Laguerre. This method has been used 
recently with some success by Picone® and Eckart ' and the analysis developed 
further by Picone,"' Tricomi’® and Widder." For the suecess of this method 
simple expressions for f(x) and all its derivatives are needed forz = ~. This 
is true also for a successful application of Widder’s new method" which depends 
on the limiting form, as n — ~, of an expression involving the n-th derivative 
of f(x) and furnishes a convenient way of approximating to g(t) when the 
derivatives can be found with accuracy. When f(z) is given by a graph, it is 
not advisable to use derivatives.” 


2. In order to make use of integrals involving f(x), it seems natural to adopt 
Murphy’s idea of using an expansion in a series of orthogonal functions but to 
apply it to f(z) instead of g(t). The simplest way of doing this is to use a set 
of orthogonal functions of type 


1 z-—1 
oq? (254): 


where P,,(z) is the Legendre polynomial. Writing 


1 a—1 eu 
(1) z + 1 P. (2-4) = ] e U n(odt, 


the entire function U,(t) may be expressed in the form 





Qe 
(2) U,(t) = ~ i exp [—¢(1 + e~*)] P,(1 + 2e#)do. 
0 
The power series for U,(t) is 
— f wy 


7Q. Schlémilch, Zeit. fiir Math. und Physik, vol. 4 (1859), p. 390. 

*’ R. Murphy, Camb. Phil. Trans., vol. 5 (1835), p. 113. See especially p. 145. 

*M. Picone, Rend. Sem. Mat., Univ. Roma, (1933). 

10°C. Eckart, Phys. Rev., (2), vol. 45 (1934), p. 851. 

11M. Picone, Rend. Lincei, (6), vol. 21 (1935), p. 306. 

12 F. Tricomi, ibid., pp. 232, 420. 

13 —D. V. Widder, this Journal, vol. 1 (1935), p. 126. 

44D. V. Widder, Trans. Amer. Math. Soc., vol. 36 (1934), p. 107. 

18 Use can, however, be made of Widder’s solution of the moment problem or some 
analogous formula in which use is made only of the values of f(z) at a denumerable set of 
points. 

16 This plan has already been used by F. Sbrana, loc. cit. He used a Fourier series and 
a corresponding expansion of g(t) in a series of Bessel functions of a type occurring in 
electromagnetic theory. 
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where F(z) is a polynomial of degree n in z which has been studied elsewhere.’ 
If Q,(z) denotes the Legendre function of the second kind, there is a Neumann 
series 


(4) - — exp | -#2 +t) - = (2n a 1)Q,.(z U n(t). 


ft n=0 





This may be proved by using the definite integral for U’,,(é) and the known prop- 
erties of the expansion 


> (2n + 1)Q.(z)P n(n). 


2 
n=O 


By equating coefficients of z~"~' on the two sides of the equation (4), we obtain 
the expansion of a member of Laguerre’s set of orthogonal functions 


m 1 
(5) e' L,,(2t) = i (n + pus [ P,(u)u™dp. 
n=0 a | 
This may be inverted, with the result 
(6) Un(t) =e D> anim Lm(2t) = e* Z,(t), 
m=0 


where a, is the coefficient in the series 


P,(z) = > Ga, at”. 


m=0 


The functions U,(¢) form an orthogonal set in a generalized sense for there is 
a relation 


(7) [ Ua(s)V.(s)ds = [ [: SF Se Wl) = 5, 


where the function V,,(s) may be expressed in the following ways: 


Te elk 
(8) V,(s) = / es U,(0), 
m1 l+z 
(9) V,(s) = | = -¢ * 1=s P,(z), 
« 1 
(10) V,(s) ~ ) e~* La(2s) | z™P,(z)dz. 
m=n 1 
It may also be defined as a coefficient in the Legendre series 
(11) — exp | - s au ‘|- > (n + 4) V,(s)P,(z) (s > 0). 
a =, 


‘7 H. Bateman, Tohoku Math, Jour., vol. 37 (1933), p. 23; Annals"of Math., vol. 35 
(1934), p. 767. 
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To prove the relation (7), we write 





(12) V,(s) oe | a oe } On, m Lim(2t) = 7 An, m W,,(s), 
0 8 a t m=0 m=0 
where 
a on. 
(13) W,,(s) eee e-* L,,(2t) 


Consequently 


| U,,(s) W,(s)ds 


~ [7 vats [" 
0 o § 


- +t e~ L,(2t) 


| dt e~* L,(2t) V(t) 


1 
1 
2 
-1i 


To justify the change of order of integration in the repeated integral, we have 
merely to justify the change of order in the repeated integral 


¢“ L,(2s)ds | s+t e* L,(2t), 


2”P,,(z)dz. 


aw 


| 


for our repeated integral is the sum of a finite number of such terms. 
of these is, however, the sum of a finite number of terms of type 
rel, 


| e~* stds 
0 o s+t 


and in this repeated integral a change of order of integration is permissible 
because the integrand is positive. 
To justify the value adopted for the integral 


Each 


dt 


| dt e L,(2t)V,,(t), 
we have to verify the third expression for V,(s) and to show that this is indeed 
a Laguerre series. It should be noticed that W,,(s) is of the form 
A(s)Wo(s) + B(s), 


where A(s) and B(s) are polynomials in s. Consequently V,(s) is also of this 





form. Now the expansion 
: = dt . 2 
(14) W,(s) = | oF by ‘= V,(s) = >» om 1 e~* Lom(2s) (s > 0) 
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5 
is absolutely convergent because when m is large the m-th term is of order m™ . 
It follows then, by Abel’s theorem, that the series is equal to the limit when 
t — 1 of the sum of the power series which represents 


a ‘oe 1l+4t 
[ ee ex | - tt], 


and this is a Laguerre series in the variable s. The value on the left of (14) 
may thus be regarded as the sum of the series and the coefficients may be shown 
to be those derived by Laguerre’s rule. It may now be shown by means of the 
recurrence relations 


(n + I)Lavsilz) + nb,au(z) = (2n + 1 — 2)L,(2), 
(n + 1) Wats (s) + n Wi (8s) — (Qn + 1 + 2s) W,(s) + 2(-—1)" = 0 


« = , 
that | e* W,(s) L, (2s) ds = | 2™+"dz. 
0 — 


Making use of (12) we have the desired relation which gives 


Un(s)V,(s)ds = > —_— | U»(s)W,(s)ds 


. Jt 


mi 
>> @a.s _ Pale = | P,(z)PA(2)dz, 
p=0 —1 

and so the formula (7) is established. It was derived originally from the 
orthogonal relation for the Legendre polynomials by using the expressions for 
these polynomials in terms of U,,(s) and U,(#). When the integral equation 
possesses a suitable type of solution" g(t), we may write 


i dz » f{z-1 ar 2 greens 
| oe i i P, (5 : t) f(a) = | S(x)dr | t U,(s)ds 


. 


/ U,(s)ds [ ef (x)dx 


[ Uals)de | ax [ e Ft a( tat 


J, Usted | TS 


ll 


g(t)dt / Usls) = g(OV,(dat. 
Ju be 0 


'® We make no attempt here to formulate conditions sufficient for the validity of these 
changes in the order of integration, though this can be done with the aid of the conditions 
of de La Vallée Poussin. See Bromwich, Infinite Series, p. 503. 
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Under these circumstances, if 


m 


Sn(x) = f(r) — i Cn — 


m=1 


1p (-=+) 
z+1 *\r+1/’ 
mit) = g(t) — >> enV (2), 
n=1 
the coefficients c, which make 


[ Ua(2) dz 


as small as possible are the same as the coefficients c, which make 


P r* am(s)ds 
ba) 
| ° | s+ 


as small as possible and so we can form a definite idea of the way in which the 
present method gives an approximation to the solution of the integral equation. 

There are, of course, cases in which the integral equation possesses a solution 
and the integrals representing the constants c, fail to converge. The function 
f(x) = x” furnishes a simple example. It is possible also to choose constants 
c, [such as (2n + 1)P,(u)P,(v)] for which the infinite series of Legendre functions 
converges for almost all positive values of z but the infinite series of terms 
c,U,(t) fails to converge. It is doubtful, however, whether the integral equation 
has then a solution. 

A simple example in which the present method of expansion leads to an exact 
solution of the integral equation is obtained by putting c, = 2”. Then, if 
is} <1, 


= I a z—1 = 2 Pe as De =) 
sie) = 1, & wp, (2=1) (x + 1)?(1 + 2) — 2e(2?- DP," 


_.. 1+2 2tz 
git) = i 1 ex| - t (i = =| Ip la gos = 


The polynomial Z,(¢) thus has the generating function 


ie 1 2tz 2tz — 
aig i-s L- ( => . red > eee. 


This may be verified with the aid of the equation" 


2 an 
(1 — t)-*"*(1 + O° P,, (+*) = + i"F,.(— 2m — 1). 
ae n=0 


By expanding the generating function it is readily seen that 
Z(t) = oF 2(—n, n + 1; 1, 1; 2). 


and so 


1” H. Bateman, Tohoku Math. Jour., vol. 37 (1933), p. 23. 
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I 
or 


This result is easily generalized. Indeed, 
| e~*—t te Fy(— n, a + n;c, b; tdt 
(16) Jo 


= (1 +2)~*"' F(—n, a + n;c; u) T(b + 1), 
where u = (1 + 2"). Thus, expansions in series of Jacobi’s polynomials may 
be used instead of expansions in series of Legendre polynomials. Since, if 
C*(z) is Gegenbauer’s polynomial and | z| < 1, 

' zx—1 
r+b)(1 py et Cet ——— 
(1 + b) (1 + 2) (254) 
_ TQrv +n) 7 
niT(2v) J, 


(17) 
eat ' tho F's — n, Qy + Nip + i, b; t)dt, 


we have 
(18) r1+b6)(14+ 2)" (1 — 2 ~ =! + “)" = |] e~* g(t)dt, 
where 


a r : 
egt)=t du . eee 2" oF 2(— n, 2v + nj» + 3,6; 2). 


3. A second way of using orthogonal functions for the expansion of f(x) is 
to make use of the polynomials of Laguerre and the equation 


(19) go Let(s) = [ et O T(u, v, n; t)dt, 
where x~“J(u, v, n; 2) denotes the polynomial” 

I(v + fu+n +1)_ 

niT(u + 1) (v + fu + 1) 

which may be derived from the generating functions 


Fx(—n;u + lv + du + 152°), 





(20) (1 — #)-°-*T, [22t(1 — #)4] = DS oP J(u, v, n; 2) (jt| <1), 
n=0 
ef oF 2(u + 1,0 + fu + 1; —2°t) 


(21) 2 
= Mu + 1)r(v + jut 1) YS J(u, 0, nj x)x-“t"/T(v + fu + on + 1). 


n=0 


These results may be verified by first using the well-known inversion formula 
(Cn,, = binomial coefficient) 


m x 
ln = > a ’ by, = > Cor m—n On 
n=O n=0 


2° This polynomial may be used to construct a set of solutions of Laplace’s equation in 
four independent variables. 
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to obtain the expansion 


(22) zt = mI T(u + m+ 1) >) (—1)*Casguee.m—nd (u, v, 2; 2), 


n=0 


from which the others are readily obtained by summation. Writing D, for 
d/dx and J*’" for J(u, v, n; x), we have the recurrence relations 


D(a" J%") = Qe", Dart) = Hagel, 
—l,v t 7 —_ yp—1 . “ 
(23) adn + ed gry = ude, In ee I" = Io 


a u—l,v+ utl,v+) 
D, Jn" a Jn . _ aos ; ’ 


and the following relations are easily verified 


(24) J(u,v + m,n3z) = DS Crss-t1.2J(u, v, n — 8; 2), 


s=0 


xz“? J ,(2azr) a .% T(u +n+ 1)aPt2”. 


n=0 


Fu+n+1,n+4u+v4+1p4+n4+1;a*)J(u, v,n;2)/M(p+n+4+ 1), 


o~ 
to 
oO 


7 a 





’ . <1 sj u+l s 2w—1 
(26) i J(u, v, n; x sin 6)(sin 6)"*"(cos 4) dé 
= 3 (w)r J(u + w,v — fu, n; 2), R(u) > 0, R(w) > 0, 
| e-* J(u, v,n; x)a“dr 

(27) 0 

= Pa Tu + WC rstuannt F(—n,u + 40 + du + 1; 4), 

| Kfax)x""" I (u,v, nj xdex 

(28) Jo 


= 28g (uu — 6 + W)C orgusn.nl(—n, u + 1 — 8,0 + fu + 1; 407%). 


The cases u = —}3, u = } are of special interest. We then write 

(29) (1 — #)** eos [2rt(1 — #4} = DO PCli(a) (jt| <1), 

(30) (1 — @)-*"' sin [22t(1 — @)'} = >> @"*" Sie (z) (\t| <1), 
n=O 

(31) D,Cli(xz) = —2S1*i (x), D, SU (x) = 2CU.**(2), 

(32) (2n + 1)SU%(x) = 2 CL *Hx) + (Qv + 2)S12*) (x), 

(33) (2n + 1)SU°(x) — (Qn + 1 4+ 2v)SL2_ (x) = cD, SI% (2), 

(34) (n + 1)CU, (2) = (vo + DCI*Nz) — « SU**K(z), 


(35) Qn Cl (x) — (2n + Qv)CUE_ (x7) = cD, CU(2). 
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4. Instead of expanding f(x) in a series of orthogonal functions associated 
Writing 


with the range (0, x) we may use a newtonian series. 
4 

C, = | e “N,(t)dt, 
—, ntl 0 


where N,(t) is a polynomial of degree n in ¢, it is readily seen that, if lz <3, 


zNo(t) + 2? Nit) + ANo(t) + --- = flog (1 + 2) H[2{t log (1 + 2)}4). 
The polynomial N,(t) is closely related to Angelesco’s polynomial*' A,(x), which 


may be defined by means of the contour integral 
A,(x) = 1 [Se ( a i)(2 - 2) an ( = n). 
2nt Jc z z : 


If 
q"t 


B,(x) = (—1)" dyn 


gar ' 


{1 — 


Angelesco obtains the relation 


| A, (x) B, (x) dx = bmn, 


which enables one to find the coefficients in a series of functions A,,(2). 


CALIFORNIA INSTITUTE OF TECHNOLOGY 

21 A. Angelesco, Jour. de Math., (9), vol. 2 (1923), p. 403. Our notation differs from that 
of Angelesco as we wish to avoid confusion between his polynomial, which he denotes by 
P,(x), and the Legendre polynomial. The contour in Angelesco’s integral is a simple one 
enclosing the point z = 0. The function A,(z) is a solution of the integral equation 


(; . i)(2 i. ') cad ( - ') - x # 4,( dt. 
2 2 x m Je 


T\r 











UBER DEN FUHRER EINES RINGES IN ALGEBRAISCHEN 
ZAHLKORPERN 


Von MicHaet BAvER 


1. Es sei a eine primitive ganze Zahl des algebraischen Zahlkérpers K = R(a). 
Der Fiithrer des Ringes O(a) ist bekanntlich ein Ideal, dasselbe soll durch 
t(a) bezeichnet werden. Nach Dedekind hat man 


(1) v = t(a)d, D\| = N(d), 


wo #, beziehungsweise } die Differente der Zahl a, baw. des Kérpers K ist, 
D ist die Kérperdiskriminante und N bedeutet die Norm. 

Ore! hat den Fiihrer des Ringes O(a) einer weiteren Untersuchung unterzogen, 
indem er den Fiihrer inbezug auf die Primzahl p, bzw. auf das Primideal p 
eingefiihrt hat. Die Ideale t,(a) baw. t,(a) werden durch die ganzen Zahlen ¢,, 
bzw. ¢e gebildet, fiir welche 


giw = P\'(«) (mod p') baw. gw = PS'(a) (mod p*) 
ausfallen, wo P{'’, PS’? rationale ganzzahlige Polynome sind. Die Zahl » ist 
eine beliebige ganze Kérperzahl, ferner ist ¢ eine beliebige positive rationale 
ganze Zahl. Nun ist zuniichst 


(2) t(a) = [J t,(a). 


Das Ideal t,(a@) liisst sich weiter zerlegen, geniigt a der irreduziblen ganzzahligen 
Gleichung F(x) = 0, so spielt bei der Zerlegung von t,(a) die Zerlegung von 
F(x) (mod p”) in irreduzible Faktoren eine entscheidende Rolle. Es sei im 
Kérper K in Primideale zerlegt 

(3) p =pf'--- pf*..- per, 

wo p, ein Primideal g,-ten Grades bezeichnet, dann ist in irreduzible Faktoren, 
deren héchste Koeffizienten gleich Eins sind, zerlegt 

(3*) F(x) = F(x) --- Fy(x) --- F(x) (mod p”) 

wenn vy = 6 + 1 ausfillt, wo p*® die héchste Potenz von p ist, die in der Diskri- 
minante D(F(z)) enthalten ist. Gehért p, im Sinne von Ore* zum Polynom 
F(z), so ist sein Grad n, = exg. Es sei F(a) genau durch p;" (k 4 0) teilbar. 
Das Ideal ty, (a) ist eine Potenz von p,, man kann setzen 


(4) ty, (a) = pit, 


Received May 14, 1936. 
1 Uber den Zusammenhang zwischen den definierenden Gleichungen und der Idealtheorie 
in algebraischen Kérpern, Math. Annalen, Bd. 96 (1927), S. 313-352. 
? A.a.O. 8. 326 
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und es ist nach Ore 
(5) tp(a) = II piette "= > Yel, vu = 0. 
Die Zahlen yy (k ¥ 1) sind fiir »y = 6 + 1 invariant. 


2. Die Relation (5) kann in zwei Teile getrennt werden. Zunichst wird 
behauptet 


(6*) tp(a) teilt das Ideal a, = Il pretre, 
k=] 


Diese Tatsache wird a.a.O. auf 8S. 336 bewiesen. Dann aber wird gezeigt> 
dass t,(@) keinen echten Teiler von a, bildet, also besteht (5). Es kann aber 
leicht mit den von Ore beniitzten Mitteln, unabhingig von (6*) bewiesen 
werden,’ dass 


(6**) t,(@) durch das Ideal a, teilbar ist. 


{s ist nicht ohne Interesse, dass die Relation (5) sowohl aus (6*), als auch 
aus (6**) gefolgert werden kann, wenn gewisse Sitze, welche von Ore in der 
genannten Abhandlung bewiesen wurden, angewendet werden. Diese Siitze 
werden im Folgenden zusammengestellt, der Beweis kann ohne die Theorie der 
Fiihrerzerlegung gefiihrt werden.‘ 

Js seien p**, baw. p’*™ (k # 1) die héchsten Potenzen von p, die in der Dis- 
kriminante D(F;(x)), bzw. in der Resultante R(F;(x), F:(xz)) enthalten sind. 
K's sei ferner 


F .(&) = 0 


und die Indexe der Zahlen a baw. & beziiglich der Kérper K = R(a) bzw. 
K® = R(a&) sollen genau p*, bzw. p** enthalten, dann ist fiir geniigend grosse® y 


(7) v= Dkt Dm 
k=1 k>l 
(8) Pkt = Jeri (k = l). 


* Es ist zu beachten, dass im Falle 7, + rx 2 1, solche rat. ganzzahlige Polynome U,(z), 
G,(x) vorhanden sind, wofiir II,(z)G,(z) # 0 (mod p) ausfillt und der Grad der linken Seite 
kleiner als der Kérpergrad von K ist, ferner bildet I,(a@)G,(a)p™' eine ganze Zahl, welche 
genau durch pratrent teilbar ist und eine beliebig hohe Potenz des Ideals ), (1 # k) enthalt. 
Andererseits kann t, (a) nur die Primideale ~i, ---,De, -+*,Pr, als Teiler besitzen. 

*Vgl. meine Note Bemerkung zum Hensel-Oreschen Hauptsatze, Acta Litt. ac Scient. 
reg. univ. Hungaricae Francisco-Josephinae, tom. 8. Wir kénnen bei unseren Betracht- 
ungen die Relationen (7), (9*), (9**) durch andere ersetzen, welche in Fricke, Lehrb. der 
Algebra, Bd. 3, Braunschweig, 1928, angegeben sind. Vgl. (12) 8. 118 und (20) 8. 114. 

® Hier muss v nicht méglichst klein gewihlt werden. 











580 MICHAEL BAUER 


Im Koérper K® = R(a) wird p = p;*, der Grad des Primideals j, ist gleich g,, 
aus der Definition des Fiihrers folgt 

(9*) ty(du) = ty, (a) 

und Ore hat bewiesen, dass 

("") ti, (a) = bi 


ist. Aus dem Vorhergehenden folgt 


(10) p¥ = N(t,(a)), 2y = > Jk Te + p> D 9K = > gi(te + x). 
k= k=1 1=1 k= 


Man kann aus (10) die Relation (5) beweisen, sobald irgendeine der Relatio- 
nen (6*), bzw. (6**) feststeht. Es ist namlich 


tela) = Do pyttrates 
k= 


wo « £0 bzw. «& = 0 ist, jenachdem (6*), oder (6**) gilt. Hieraus folgt 


> gilts + vk + &) = > gi(te + Ye); 
k=1 k=1 


woraus sich e« = 0 ergibt. Infolgedessen ist (5) richtig. 


Bupapest, HUNGARY. 














DIRECT DECOMPOSITIONS 
By OysTEIN ORE 


One of the fundamental representations of algebraic systems is the decomposi- 
tion into direct products. The principal theorem on direct decompositions has 
been proved for ideals in commutative and non-commutative rings under vari- 
ous assumptions; for groups it is the well-known Schmidt-Remak theorem. 
In a recent paper' On the foundation of abstract algebra I have shown that the 
main theorem on direct decompositions holds for all Dedekind structures which 
are of finite length or satisfy one chain condition and an additional restriction. 
It should be observed that such an additional condition is necessary, since it 
is known that the theorem is not true for all Dedekind structures satisfying one 
chain condition.* 

In the present paper the properties of direct decompositions are studied 
further and various new and interesting facts about such decompositions are 
obtained. The method incidentally gives a new proof for the main theorem 
valid in the finite case or, with an additional restriction, when only one chain 
condition is satisfied. The method is a generalization of a method introduced 
by Krull* in the study of the so-called generalized abelian groups. The results 
of Krull are extended and refined in various ways and the theory is greatly 
simplified by the use of structures. It may not be superfluous to observe again 
that the formulation of the theory in terms of structures gives it a great general- 
ity, making it valid for all algebraic systems in which the Dedekind axiom is 
satisfied. As an example let us mention that the present theory is valid for 
arbitrary groups, a case to which the original theory of Krull was not ap- 
plicable. 


Chapter 1. The main theorem 


1. Theorems on components. Before we can begin the principal investi- 
gations it is necessary to mention a few facts about the so-called components. 
The components have already been defined in II, Chap. 1, but we shall have to 
recall some of their properties here. Let 


(1) M = (Bi, Bel, (Bi, Be) = G 


Received April 13, 1936. 

1 On the foundation of abstract algebra, I, 11, Annals of Math., vol. 36 (1935), pp. 406-437, 
vol. 37 (1936), pp. 265-292. These articles will be referred to in the following as Ore I and 
Ore II and the terminology of these papers will be used in the following without further 
explanation. 

2 EF. Steinitz, Rechteckige Systeme und Moduln in algebraischen Zahlkérpern, 1, Math 
Ann., vol. 71 (1911), pp. 328-354. W. Krull, Matrizen, Moduln und verallgemeinerte Abelsche 
Gruppen, Sitzungsberichte Heidelberg, 1932, pp. 13-38. See also the discussion in Ore I] 

*W. Krull, (ber rerallgemeinerte endliche Abelsche Gruppen, Math. Zeitschr., vol. 23 
(1925), pp. 161-196. 
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be a direct decomposition of a quotient Yt in a Dedekind structure >. Here 
B, and B, are quotients with the same denominator and G is the corresponding 
unit quotient. Now let S be a factor of Pt. The quotients 


(2) BilS} = (Bi, (Be, S)), B2{S} = (Be, (Bi, S]) 
are then called the components* of S in B, and Be. 

One finds that the union of the components of two factors S,; and Sz in the 
same quotient B; is equal to the component of the union [S,, Se] in the same 
Bi: 

(3) Bif(Si, Sol} = (BiiSi}, Bef Sof]. 

We may now prove 


THEeorEM 1. Let S; = Se be two factors of M. Then there exists the following 
similarity relation between their components in By: 


(4) Bi} Si} / Bil So} 


Ile 


Si [Se, (Bo, S:)). 


One finds by transformation with 8, that the last quotient is similar to 
(Bs, S}/[Be, S2] and a contraction with B, gives the desired similarity (4). 
A consequence of Theorem 1 is the 

Lemma. The component B,{S} is similar to a left-hand factor of S: 


(5) BiiS}| = S/(S, Be). 


Ile 


From this lemma it follows that 8,{@j is the unit quotient if and only if S 
is contained in Bs and B,{S} is similar to S if S is relatively prime to Bs. 

THeoreM 2. Let € be a factor of M and let S, and Se be two factors of ©. 
The necessary and sufficient condition that S, and Ss have the same component 
in ©, is that 


(6) Si, (Be, )] = [Se, (Be, ©)}. 
From the identity 
(Bi, (Si, Be]) = (Bi, (Se, Be) 


one obtains by taking the union of both sides with 8B. and applying the Dede- 
kind axiom 


Si, B.| _ (Se, Bo], 


and (6) follows by taking the cross-cut of both sides with ©. The sufficiency 
of the condition (6) follows directly from (3). As a special case of Theorem 2 


‘ These components are easily seen to correspond to the ordinary notion of components, 
for instance in group theory, where the component of a sub-group S in & is the subgroup 
of W consisting of all elements @ occurring in the direct product representations ¢ = af 
of the elements o of S. 
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we see that the necessary and sufficient condition that any two factors S, 
and Ss of M@ have the same components in QB, is 
(7) [S:, Be] = (Se, Bel. 

THEorEM 3. Let € be a factor of M and Bi some factor of B,. The maximal 

factor of B{ which is the By-component of some factor ©, of © is 
Bi = (Bi, BilC}), 

and the maximal factor of & having its B,-component equal to BY is 

(8) G, = (GC, [B;, Be). 

Since any factor of © has a B;-component contained in B,{C}, it follows 
that BY is the maximal factor of 8), which may be the B,-component of some 
factor of ©. It is easily verified by means of the Dedekind axiom that the 
quotient ©, defined in (8) actually has the B,-component 8/. Furthermore 
it follows from Theorem 2 that ©; is the maximal such factor, since it contains 
(S., ©). As a consequence of Theorem 3 we find that the necessary and suffi- 
cient condition that a factor 8} be the B,-component of some factor of © is 
that B) be contained in B,{C}. 

These results have all been derived for right-hand divisibility, but the anal- 
ogous results hold for left-hand divisibility. 


2. Reductions and increments. Let us now suppose that there exist two 
direct decompositions of the same quotient 
(9) mM = [%, A] = [B, Bo], (%, WM») = (S:, B.) = G. 

We shall now introduce a new operation which consists in the repetition of the 
operation of taking components. Let & be a factor of %; and 8,{S} its com- 
ponent in 8,;. This component again has a component in %,, namely 

(10) RS} = ALBilS}} = (i, (Me, (Bi, (Be, Al) 

and this quotient shall be called the reduction of S (with respect to B,). For 
the special case S = %, we shall write 

(11) Ry = RY (Mj. 

A second fundamental operation is derived from the consideration of the 
following problem: For a given factor S of %, we wish to determine the maximal 
factor of %, such that its reduction is contained in S. Through a double appli- 
cation of Theorem 3 we find that the quotient in question must be 
(12) NY {S}. = (MH, (Be, (Bi, Me, SY), 


and we shall call it the increment of S (with respect to B,). The increment 
of the unit factor &) obviously represents the maximal factor of %, having its 
reduction equal to @. This factor 


(13) MY = NY {Go} = (Mi, (Be, (Bi, Mz))) 
we shall call the null-factor of %, (with respect to B,). 
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Before we study the properties of these new concepts, let us for a moment 
consider the corresponding left-hand concepts. These furnish us with a dualism 
between reduction and increment which manifests itself in all the following 
investigations. 

To the given direct decompositions (9) there exist corresponding left-hand 
decompositions 


(14) M = (AT, Whe = (BT, Bek, 


where 





A =—M A, AP = MM, BT =M By, B= M/B, 


are quotients similar to %,, %:, Bi, Be respectively. There exists a structure 
isomorphism between the r.h. factors S of %, and the lh. factors S* of AT 


given by the correspondence 
(15) S* — M/[%e, SI. 


1 : : ~ ° * ‘ 
The |.h. reduction and increment of S* with respect to B, are then easily 
found to be 





RI(S*! = M/[%e, NY? FSI], 
Ni1S*} = M [%e, RS Sh], 


and hence the correspondence (15) makes |.h. reduction correspond to r.h. 
increment and |.h. increment to r.h. reduction. From this correspondence 
one also obtains the following similarity relations 


* ~~ rol = *~ ‘ly 
‘ *! ds ' y™ i ‘ ‘ 
R,(S*} =Ny' tS}, Ni tS} = Rts}, 


(16) 


and hence one obtains for = = © and S = A, 
RI=MRy, N= Ri. 
3. Theorems on reductions and increments. We shall now derive a series 


of properties of these operations. From the relation (3) and the definition, 
the reduction must have the distributive property 


(17) RV WS, Sl} = (RV ESal, RYE Sef], 
and in a similar way one proves 
(18) N'')}(Z1, S2)} = (NY FSi}, NY? 1 Se}). 
From these relations we obtain the fact that if Z,; 2 FS,» are two factors of Y%, 
then 
RY 'Ss} > Ry (Ssh, NY’ {Si} > NY" (Se). 


Very important are the following relations, which may be proved from the 


definitions through a repeated application of the Dedekind axiom. 








1c 
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THEOREM 4. Let © and T be the two factors of %,. Then the following identi- 
ties hold: 


NOUR (S}, TI} = (S, NOTH], 
RY UNY?{S}, T)} = (S, RY IT)). 
For T = G and T = YA, we obtain the simpler identities 
NOR PIS = (S, NY), 


(1) r(1 > (1) 
R [NY {S}} (S, RY’), 


(19) 


(20) 


and hence also 
7(1) 1) 1 ) ‘ 
(21) N \y {KR } = %, ’ R‘ IN } = Y. 


Let us also mention the following symbolic identities, which one can derive 
from (20): 


N-R-N = N, R-N-R = R. 


By means of these relations we prove easily 
THeoreM 5. The necessary and sufficient condition that two factors S, and 
Se of A, have the same reductions is 


(22) [S1, MV) = (Se, NY). 
The necessary and sufficient condition that they have the same increments is 
(23) (S1, REY’) = (Se, RY). 
To prove that 
(24) RY }S1} = RY {Se} 
if and only if (22) is satisfied, we observe that the relation (22) follows from 
(24) by applying (20), and (24) follows from (22) by means of (17). The 


condition (23) is proved in a similar way. 


THEOREM 6. Let S, = Se be two factors of A,. Then the following similarity 
relations hold: 


1 ~ (1)y> ~ ~ ~ 1) 
- RS NS} R\ $}So} >< [Se, (<1, plat )I, 
(25) 


11) nly ~ ~ (1 ~ 
NY Sal /NYV Se} S (Ga, (Se, RY’) / Ss 
and for S; = M1, S2 = Go follows from either relation 
(1) s( 1) 
4/RNY SMV. 
The first similarity relation (25) may be obtained by applying Theorem 1 


twice, and the second follows dualistically. 
Let us finally mention 
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_ 


THEOREM 7. Between the reductions and null-factors of A, with respect to B, 
and Bz there exist the relations 


(26) A, = (RY, RY), Go = (MY, MY). 
According to (3) we have for any factor S of % 
[RiP {S}, RS? {S}] = Bh, (Me, Bi, Bs), 
where 8; and SB} are components of S in B, and Bz respectively. Since 
[Bi, Bz] = [(B1, (Be, SP, (Be, (Bi, SP] = (Bi, S] (Be, S) > SF, 


we also find 





[RY {S}, RP {S}] > S, 


a relation which gives the first of (26) as a special case. The second relation 
(26) may be obtained directly from the definition of NY’ and RY. 


4. The algorithms. We shall now consider the algorithms consisting in 
repeated application of the two operations of taking reductions and increments. 
We define the n-th reduction R{ {S} of a factor S of %, as the result of taking 
n successive reductions of S. We define similarly the n-th increment NY {S}. 
Let us also write 





RY _ RY {%}, ny” - NY {G}, 
where 9S") shall be called the n-th null-factor of %, with respect to B,. It is 
the maximal factor of %{, for which the n-th reduction is equal to the unit quo- 
tient. The relations between left-hand and right-hand n-th reductions and 
increments are the same as those indicated in (16) for the case n = 1, reductions 
corresponding to increments and increments corresponding to reductions through 
the isomorphism defined by (15). 

One may now derive properties for these general operations corresponding 
closely to those formerly obtained in the case n = 1. We observe first that 
for R\’{S} and J\"'}S} we have the distributive properties expressed by (17) 
and (18). Through mduction the relations (19) may be generalized to 

NY (RY 1S}, TH = (FS, NET}, 
RY UN 1S}, ZT} = (S, RY IT)), 
‘and for T = G and T = A; we obtain the special cases 
NP {RY {S}} = (S, NM, 
RY IN {S}} = (S, RY”). 

From these relations we obtain the following generalization of Theorem 5: 

THEeoreM 8. The necessary and sufficient condition that two factors S, and S. 
have the same n-th reduction is 


(29) Si, mi” = [S, NY” ]. 


(27) 


(28) 








_— Fr vs 


Ue 








DIRECT DECOMPOSITIONS 587 


The necessary and sufficient condition that they have the same n-th increment is 
(30) (S,, RY) = (S, RY”). 
The proof is analagous to that of Theorem 5. From Theorem 6 one obtains 
by induction the more general similarity relations valid for S, = S:2: 
RY” (S,}/RY {Ss} = S/[Se, (Si, ny”)), 
(31) Mis I/N(Yis x es (n) = 
NY US Y/NY'(Se} = (Si, (Se, RY / Se, 
and again for S; = %1, S$: = G, 
(32) A /NY? = RY. 
Let us finally prove 
THEOREM 9. For all m and n we have 


(33) Mt, -_ [Ry”, RY”), 
and 
(34) G = (i, NP”). 
This theorem is true for n = m = 1 according to Theorem 5, and hence it 


may be proved by induction. We shall suppose that the theorem holds for all 
n+ m < No and prove that it is also true when n + m = No. 
Let us observe that 


xe = xo, Ry? j= RI”. 
It follows then by the Dedekind axiom from the induction condition that 
My = (RO, REPL, (RE, REY = HLM, RE, CHE, BE”), 
and hence it is sufficient to show that 
(35) U = (RM, RP] S RP-?, Ry”). 
However, since U is the union of the Y%4,-components of B | Ry"? | and B RY" |, 
it is also according to (3) the %,-component of 
[BERN PY, BoM" PY] = (CB, [By RP -PY, CBs, [Br RPP Y)] 
= ((9,, RY”), (9, 8") > (RY, RY”), 
and the relation (35) follows immediately. 
The relation (34) follows by a dualistic process. We have 
GE = (CRP, RE), (NP, RO) =_ AL’, NE’, ['P-?, RE", 
and hence it is sufficient to prove that 
(36) B= (RY, RO) s (RY, RI. 
Through simple reductions one finds 


We= (%, 1B, (NN, MWD, Be, (MP, WDD Ss A, Me, NY, RE", 


and from this relation (36) follows immediately. 
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5. The invariants. It follows from their definition that the reductions form 
a decreasing sequence 
(37) 4%, = WY 
while the null-factors form an increasing sequence 
(38) Gq sr sR s.---. 


If at any point of these sequences the equality sign holds, it must hold for all 
following terms. Let us now suppose that in the sequence (37) all terms become 
equal after the n-th. We shall then write 


(39) R= RK =KR = 


“— 
and we shall call i,,; the reduction invariant of %, with respect to B,. The 
quotient 2y,; must always exist when the descending chain condition is satisfied 
in IN (or in %,). The name is justified by the fact that 


(40) RifRial = Ria. 


Similarly, if the terms in the sequence (38) all become equal after the m-th, 
we write 


(41) Mai = M +’ = M ‘hie = 


an 
and we call 2j,, the tnerement invariant of %, with respect to B,. It has the 
property 


(42) NiMiay = Mia, 


and it must always exist when the ascending chain condition is satisfied in DW. 
We can now prove 
THEOREM 10. When the invariant %,,; exists and is defined by (39), then 


(43) Y, = [ey ly eg | 
and if N;,, exists and is defined by (41), then 
(44) &, = (Mai1, Ry”). 


Theorem 10 is a consequence of Theorem 8, since when %),; exists, the quo- 
tients %, and 9,,; have the same n-th reduction. The relation (44) follows in 
a similar way. It may be observed that according to (39) and (41) the rela- 
tions (43) and (44) will also hold for all larger n and m. 

From Theorem 10 we obtain in turn 

TuHeonem 11. Jf both invariants Ry, and Ny, exist, %, has a direct decom- 
position 


(45) A = (Nir, Naas! (Ri, Naa) = Go. 


” 
We shall say that %, has a regular decomposition with respect to B,, when the 
relation (45) holds. It is obvious that a regular decomposition always exists 
when Wi (or A, only) has a finite length. We shall see later that it also holds 
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under much more general conditions. One very interesting property of the 
regular decomposition (45) is that it is explicitly expressible by means of the 
components in the given decomposition (9). 


6. Properties of invariants. We shall now only consider quotients Yt hav- 
ing the property that the regular decompositions exist for any component in 
any direct decomposition of Mt. Such a quotient YW or structure = may be 
called a regular quotient or structure. We shall discuss later the condition 
for a quotient or structure to have this property. 

For such regular quotients we can prove 

THEOREM 12. Let %i.; be the reduction invariant and Ny., the increment 
invariant of %, with respect to B,, and similarly Ry, and N,, the corresponding 
quotients for B, with respect to %,. Then Rj. is the Bi-component of Ri. and 
Mi .1 7s the A-component of Nir, 


(46) Mia = (Di, (Be, Wiad), Mir = (Whi, Me, Mia). 
Similarly, one finds 


(47) Nia = (Bi, [%e, Nia), Mia (M1, [Be, Na)). 


It follows from the definition of the invariants that the B,-component of 
Mi.. must be unchanged when one takes reductions of it in %,, and hence we have 


Mir S (Bi, (Be, Mad). 
In the same way one finds 
Wir = Mh, Me, Wiad, 


and the substitution of one relation into the other shows that the equality sign 
must hold. The relations (47) are proved in a similar manner. 

THEOREM 13. When %i,,1 and N,,2 are the reduction invariants of A, with 
respect to B, and Be, similarly, Ni. and No the corresponding increment invari- 
ants, then 


(48) %, = [Mia, Mel, Go = (Nia, Mie). 


This theorem is a consequence of Theorem 9. We mention also the 
lowing result: 
Tueorem 14. The two equivalent conditions 


Wha Yh, 
imply 
(49) (%L,, Be) 
From My; , follows 


a = (Mh, [Wo, (Ye, ¥,)}) _ So, 
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and hence the first condition (49) must be satisfied. Since %1,, = %, we must 
also have R\'', = %1, and hence 


mM = [RY , Ys] = (As, (B:, [Be, %f,])]. 


This implies the second relation (49). 
Let us now substitute the decompositions 


(50) Yq, = [Mi.1, Nail, B, = [Ri.1, Ns, ] 


in the original decomposition (9). We then obtain the further direct decom- 
positions 


(51) Mm = [Mia, Nia, Ae] = (Ra.2, Nia, BI. 
If one takes here the component of %1,; in 91,1, one finds that it is equal to 
Ri,1 by using the identities (46). In the same way %;,; is the 9j,;-component 
of Ri1, and R:,, and §,,,; must be their own reduction invariants with respect 
to each other. 

This last remark in connection with Theorem 14 gives us the following 
relations: 
(52) (Mir, (Nir, Bel) = (Ria, (Nia, Mel) = G 
and 
(53) M = (Rar, Nir, Ae] = (Ria, Nir, Bel. 


From these results we conclude: 

THEOREM 15. In the direct decompositions (51) for M the two quotients Ri, 
and R,, may be interchanged to give the two new direct decompositions (53) for M. 

Another interesting fact is the following: 

TueorEM 16. All null-quotients NY, (n = 0, 1, --- ) are invariant with 
respect to taking reductions in By, 
(54) Ry iRY} = RY, 
and hence all of them are factors of Ru. 

We shall prove (54) by induction. We observe first that 

nv Ss (Bi, [Be, NY), (Be, (Bi, Ni). 


We take the %,-component of both sides and apply (20). Since R’Z is a factor 
of %, we find 
(55) NV S (RINT, TY, Ri). 
On the other hand, we have 
(By, (Be, RV) S (RY, Be, (B,, NVI, 
and by again taking the %,-component we find 
RyIRNTZ} SRP, MTP, RY) = Nie. 


Hence both terms on the r.h. side of (55) are contained in N'} and the equality 
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sign must hold. Furthermore, since the theorem holds for 2{'y'’, the term 
(NYT”?, RY?) is contained in Ry {NY} < Ry{NYP} and (55) is proved. 

A consequence of Theorem 16 is that Ni: is a factor of Ry and also Nya 
factor of Miz. Since we also have 


4%, = (Ru, Nu] os [Mis, Niel, 
we find the further direct decompositions 
(56) Ru = Mie, (Mu, Ri2)], Mie = (Nu, (Mu, Rie)). 


THEOREM 17. Let M be a regular quotient for which there exist two direct 
decompositions 
M = (Ai, Ae] = [Bi, Be). 
Then for each quotient A, and A, (or B, and Bz) there exists a direct decomposition 
into three quotients 


(57) Mi, = [Nu, Nie, un, Rre)]. 


7. Proof for the main theorem. ‘There are still a number of interesting 
properties of the invariants R and M which we have not touched upon. Some 
of them are of considerable importance for the study of the properties of direct 
decompositions. For instance, we have seen that 2, is the maximal direct 
component of %, which may be interchanged with a direct component of B, 
and %i2 has a similar property with respect to B.. One may now ask for the 
maximal direct component of %, interchangeable both with a component of 


%, and B.. This would lead us to find factors of %, which are invariant both 
with respect to taking reductions in B, and B:. Such an investigation might 
be carried through along the same lines as here. The theory may also be ex- 
tended to the case where 2? is the direct union of an arbitrary number of quo- 
tients %; and B;. In this case one has the possibility of taking reductions with 
respect to one or more components in arbitrary orders and one obtains a great 
number of various reduction and increment invariants with corresponding direct 
decompositions for the %; and B;. We shall, however, not carry through the 
discussion of this theory. 

We shall conclude these investigations by applying them to the proof of the 
main theorem. 

THEOREM 18. Jn two different direct decompositions 


(58) M = [%h,---, W) = (Wh, ---, Bl 


of a quotient M into direct indecomposable quotients, both sides must contain the 
same number of quotients directly similar in pairs. 

We shall prove the theorem under the assumption that its r.h. (lh.) factors 
always have regular decompositions (45). This is certainly the case when 
has a finite length, but it is also true for more general structures, as we shall 
see in the next chapter. 
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Let us prove first that any %; may replace a suitable %; in (58) to give a new 
direct decomposition of IM. We write 


G, = (Me, --- , Ml, Bi = [B2, +++ , Bal, 


and we shall prove that %; may replace some %;. Since %, is directly inde- 
composable, its reduction and increment invariants with respect to any quotient 
are either U%, or ©. If the reduction invariant of %, with respect to B, in (58) 
is equal to %,, then %, and B, are interchangeable according to Theorem 15. 
Hence we may suppose that the reduction invariant of %, with respect to B; 
is ©. According to (56) we then have that the null-quotient of Yt, with respect 
to B, must be G, and this is easily seen to imply (%, B:) = G. 

From Theorem 13 it follows that in our case the reduction invariant of Y, 
with respect to B, is %,, while the corresponding increment invariant is G. 
The corresponding quotients for 8, with respect to %, are then found by 
Theorem 12, 


R= (B,, [B., %)), R= (Bi, 4), 


where ® is directly indecomposable because it is similar to %. This gives us 
the direct decomposition 


(59) B, ae (Bi, [B,, %,]), (B:, %)] = [Bo, tae B,|. 


If we now suppose we have proved that any quotient Y%; in a decomposition 
(58) may replace some %; when the r.h. decomposition contains less than s 
terms, we may apply this result to (59). When ® may replace B.2, we find 


Bi - (Bi, (Bi, %), Bs, Pe , B.), 
and by taking the union with B,, one obtains the new direct decomposition 
Mm = [B,, W,, Bs, ---, B.I, 


showing that %, may replace B.. 
The proof of Theorem 18 is now simple. We suppose that %, may replace B,, 


mM = [B,, o9 €% %, | = IY, B2, en B, |. 
This shows that %, and B, are directly similar. The quotient 
(60) we x + - Ih, Ye, 47", it ” Mh, Be, 44, e*e 


is then a lh. factor of IN, but it is also similar to the r-h. factor %,. Using 
induction, we may assume that the theorem is true for the quotient (60); hence 
we have r = s and the direct similarity of the other quotients M%; and YB; is 
easily obtained. 


Chapter 2. Conditions for the main theorem 


1. Existence of the invariants. The theory which we have derived in 
Chapter 1 depends entirely upon the existence of the decomposition (45) of a 
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quotient Y%, into its reduction and increment invariants. Even when no finite- 
ness condition of any kind is imposed upon the quotient YY it is possible to 
define reduction and increment invariants under very general conditions, but 
it seems difficult in this case to draw any conclusions about the existence of 
direct decompositions of the form (41). Hence we shall have to suppose in 
the following that either the ascending or the descending chain condition is 
satisfied in M. 

We shall use the same notation as in Chapter 1, considering a direct de- 
composition 


(1) mM = [%,, BY P| = [B,, Bo]. 

We may drop all subscripts of the occurring quotients, since we shall only con- 
sider the reductions of % in B,. In this simplified notation let us recall that 
the reduction invariant ® and the increment invariant N were defined by 

(2) R= RY = ROD = 

and 


(3) N= RNC = Jw = 


Let us make the preliminary observation that when # and M exist, we have 
n = m for the smallest indices n and m for which (1) and (2) hold. When ® 
exists, we have according to Theorem 10, Chapter 1, 


(4) %, = (RM, N™], 


and by taking the n-th reduction of both sides we obtain W°" = ® and hence 
m =n. Similarly it follows from 

(5) GF = (NM, RO™~) 

that nm 2 m. 

Let us now suppose that the ascending chain condition holds in Mt (or only 
in %). The increment invariant % must then exist and the relation (5) holds. 
Our problem is to determine the conditions for the existence of R. According 
to the preceding remark, the necessary and sufficient condition for this is 


(6) MOY = MOrtD 
or, as one easily sees, 


(7) YA, = [R™, NR. 


From Theorem 6, Chapter 1 it follows that we always have the similarity 
relation 


REMEDY MOM /(ME, MY), 


Since RC” is relatively prime to N according to (5), it is also relatively prime 
to MR, and hence we obtain 


(8) RHO SMO, 
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This proves that if % has no factors D and D’ < D such that D > D’, the reg- 
ular decomposition 

(9) %, = [M, Nl, (RW, 2 = G 

must exist. 


Similarly, if the descending chain condition holds in %,, the reduction in- 
variant ® must exist and we find corresponding to (8) 


(10) Mi /NOO Sj AL/N™. 


Hence in this case the regular decomposition (9) must exist provided %, has 
no left-hand factor D with a proper I|.h. factor D’ such that D =] D’. 

THEOREM 1. Let I be a quotient satisfying the ascending (descending) chain 
condition. Then the regular decomposition (9) will exist and hence the main 
theorem about direct decompositions will hold provided M has no right-hand (1.h.) 
factor D containing a proper r.h. (l.h.) factor D"’ such that D and D’ are similar. 

One may, however, improve considerably upon Theorem 1 by observing that 
the similarity relations (8) and (10) are of a very special nature. Let us suppose 
again that the ascending chain condition holds in %; and let us denote by B°™ 
the component of R°” in B,. We find then 


(11) [R, ¥.] = [B™, ¥2], 
where 
(12) (R™, Bs) = (BW, Bs) = G. 


The last relation (12) is obvious, because 8” is a factor of B, and the first 
follows from the fact that 2” is relatively prime to N® according to (5). 
The component of B°™ in % is RY and we find as before that 


(13) [Mo , A] = (BC, As], 
where we also have 
(14) (Ri), A.) = (B™, Ws) = G. 


To prove the last relation it is only necessary to observe that (%:, 8°”) is 
found to be the B,-component of 


(R™, NY) = G. 


We have formerly defined two quotients % and % to be directly similar when 
there exists a third quotient € relatively prime to both % and B such that 


(a, ©} = [B, G]. 


The notion of direct similarity in a Dedekind structure is usually not transitive. 
This leads us to introduce another special type of similarity: two quotients YF 
and 8 are said to be semi-directly similar when there exists a © to which they 
are both directly similar. 

The relations (11), (12) and (13), (14) show that R°(™ and MC"! are semi- 








is 
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directly similar. One may define left-hand semi-direct similarity in a corre- 
sponding manner, and one finds naturally that when ® exists, the two quotients 
%/N~ and %,/RN*Y are Lh. semi-directly similar. 

THEOREM 2. The main theorem holds in M when the ascending (descending) 
chain condition is satisfied and M contains no r.h. (lh.) factor D with a proper 
r.h. (Uh.) factor D' such that D and D' are semi-directly similar. 


2. Axiomatic conditions. The preceding results naturally lead us to the 
consideration of structures having the following special property: 

I. (r.h.). When A = A’ and A and A’ are r.h. semi-directly similar, then 
A= W’. 

We may say that a structure = in which this condition is satisfied is r.h. semi- 
directly regular. One may also express the condition for r.h. semi-direct regu- 
larity in the following manner: 

I'(r.h.). Let A 2 A’ be two elements in the structure =. If the relations 


[A, B| = [C, B), [C, D] = [A’, D} 
(A, B) = (C, B) = (C, D) = (A’, D) 


(15) 


hold, we can conclude A = A’. 
The dualistically corresponding condition for |.h. semi-direct regularity is 
obviously 
l’(.h.). If we have A = A’ and the relations 
[A, B] = [C, B] = [C, D] = [A’, D) 


(A, B) = (C, B), (C, D) = (A’, D), 


(16) 


we can conclude A = A’. 
In the last formulations the condition for semi-direct regularity reminds one 
strikingly of the following formulations of the two principal axioms.* 
DisrriputivE Axiom. If 


(17) |A, B| = [A’, Bl, (A, B) = (A’, B), 


then A = A’. 

DepEkinp Axiom. Jf A 2 A’ and the relations (17) hold, we can conclude 
A=A’, 

It is obvious that the distributive axiom implies semi-direct regularity, since 
im distributive structures direct similarity implies equality. More interesting 
is the fact that either r.h. or Lh. semi-direct regularity implies the Dedekind 
axiom. ‘To prove this we need only make B = D and C = A im (15) or (16). 

TueoreM 3. The distributive axiom implies semi-direct reqularity and semi- 
direct regularity implies the Dedekind axiom. 

I have formerly proved the main theorem on direct decompositions in Dede- 


* See Ore 1, Chap. 1. 
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kind structures, where the descending chain condition holds and where in ad- 
dition the following axiom is satisfied * 

II. (lh.). Let A, B, © and D be four quotients with the same denominator 
such that 


(a, B] = [C, D] = [C, B] = (A, D}. 
If then (A, B) = (CG, D) = G, we can conclude (B, ©) = (A, D) = G. 
We shall say that a structure = in which II (1.h.) is satisfied is (I.h.) weakly 
regular. The condition for weak regularity may also be stated: 
II’ (l.h.). Lf the relations 
M = [A, B] = [C, D] = [C, BI, 
T = (A, B) = (C, D), (C, B) 2 T, (A, D) 2 T 
are satisfied, we can conclude T = (A, B) = (C, D) = (C, B) = (A, D). 
Correspondingly we have 
Il’ (r.h.). If 
T = (A, B) = (C, D) = (C, B) = (A, D), 
M = |A, B} = [C, D], [C, B] <= M, [A, D] s M, 
we can conclude M = |A, B| = [C, D| = [C, B] = [A, D]. 
TueoreM 4. Right-hand (l.h.) semi-direct regularity implies r.h. (l.h.) weak 
regularity. 
Let us suppose that the conditions of II’ (r.h.) are satisfied. The relations 


[A, D| - [(C, [A, Dj), Dj, [(C, [A, D)), B| _ [(A, [B, (C, [D, A)}))), B| 


show that the quotients A/T and A’/T, where A’ = (A, [B, (C, [D, A])]) 
are semi-directly similar. Hence if = is semi-directly regular we have A = A’, 
so that 


M = [B, (C, [D, A])] 


and consequently M = [B,C|. The relation M = [A, D] is proved in a similar 


manner. 

The decomposition theory of Chapter 1 is valid when the descending (ascend- 
ing) chain condition holds and & is Lh. (r.h.) semi-directly regular. On the 
other hand, the main theorem on direct decompositions has been proved for 
Dedekind structures where the descending (ascending) chain condition holds 
and which are |.h. (r.h.) weakly regular. I have not been able to carry through 
the general decomposition theory under these weaker conditions and it seems 
possible that the existence of the decompositions of Chapter 1 requires a some- 
what stronger axiom than the main theorem. It seems an interesting problem 


to be considered. 
Yate UNIVERSITY 


* Ore II, Chap. 2. 





SEMI-CONTINUITY OF INTEGRALS IN THE CALCULUS 
OF VARIATIONS 


By E. J. McSHane 


Introduction. In various studies of existence theorems in the calculus of 
variations much use has been made of the property of lower semi-continuity 
of the integrals involved. For each separate type of problem there has been a 
separate proof of semi-continuity. The principal object of this paper is to 
prove one theorem on semi-continuity of integrals which has generality enough 
to cover as special cases the simple problem in parametric form and in ordinary 
form, the Lagrange problem in parametric form and in ordinary form, and the 
parametric problem associated with a problem in ordinary form.’ As a by- 
product we are able to state existence theorems for certain problems not covered 
by the existence theorems in the literature. 

The purpose of §2 is merely to extend to our analytical situation the every- 
day theorems in invariance under change of parameter. In §3 the principal 
semi-continuity theorem is proved. The notation is that of the parametric 
problem, but the hypotheses are so weak as to permit us in §4 to restate it in 
ordinary form. In §5 it is specialized to cover parametric problems and La- 
grange problems in parametric form. In §6 it is specialized to cover Lagrange 
problems in ordinary form. The next section gives three examples to indicate 
that the hypotheses in §6 do not admit of much weakening. One of these ex- 
amples (Example IIL) has an interest quite apart from semi-continuity theory, 
for in it we exhibit a Lagrange problem in ordinary form for which y = 0 is an 
extremal imbedded in a field of extremals, furnishing a weak relative minimum 
for the integral in the class of admissible curves, satisfying the Legendre and 
Weierstrass conditions along y = 0 (but not, of course, in strengthened form) 
and yet y = 0 does not afford a strong relative minimum for the integral. In §8 
we deduce from the general theorem a theorem on the semi-continuity of the 
parametric integral f f(a, y, 2’, y’)dt associated with a problem f g(x, y, y’)dx in 
ordinary form. The specializations in §§$5, 7, and 8 yield Theorems 5.1, 6.1, 
6.2, 8.1, 8.2, which, to the best of my knowledge, are stronger than any in the 
literature. 

If to the hypotheses of the semi-continuity theorem we add the hypothesis 
that the integrand is positive, we can easily obtain an existence theorem. In §9 
we apply this existence theorem to three special cases. The first is that of 
finding the path of a beam of light through a space in which pieces of glass are 
suspended. The second is the Zermelo navigation problem? Here the inte- 
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grand f(x, x’) is not defined for all x’, and it is useful that our semi-continuity 
theorem has hypotheses so weak as to permit us to set f(x, 2’) = + where it 
is not already defined. The third problem is the special case of the Lagrange 
problem in parametric form in which the side equations are linear. 

All of our semi-continuity theorems state merely that, under appropriate 
conditions, the integral involved is lower semi-continuous on a class of curves 
of uniformly bounded lengths. This restriction is really not so important, 
since in establishing our existence theorem we need not only a semi-continuity 
theorem but also a theorem establishing the convergence of a minimizing se- 
quence of curves, and in order to establish this convergence it seems vital to 
have a uniform bound on the lengths of the curves in question. However, 
from the semi-continuity theorems here obtained it would be very easy to obtain 
conditions guaranteeing semi-continuity on the class of all admissible curves, 
without a uniform bound on the lengths.* 


$1. Notation and definitions. The letters x, y, r, 7 will be used to denote 
vectors; y will stand for (y', y*, --- , y®, » for (m', n°, --- , 9%), while x, r will 
stand for (2°, --- , 2%), (7°, +--+, 7%), respectively. If a function f(z, r) has a 
partial derivative with respect to r‘, that derivative will be denoted by fi (x, 7). 
Likewise, if g(u, y, n) has a partial derivative with respect to n', that derivative 
will be denoted by gy (u, y, n). The lengths of the vectors x, y, 7, » will be de- 
noted by x|,\/y¥\, > 7|,| 7) respectively. We use a modification of the tensor 
summation; if a Greek-letter affix is repeated, the expression is to be summed over 
all values of that affix. Thus 


rida) (Xn ’ rn) = rrJcoy(Xn; T.) + ae + r? fiay(Xn; Fad, 


summed on a but not on n. 


Functions will be permitted to assume the value of + «, but not — <. 
For the symbol ~ we use the rules of calculation x + x« = *x+a=a+x=x 
for all finite numbers a; ax = «x if a>0,0 x = 0. These rules cover all 


cases which will occur. The notion of lower semi-continuity extends, of course, 
to such functions (if we take « > a for all finite a); a function f(x) is lower 
semi-continuous (hereafter abbreviated to l.s.c.) on a set EF if for every xo in E 
and every number h < f(xo) there is a neighborhood U’ of xo such that f(x) > h, 
forre EU. 

The integrals used will be Lebesgue integrals, with one minor modification; 
if a function ¢(2) is measurable but not summable over a set E, and there is a 


3 In terms of the notation about to be introduced, the additional requirement is that 
for each (xo, 70) in R there shall be a linear function b,r* and an e > 0, such that f(z, r) + 
bar® 2e|r|forallanearxz. This condition can be deduced from various other conditions 
in the special problems considered. For instance, in the parametric problem f/f(z, z’)dt = 
min, if the integrand has partial derivatives f(:)(x, 7) continuous for | r | ¥ 0, it is enough 
to add the assumption that G(z, r, *) does not vanish identically for any z. 
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summable function g(x) such that ¢(x) = g(x), we shall define 


[ ocarax = 0, 


The letters a.c. will be used in place of the words “absolutely continuous”. 
Given a function, for example x(t), the symbol #(¢) shall denote the derivative 
x’(t), where x’(t) is defined and finite, and shall have the value 0 elsewhere. 


§2. Since our goal is a semi-continuity theorem which, among other things, 
covers the Lagrange problem, it is appropriate for us to suppose that at each 
point of our space there is a restriction on the set of directions which may be 
taken by the curves which we wish to admit. Correspondingly, it is desirable 
that the conditions imposed on our integrand shall refer only to these allowable 
directions. Guided by these considerations and our needs in the following pages, 
we set down the following conditions on our integrand f(z, r): 


(2.la) f(x, r) is defined (finite or + ~) and Ls.c. on a set R in (2, r)-space; 
(2.1b) if (2, r) ¢ R, then (2, tr) « R, and f(z, tr) = tf(x, r) for all t = 0; 

(2.1c) Ris dense in itself, and the set of x such that (x, 0) € R is closed; 

(2.1ld) if (wo, ro) €R and u < f(xo, ro), then there exists a linear function 
aar* with the. properties (i) aar§ > u and (ii) for every « > 0 there is a neighbor- 
hood U of xo such that f(x, r) 2 aar* — € | r | whenever x ¢ U and (2, r) € R. 


We can, however, state another set of conditions, slightly more restrictive, 
but satisfied by nearly all the integrands we shall discuss: 


(2.2a) f(x,7r) is defined (finite or + ©) and L.s.c. on the closure R of a set R in 
(x, r)-space; 

(2.2b) if (x, r) e R, then (2, tr) « R and f(z, tr) = tf(2, r) for all t 2 0; 

(2.2c) Ris dense in itself, and the set of x such that (7, 0) « R is closed; 

(2.2d) if (to, ro) «Rand u < f(x, ro), there exists a linear function agr* such 
that (i) uw < aarf and (ii) aar* S f(xo, r) for all (ro, r) € R. 


Here it is obvious that (2.1, a, b, c) follow from (2.2). To obtain (2.1d), 
we note that if the linear function a.r* of (2.2d) does not satisfy (2.1d), then 
for some e > 0 there is a sequence (z,, 7,) of elements of R with x, — xo such 
that f(tn, Tn) < dar% — €|r,|. By (2.2b) we may suppose |r,| = 1. From 
the (%,, Tn) we select a subsequence (zx,, rp) such that r, tends to a limit 7. 
Then (x0, *) ¢ R, and by (2.2a) we find f(x, #) S lim inf f(z, , rp) S aa?* — €|7| 
< a7, contradicting (2.2d). 

Of the conditions (2.2), (a) and (b) are obvious weakenings of standard hy- 
potheses. Also, the requirement that R be dense in itself is almost trivial, for 
an isolated point of R could not lie on any admissible‘ curve except a degenerate 
curve consisting of one point, and so may be disregarded. With (2.2d) it is 


‘ This term will be defined in the next paragraph. 
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different; this is our “regularity condition”. If, in particular, it happens that 
R is closed, that f(x, r) has partial derivatives fi) (x, r) with respect to the r* 
and that 


S(a, r, 7) = f(a, 7) — Pfa(z, 7) 2 0 


whenever (x, 7) and (x, 7) are in R and |r| # 0, then (2.2d) is satisfied if we 
take a; = fi (Xo, To). Again, if R consists of all sets (2, r) with z in a set A 
and r arbitrary, then (2.2d) is satisfied if and only if f(z, r) is a convex function 
of r for each fixed x. 

A representation x = z(t), a < t S b, will be called admissible if the functions 
z(t) are absolutely continuous and (z(t), <(t)) « R for almost all t. We shall 
now prove that (under conditions 2.1) if z = 2(t),a < t S b, is any a.c. repre- 
sentation of a curve C and x = &s),0 S s S L, is the representation of C with 
arc-length as parameter, then z = x(t) is admissible if and only if x = &(s) is 
admissible. Since 


B(s) = (0) + / * B(s)ds, 
0 


and s(t) is a.c., we find 


ri(t) = &(s(t)) = &(0) + i] Ei(s(t)) 8(t)dt, 
so that 
(2.3) z(t) = &(s) a(t) 


for almost all ¢. Suppose now that x = x(t) is admissible. Let 7 be the set 
(of measure 0) on which (2.3) fails or (x(¢), #(¢)) is not in R, and let T, be the 
set on which s(t) = 0. The measure of the set s(t), te (Zo + 7) is® 


i a(t)dt = 0. 
Tot+T, 


For all other s we have &(s) = #(t) + &(t) with (2, %) eR and &> 0, so 
by (2.1b) we see that (é(s), &(s)) « R, and x = &s) is admissible. On the 
other hand, suppose that z = £(s) is admissible. Then (€(s), &(s)) ¢ R for all 
values of s except those in a set So of measure 0. Let 7 be the set such that 
s(t) «So for te Ty. For almost all ¢ not in T> we have #(t) = &s(t))8(), so 
(x(t), 2(t)) « R by (2.1b). For almost all ¢ in To we have® s(t) = 0 and z(t) = 
£(s(t))-0 = 0. Since (x(r), #(r)) is in RK for almost all 7, so is (2(7), 0). We can 
choose a sequence of 7 tending to ¢; then z2(r) —> 2(t), and by (2.1c) the set 
(x(t), 0) = (x(t), <(t)) isin R. Hence for almost all ¢ the set (x(0), £() is in R 
and z = x(t) is an admissible representation. 


5’ Hobson, Theory of Functions of a Real Variable, vol. 1, pp. 606 and 342. 

6 Sy is contained in a G;-set S of measure 0. The set 7' on which s(t) « S is also a Gs, 
hence measurable. By footnote 5,0 = mS = J,8(t)dt, so 8(t) = 0 almost everywhere in T 
and A fortiori almost everywhere in T». 
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It follows at once that if one representation of a curve C is admissible, so 
also are all other a.c. representations. Hence in this case we are justified in 
saying that C is an admissible curve. We shall understand that all representa- 
tions of curves hereafter mentioned are a.c., so that if C is an admissible curve 
x = x(t), the representation x = x(t) is admissible. 

We can now prove 

TuHeorEeM 2.1. Jf x = z(t),a St Sb, and x = #r),a Sr S 8B, are two 
representations of the same admissible curve C, then the integrals 


b 7 
[FH a and | f(%, Z)dr 


are both.defined (finite or + ~) and are equal. 

Proof. As in the preceding proof, the set (x(t), 0) is admissible for all ¢. 
Hence to each xo on C there is, by 2.1d, a linear function a,r* and a neighborhood 
U such that f(z, r) 2 aar* — |r| if (2,r)eRandzeU. A finite number of these 
neighborhoods U cover the set of points r(t),a St S b. Denoting by N — 1 
the greatest of the corresponding numbers (vector-lengths) | a;|, we have 
f(x, r) 2 —N |r| for (2, r) « R and z in a neighborhood of the point-set x(2), 
astszsb. Forn > N we define f,(z, r) = min (f(z, r), n!r!), (2, r) eR. 
Then | f,(z, r)| S n|r{if x is on C and (2, r)eR. The function f,(2, r) is 
Ls.c. on R, being the minimum of two l.s.c. functions; hence f,(x(t), #(é) is 
measurable.?. The same is true of f,(é(s), &(s)), where x = £(s) is the represen- 
tation of C with are-length as parameter. Since the functions n | #(é) | and 
n | &(s) | are summable, so are f,(x, #) and f,(, &), and by (2.1b) and (2.3) 


[ Iu(E(s), E(s))ds = | SulE(s(t)), E(s())) s(t)at 








b 
= / fn(a(t), #(t))dt. 


Now let n—» ©. Then for all s and all ¢ the functions f,(2(t), #(0) and f,(&(s), 
£(s)) increase monotonically and tend respectively to f(x(é), 2(0) and f(e(s), 
&(s)). Hence the two integrals 


r f(e(s), E(s))ds and [ £20, a(t))dt 


both exist (finite or infinite) and are equal. By repeating the argument with # 
in place of x, the theorem is established. 

We are now justified in denoting the common value of the integrals in The- 
orem 2.1 by the symbol ‘F(C). 


7 Carathéodory, Vorlesungen tiber Reelle Funktionen, p.377. The theorem does not apply 
at once, but fx(z,r) + N | r | is non-negative and 1.s.c. for (z, r) « R and x in a neighborhood 
of the point-set composing C, so /,(x, r) + N | r | can be extended to be L.s.c. on all of space. 
This implies the measurability of f,(2,z) + N | z |, hence of f,(2, 2). 
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§3. These preliminaries being disposed of, we proceed to the proof of our 
principal theorem on semi-continuity. 

THEOREM 3.1. If hypotheses (2.1) are satisfied and M is any positive number, 
then {¥(C) is l.s.c. on the class of all admissible curves of length = M. 

We prove this theorem in several steps. 

Lemma 3.2. To establish Theorem 3.1, it is sufficient to show that for every 
M > 0 the integral f f(x, z)dt is Ls.c. on the class of all admissible functions® 
x = x(t),0 St S 1, such that | x(t) | S Mand | x(t) | S M. 

Proof. Suppose Theorem 3.1 false. We can then find a number M, and a 
sequence {C,} of admissible curves tending to an admissible limit curve Co, 
having lengths < M,, and satisfying the inequality lim inf {F(C,) < F(C@o). 
For each curve C, we choose as parameter t = s/&(C,), where s is the arc- 
length and £(n) is the total length of C,; then C,, is represented by equations 
x = 7,(t),0 < t S 1, where z,(¢) satisfies a Lipschitz condition of constant M, 
and | z,(t)| S$ M,. From the C, we first choose a subsequence |C,} such that 
lim F(C,) exists and is equal to lim inf ‘F(C,,), and then from the {C.} we choose 
a subsequence {Cs} such that 2z3(t) converges uniformly to a limit function 
xo(t); this last is possible by Ascoli’s theorem. Since xg(t) = xo(t), the curves 
C3 tend to the curve represented by x = 7o(t). But lim Cs = Co; therefore 
x = x(t) is a representation of Cy. Clearly x(t) also satisfies a Lipschitz 
condition of constant M,. Since x4(t) — xo(t), the numbers | x4(t) | are bounded, 
say S M;. Setting M = max (M,, M2), we have | x(t)! < M, | #a(t) | S M, 
and by Theorem 2.1 


1 1 
lim int [ f (xg, %) dt < / f (x9, wo) dt. 
0 0 


Hence if Theorem 3.1 is false, there is an M such that f f(z, @)dt is not Ls.c. 
on the class of admissible functions z(t) with |2| S M and || S M, and 
our lemma is established. 

The use of this lemma is that it enables us to consider the representations 
as fixed; and having no further need for invariance under change of parameters, 
we may use auxiliary functions which do not satisfy (2.1b). 

Lema 3.3. If hypotheses (2.1) are satisfied, there exists a function F(x, r), 
defined and L.s.c. for all x and all r, satisfying the equation F(x, r) = f(x, r) for 
(x, r) € R and such that if (ro, ro) € R and u < F(xo, ro), there exists a linear 
function a,r* for which agr6 > uand a,r* = F (ao, 7) for all r. 

Proof. Let us first set g(z, r) = f(z, r) for (x, r) € R, and g(x, r) = @ else- 
where. Now we define F(z, r) to be the lower limit function of g; that is, the 
smaller of g(x, r) and lim inf g(Z, 7) as (#, 7) — (x, r). Then F is Ls.c.® for all 
(z,r). If (xo, ro) € R, then for every h < f(xo, ro) there exists a neighborhood U of 
(ro, ro) such that f(z, 7) > hfor (#,7) «RU. Therefore, g(%,7) > hfor (%, 7) « U, 


* The distance between two functions z(t) and 2,(t), 0 S ¢ < 1, is here understood to be 
max | z(t) — z(t) |. This is a special case of a definition which will be given in §4. 
* Carathéodory, Vorlesungen tiber Reelle Funktionen, p. 137. 
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and lim inf g(#, 7) = f(r0, ro) = g(%o, ro). By the definition of F we then have 
F (xo, ro) = f(x0, ro). Finally, let aar* be the linear function of (2.1d). From 
(2.1d) we know that for every « > 0 there is a neighborhood l’ of x» such that 


f(z, r) — agr* + €,|r| 2 0 for (x, r) € R, ze. 


Then g(x, r) — dar? + € r| 2 0 for rel and all r; and so the lower limit 
function of g — agr* + €|r_, which is F(z, r) — agr* + €|7r!|, is also non- 
negative in l’. In particular, F(x, r) 2 aar* — € |r|. Since ¢ is arbitrary, 


we have F(x, r) = agar". 

Lemma 3.4. For every M > 0 there exists a sequence of functions g,(x, r), de- 
fined and continuous in (x, r) for |x| S M, r| S M, conver in r for fired x, 
and such that 


(i) g(r,r) < gel(x,r) < 
for|x\| = M,\r| Ss M, and 
(ii) lim g,(z, r) = F(z, r), (z7,r)¢eR,|z| 3 M, ris M. 


Proof. On the bounded closed set |x| S M, |r| Ss M the Ls.c. function F 
attains its lower bound. Since fF # — ~, this lower bound is not — ~. Conse- 
quently," there exists a sequence {¢,(2, 7)} of functions continuous for x, S M, 
|r| S M, such that ¢,(r7, r) < g(x, 7) < ++: — F(a, r). For each x let 
g,(x, r) be the “convex envelope” of ¢,(x, 7); that is, the least upper bound of 
all convex functions ¥(r) S @,(2, 7). Then g,(x, r) is convex in r. From the 
definition it is easily seen that if two functions differ by less than e, then so do 
their convex envelopes. As # -> x, the function ¢,(%, 7) tends to ¢,(2, r) uni- 
formly in 7, so g,(%, r) tends to g,(2, 7) uniformly in r. For each fixed x, g,(2, r) 
is convex in r, hence continuous in r. Thus g,(2, r) is continuous in r and is 
continuous in x uniformly with respect to 7, so it is continuous in both variables. 

From ¢ < de < --- it follows at once that g; < ge < 

Finally, suppose (29, ro) « R, | 2 | S M,!r| Ss M, and let u be any number 
less than F(x», ro). By Lemma 3.3 there is a linear function ar such that 


dar, > wand ar S F(x, r) for all r. If we put @o = 3(u — aar$) < 0, then 
ao + dar > wand F(x, r) > ao + agr* for all r. The continuous functions 
o,(%o, 7) — ao — aar* tend on the bounded closed set |r S M to the positive 


limit F(x, r) — ay — aar*; hénce for all large n we have @,(2%0, 7) > @o + Gar 
for all r with |r| S M. Then ao + ag is a convex function which does not 
exceed @,(2%o, 7), so for the least upper bound g, of such functions we have 
gn(%o, T) 2 ao + ag. In particular, g,(%o, To) 2 ao + dar} > u; so lim 
d.(to, Yo) 2 u. This being true for all u < F (x9, ro), it follows that lim g,(2o, 
ro) = F(x, ro). On the other hand, g,(2%0, ro) S F(x, ro) for all n, so lim g,(2o, 
ro) S F(x, ro). Therefore g,(%o, ro) tends to F(xo, ro). This establishes the 
lemma. 


1° Carathéodory, Vorlesungen tiber Reelle Funktionen, p. 402. 
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Lema 3.5. In Lemma 3.4 we can further assume that the first partial deriva- 
tives of the g, with respect to the r‘ exist and are continuous for |x| < M,|r| < M. 

Proof. In the statement of Lemma 3.4 we replace M by M + 1 and denote 
the functions thus obtained by f,(z, r). If 0 < h < 1/(q + 1), the integral 


ro=h r%+h 
g(z, r) = / of f(x, +++, 2%, O, --- , t*)dt? ... dia 





(Qh)! J op Gh 
(3.1) : : A 
7 oom |. ii [ise eee , 7? + t2)dt® ... dt@ 


is defined for | 2! < M,|r Ss M. Since f, is continuous for |x| < M + 1, 
r| < M +11, the integral is a continuous function of x andr. It is a convex 
i ’ 
function, as we see by integrating the inequality 


{f.(z,7 +0) + flax, re + t)}/2 =f.(a, 3(rm + re) + 8. 


It has continuous first partial derivatives with respect to the r*. 
Since fru < fn < fnsi, by Lemma 3.4, there is a positive number e, such that 
fasi(z, 7) — f(x, r) > « and f,(z, r) — fru(a,r) > «, for|a2| S M+ 1and 
r =< M+41. Also, f, is continuous, so there exists a 6 > 0 such that 
f(z, 7) —f.(a,r)) < €.if |? — r| < 6. Choosing h = 6, we have from the 
definition (3.1) 


$nalz, 1) < fa(z, 7) — €n < gn’(t, 7) < falz, 7) + €n < fusi(x, 7). 


Thus f; < gy’’ < fs < gi?’ < +++. Choosing the second, fourth, --- terms of 
this sequence and renaming them g;, ge, --* , we have g; < gz < +--+, and also 
lim g,(z,r) = lim f,(z, r) = F(z, r) if (z,r) eRand|z| < M,|r| s M. 

Lemma 3.6. For each of the functions g,(x, r) of Lemma 3.5 the integral 
S gn(xz, Z)dt is L.s.c. on the class of all functions x(t), 0 < t < 1, satisfying the 
Lipschitz condition of constant M and the condition | x(t) | = M. 

Proof. Suppose that the functions 7,(t) satisfy the above conditions and 
converge uniformly to zo(t). Then z(t) also satisfies these conditions. Let 
g(x, r) be any one of the functions g,(z, r), and let gq(az, r) be the partial de- 
rivative of g with respect to r’. Since g is uniformly continuous for |z| < M 
and r <= M, the difference g(z,(t), r) — g(xo(t), r) tends to zero uniformly 
for0 Sts land|r, = M. Therefore 


1 
(3.2) lim inf 


—_ 


1 
g(r. , z,)dt = lim inf / g(xo, az, dt. 
0 


For fixed z, the linear function osculating g(x, r) at ro Is 
g(x, To) + (r* — 75) Gay (4, To). 


Since g is convex in 7, it is not less than this linear function: 


q(x, 7) 2 g(a, ro) + (r* — ro )gca) (4, To). 
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Therefore 


1 1 
lim inf / g(xo, &,)dt = / g(xo, ¥o)at 
0 0 


(3.3) P 
oe lim inf | (“* - z$) 9(a)(2o, X)dt. 
0 


From x, — 9 we find that for0 s<h sk <1 


lim [ (#* — #))dt = lim [xi (k) — xi (k)] — [xi (h) — xj(h)] = 0. 


no nso 


Also, | #, — #5 | S$ 2M. Hence" for each i we have 


1 
lim / (z° - #3) 9 (®o, Xo)dt = 0. 
n—2 J0 

Therefore the sum on the right in (3.3) tends to zero, and by (3.2) and (3.3) we 
have 


a 1 
lim inf | g(Xn, &,)dt = i g(x, x)at. 
0 0 


This establishes the lemma. 

We now take up the proof of Theorem 3.1. Let [M] be the class of admissible 
functions x(t), 0 S ¢ S 1 such that | 2(t)| S M and | #(t)| Ss M. With the 
gn of Lemma 3.5 we have for each x(t) « M 


1 1 
/ g(x, é)dt < / Gnsa(x, £)dt. 
0 0 


For almost all ¢ the set (x(t), #(2)) « R, and for all such ¢t we know by Lemma 3.5 
that g,(x(0, #(2) increases with n and tends to F(x(t), #(0). So 


Son(x, #)dt > f F(x, #)dt, 


and on [M] the functional fF (x, #)dt is the limit of an increasing sequence 
of functionals fg,(x, #)dt. By Lemma 3.6, these last are Ls.c., so [F (x, #)dt 
is itself L.s.c. on [M]. But on all admissible curves, and in particular on [.W/], we 
have fF(x, #)dt = ff(x, #)dt, by Lemma 3.3, so that ff(x, #dt is Ls.c. on 
[M]. By Lemma 3.2, this implies that ‘1(C) is Ls.c. 


$4. Theorem 3.1 appears to apply only to integrals in parametric form. 
But, as a matter of fact, the hypotheses are weak enough so that we can state 
a theorem exactly equivalent to Theorem 3.1 in which the notation is that of 
ordinary problems. We wish then to investigate the semi-continuity of inte- 
grals f g(u, y, y’) du on classes of functions y = y(w),a Su Sb. But we 
cannot even define semi-continuity until we have a notion of limit defined. 
Accordingly, ify = y(u),a Su Sb, andy = y(t), @ S uy S bh), are continuous 


" Hobson, Theory of Functions of a Real Variable, vol. 1, $279. 
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functions, we define the distance dist (y, y:) as follows. First we extend the range 
of y to the whole u-axis by setting y(u) = y(a) for u < a and y(u) = y(b) for 
u > b, and we extend the range of y,; similarly. We then define the dist (y, y;) 
to be the greatest of the three numbers max = y(u) — y:(u) |, | a — a |, 

b — b, |. The distance thus defined actually does define a metric for con- 
tinuous functions, but what we are interested in showing is that if the curves C, 
are defined by the continuous functions y = y,(u),a, S u S b,,(n = 0,1, --- ), 
and dist (yo, yn) — 0, then lim C, = Co. Let us map the interval (ao, bo) 
on (a,, 6,) by a linear transformation u,(u); then the maximum of | u,(u) — u | 
occurs at one of the ends of (ao, bo), and is either | a, — ao | or |b, — bo|. In 
any case max | u,(u) — u| S dist (y,, yo) ~0. Now write 


dist (C,, Co) S max y,(u,(u)) — yo(u) |, ao Su S bo 


< max y,(u,(u)) — yo(un(u)) | + max | yo(un(u)) — yo(u) |. 


By definition, the first term on the right does not exceed dist (y,, yo) — 0. 
Since yo(u) is uniformly continuous and u,(u) = u, the second term also tends 
to zero, and so C, — Co. 

THEOREM 4.1. Let g(u, y, n) satisfy the following conditions: 

(4.la) g(u, y, n) ts defined (finite or +) and L.s.c. on a set Y in (u, y, n)-space; 
(4.1b) the set of (u, y) such that (u, y, n) € Y for some n is closed; 

(4.le) af (uo, yo, no) € Y and h < g(uo, yo, no), there exists a linear function 
do + aar* such that (i) ay + dan* > hand (ii) for every « > 0 there is a neighborhood 
l of (uo, yo) such that g(u, y, n) 2 ao + dan®* — (14+ ||)! of (uy y, n) € Y 
and (u,y) € U. 

Then for every M > 0 the integral S(y) = J g(u, y, y)du is l.s.c. on the class of all 
functions y = y(u),a S u S b, having total variation < M and such that (u, y(u), 
y(u)) € Y for almost all u. 

It is not difficult to see that this theorem implies Theorem 3.1. If conditions 
(2.1) are satisfied, we introduce the new notation (y', --- , y@*!) = (2°, «++, x), 
(ni, «++, nt) = (r®, --+, 7%), and define g(u, y, n) = f(z, r). The set Y will 
consist of all (u, y, 7) for which (y, ») = (2, r) isin R. Then (4.1a) and (4.1c) 
follow from (2.1a) and (2.1d), respectively. By (2.1b), if (u, y, ») « Y for some 
n, so is (u, y, 0), so that (2.1lc) implies (4.1b). Now let C,: x = 2,(0), 
a, &t Ss b,, be a sequence of admissible curves of length < M tending to Co. 
For each C; (j = 0,1, --- ) we can choose an a.c. representation r = 2;(u), 
0 <u <1,sothat z, 325. By Theorem 4.1 we have 


1 1 
lim inf / f(z, , %,)du = lim inf [ g(U, Yn, YnJdu 
0 i) 


1 
I q(uo, Yo, Yoldu 
0 
1 
= / f(x, do)du, 
0 


\\V 


This establishes Theorem 3.1. 
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On the other hand, Theorem 3.1 implies Theorem 4.1. Suppose conditions 
(4.1) verified. We define 


flu, y, §, n) = tg(u, Y; n/&) for => 0, 
f(u, Y; 0, 0) = 0, 


and we define RF in the following way: if (u, y, ») « Y, then (u, y, t, tn) « R for all 
t 2 0. We now introduce the notation (2°, 2', ---,2%) = (u,y',---,y), 
(n°, rity --+,rt) = (& n',---,%). Then condition (2.1b) is satisfied. The 
set of x for which (x, 0) « R is the same as the set (u, y) for which (u, y, ») « Y 
for some , and is closed by (4.1b), so that the second part of (2.1c) holds. 
The first part of (2.1c) follows from the definition of R, since R consists of the 
points (u, y, t, tn), each of which belongs to a line-segment of points of R. 

Let (x0, 70) = (Uo, Yo, t, tn), t 2 0, be a point of R, and let h be any number 
less than f(x0, ro). Then if t > 0, 


ht < t"f(xo0, ro) = f(uo, yo, 1, n) = g(uo, Yo, 20). 


By (4.1c) there is a function @) + dan* such that (i) ao + den* > At, and (ii) 
for « > 0 there exists a neighborhood U’ of (uo, yo) such that 


g(u, y, n) 2 ao + aan™ — e(1 + | 0 |)! 


for (u, y, n) e Y, (uyy) el. If we write f for g and use the homogeneity of f, 
these become 


(i) ar* = aot + ag(tn®) > h, 
(ii) f(x, r) = flu, y, t, tn) = aot + ag(tn?) — (C+ | ty 2)! 
= aI* — e|r|, 


as required in (2.1d). If, however, ¢ = 0, we notice that the discussion of (ii) 
requires no alteration, while (i) reduces to aar¢ = 0 = f(x, ro) > hk. Therefore 
(2.1d) holds. 

To establish the fact that f(x, r) is Ls.c. on R, we first consider a set (29, ro) 
with ro = 0. If N — Lis the length of the vector a; of (2.1d), and if we there 
take « = 1, we find that for every 6 > 0 we have 


f(z,r) 2ag* — |r| 2-—N\ir| > —6 = f(xo, ro) — 4, 


provided that (7, r) e R, re UU, |r| < 6/N. So f(a, r) is Ls.c. at (ro, ro). If 
[ro | # 0, then rj) > 0, for since (x9, ro) = (Uo, Yo, t, tno), the only way to have 


r} = Ois to have t = 0. In this case, if we choose (2, 7) = (ta, Yas Ens Me) 


tending to (ao, ro), we have &, > 0 for almost all n, and 
lim inf f(7,, rn) = lim inf & g(a, Ya, Mn/ En) 
>¢ 


Eog(uo, Yo, No fo) = S (xo, ro). 


So in this case also f(x, 7) is Ls.e. at (vo, ro), and (2.1a) is satisfied. 
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Suppose now that y,(u), a. S u S b,, is a sequence of admissible’ functions 
tending to yo(u), ao S u S bo. If we define C; to be the curve 2° = u, 


zi = y‘(u), a; S u S bj, then C, — Co, and by Theorem 3.1 


bn bn 
lim inf | g(U, Yn, YnJdu = lim inf i S(an, &n)du 


n a 
bo 
i S (x0, %)du 
ao 
° 


b 
| g(u, Yo, Yoddu. 


IV 


This establishes Theorem 4.1. 


§5. We now begin to deduce from Theorems 3.1 and 4.1 corollaries of more 
recognizable appearance. An immediate corollary of Theorem 3.1 is 

TuHeoreM 5.1. Jf f(x, r) is defined and L.s.c. for all x in a closed set A and all r, 
and f(x, tr) = tf(z, r) for x € A andt = 0, and f(z, r) is a convex function of r 
for each x ¢ A, then for every M > 0 the integral ‘F(C) is l.s.c. on the class of all 
curves lying in A and having length <= M. 

A second corollary is 

THEorEM 5.2. Let the functions f(x, r) and $*(x, r) (k = 1, 2, +--+, m) be 
defined and continuous for all x in a perfect set A and all r; let the partial derivatives 
of f and ¢* with respect to the r‘ exist and be continuous for x « A and |r| > 0; 
let f(x, tr) = tf(x, r) and $*(2, tr) = to*(x, r) for x « A and t = 0; for each x « A, 
let there exist constants c,, +++ , Cm such that the function 


F(z, r) = f(z, r) + cab*(z, 1) 
satisfies the inequality 
&,(z, r, 7) = F(a, 7) — F(x, r) 20 


whenever o*(x,r) = o*(x,7) = Oand|r| >0. Then for every M > 0 the integral 
S(C) = Sf dt is Ls.c. on the class of all curves x = x(t) lying in A, having length 
< M, and satisfying the equations o*(x(t), <(t)) = 0 for almost all t. 
Let R be the class of all sets (x, r) such that x «€ A and ¢*(z, r) = 0, (k = 1, 
- ,m); this set is closed. Conditions (2.2a, b, c) are clearly satisfied. If 
(to, To.) € Rand | ro! # 0, we set ag = Fra)(to, fo). It is a well-known conse- 
quence of the homogeneity of F that 


(5.3) ro F ta)(Xo, ro) = F (2x0 ’ ro), 
while by hypothesis for all r such that (27> , 7) « R we have 


a,J* = TF (ay)(Zo , fo) = F (zo, r) ~~ Gp(2o, ro, T) < F(x ’ r). 


12 A function y(u) is admissible if (u, y, y) « Y for almost all u. 
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But if (x, r) « R, by definition all ¢* vanish and F(z, r) = f(x, r). Hence 
Gal = f(t0, To) and agr* S f(xo, r) for (xo, r) € R. 


Thus in case | ro | > 0, condition (2.2d) holds. 

If ro = 0, we distinguish two cases. There may be no ro except 0 for which 
¢*(x0,r) = 0. In this case we choose numbers a; arbitrarily and obtain a,r* = 
0 = f(xo, r) for all r such that (xo, r) ¢ R, namely ro. Or there may be an 
r, ¥ Osuch that (7,7) « R. In this case there exists, as we have seen above, 
a function a,r* such that aar* < f(xo, r) for (ao, r) € R, while arf = 0 = 
f(z0, 70). Soin any case (2.2d) is satisfied, and our conclusion holds. 


§6. Just as Theorem 3.1 led us to a theorem on Lagrange problems in para- 
metric form, so does Theorem 4.1 lead us to one on Lagrange problems in or- 
dinary form. 

TuHEorEM 6.1. Let the functions g(u, y, n) and ¥*(u, y, n), (k = 1, ---,m), 

satisfy the conditions 
(6.1) g(u, y, n) and the ¥*(u, y, n) are continuous and possess continuous first 
partial derivatives with respect to the n‘ for all (u, y) in a closed set A and all n; 
(6.2) for each (u, y) € A the system of equations ¥*(u, y, n) = 0 has at least one 
solution ; 
(6.3) there exists a set of functions c;(u, y), (¢ = 1, --- , m), continuous on A, 
such that if we set G = g + cay¥*, for every admissible set (uo , yo, no) and every 
e > 0 there is a neighborhood U of (uo , yo) for which 

G(u, Y; n) ah G(uo , Yo, no) a (n* a 15 )G ca) (Uo , Yo, no) = —e(1 + | ] *)8 


whenever (u, y, n) is admissible and (u, y) « AU. 
Then for every M > 0 the integral S(y) = Jg(u, y, y)du is L.s.c. on the class of all 
admissible functions y = y(u),a S u S&S b, having total variation <= M and such 
that (u, y(u)) € A for every u. 

Referring to Theorem 4.1, we observe that conditions (4.1, a, b) are obviously 
fulfilled, while (6.3) implies (4.1c) if we set 


ao = Guo, Yo, no) — GG iay(Uo, Yo, No) 


a;= (iy (Uo , Yo, no) (¢ = 1, ae q) 


and recall that 


G(u, y, n) = gu, y, n) + ca¥*(u, y, ») = gu, y, n) 


for admissible (u, y, 7). 

From this theorem there follows a rather interesting corollary. 

THeoreM 6.2. Suppose that 
(6.4) g(u, y, n) and Y*(u, y, n) are defined and continuous, together with their 
first partial derivatives with respect to the y', for all (u, y) in a closed set A and all 9; 
(6.5) for each (u, y) € A the system of equations ¥*(u, y, n) = 0 has at least one 
solution; 
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(6.6) for every admissible set (uo , Yo , no) and every 6 > O there is a neighborhood 
U of (uo, yo) such that if (u, y) is in AU, the equations y*(u, y, n) = 0 have a 
solution™ with |» — no| < 4; 

(6.7) there exist functions c;(u, y) continuous on A such that if we setG = g + ca, 
then the inequality 


&4(u, Y, 1; n) = G(u, Y; 7) =n G(u, Y; n) o. (7% = n*)Ga)(u, Y, n) 20 


holds for all admissible sets (u, y, n) and (u, y, 7). 

Then for every M > 0 the integral S(y) = Jf g(u, y, y)du ts L.s.c. on the class of all 
a.c. functions y = y(u) satisfying the equations y*(u, y, y) = 0 almost everywhere 
and such that (u, y(u)) € A and having total variation of y(u) S M. 

Comparing this with Theorem 6.1, we see that the only hypothesis not obvi- 
ously satisfied is (6.3). The functions G and G;;) are continuous for all (u, y) «A 
and all »; hence if (wo , yo , no) is admissible, for every y > 0 there is a neighbor- 
hood U, of (uo , yo) and a 6 > O such that if (u, y) «AU; and | » — m0| < 4, then 


(6.8) G(uo » Yo, no) — G(u, Y, n) | <7) 
{LalG (a) (Uo » Yo; no) ones Gia)(u, y; n)}?}3 < 7- 


Let U be the neighborhood mentioned in (6.6). For every (u, y) in AU,U 
there is an 7 with | 7 — mo| < 6 for which (u, y, 7) is admissible. Then if (u, y, n) 
is admissible and (u, y) « AU,U, we have 


G(u, y, n) — G(uo, yo, no) — (* — n5)Gea)(Uo , Yo, 10) 
= [G(u, y, n) — Glu, y, 4) — (nt — 9°)Ge@(U, y, 2)] 
+ [G(u, y, 1) — Guo, yo, n0)] + (n* — 4%)[Ge(u, y, 1) — Ga(Uo, Yo, n0)] 
+ (n> — 7%)Ge)(Uo, Yo, No). 


The first term on the right is non-negative by (6.7). The second is not less than 
—y. The third is not less (by 6.8) than —(|9| + | 9|)-y. If we write P 
for the length of the vector whose components are Gi)(uo , Yo, no), the fourth 
term is not less than | 7 — | P <-Pé, so the left member is greater than 
—y —-vy n\ —v 7 — Pé. For any ¢« > 0 we can choose 6 and y so small 
that y¥(1 + | + 6) + P6 < €/2; then the left member is greater than 
—e/2—e\n /2> —e{1 + 7 2}'. This proves that (6.3) is satisfied (with U 
replaced by UU’), and so the conclusion of Theorem 6.2 must hold. 

§7. If we compare Theorem 6.2 with Theorem 5.2 we notice that there is a 
decided strengthening of hypotheses. Hypotheses (6.5) and (6.6) have no 
analogues in Theorem 5.2, and even (6.7) requires that the c; be continuous, 
which was not needed in §5. This suggests an investigation to see whether 
these hypotheses are really essential or are merely dictated by our methods of 


8 This holds in particular if m < q and the matrix || vis y(u, y, n) || has rank m for all 
admissible (u, y, 7). 
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proof. We shall here show by examples that the former is the case; if either 
(6.5) or (6.6) is omitted, or even if (6.7) is relaxed to allow discontinuous c,(u, y), 
the theorem is no longer valid. 


For all y + 0, +1, +3, +4, --- we define u(y) = ev‘ *™, while for 
these exceptional values we set u(y) = 0. Then u(y) is defined and continuous 
for all y, and is positive except for y = 0, +1, ---,+1/n,---. Ourintegrand 
will be g(u, y, n) = —7’y’sin*(r/y), if y + 0, g(u, 0, ») = 0. For our three 


examples we choose three different side equations y(u, y, 7) = 0. 

Example I: ¥(u, Y; n) =e? — u(y). 

Example Il: y(u, y, ») = q(y, u(y) + 1 — e*), where g(y, v) = o% — v + 
u(y) (v — 1). 

Example III: y(u, y, n) = (e" — u(y))(e” — 1). 

In example II, we observe that g(y, v) = 0 has one real root v = 1 if u(y) > 0, 
and two distinct roots v = 0, v = 1if u(y) = 0. Hence in this case the solution 
of the equation y(u, y, n) = 0 is » = log u(y) if y + 0, +1, +1/2,---, and 
n = Oif y = 0, +1,---. In example I, for y + 0, +1, --- the solution is 
n = log u(y), and for y = 0, +1, +1/2, --- there is no solution. In example 
III, » = log u(y) is a solution if y + 0, +1, +1/2, --- , and » = Oisa solution 
for all y. 

In examples I and II hypothesis 6.7 is satisfied if, for example, we take c = 0. 
For, given any (u, y), there is never more than one » such that (u, y, 7) is ad- 
missible, and (6.7) reduces to the triviality Ge(u, y, 7, 7) = 0. Example I 
satisfies (6.6)'* but not (6.5); example II satisfies (6.5), but not (6.6). Example 
III satisfies both of these hypotheses. With regard to (6.7), we notice that if 
y = 0, +1, --- , there is only one » (namely 0) for which ¥(u, y, 7) = 0, so as 
before, if (u, y, n) and (u, y, #) are admissible, Se(u, y, n, 7) = Ge(u, y, 0,0) = 0, 
no matter how we choose c. If y # 0, +1, ---, there are two solutions of 
y = 0, namely 0 and log u(y). Then y¥,(u, y, 0) = 1 — u(y) > 0, while y,(u, y, 
log u(y)) = u(y)(u(y) — 1) < 0. Let us set G = g + ey and try to determine 
cso that Se(u, y, n, 7) 2 0 whenever (u, y, 7) and (u, y, 7) are both admissible. 
If » = 7 this is certainly satisfied. Otherwise we have 


Se(u, y, 0, log u(y)) = —y*sin~*(x/y) + ysin*(r/y)[e(1 — u(y))), 
Se(u, y, log u(y), 0) = y*sin~“(4/y) — y~tsin-*(x/y)[2y-? + cu(y)(u(y) — 1)). 


These are both positive if c is large enough. Hence (6.7) holds except for the 
requirement that the c; be continuous. 

In all these examples there is a family of admissible functions determined 
by the equation 


(7.1) y = log u(y) = —y‘sin-*(/y); 


these are absolutely continuous and monotonic decreasing, with derivatives 
less than —1for|y{< 1. For example II there is a second family of admissible 


4 In fact, if Y(u, y, n) = e? — 1 = 0, then dy/dn = e® = 1, 
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functions y = 0, y = +1, y = +3,---. For example III every function 
y = constant is admissible. 

To establish the lack of semi-continuity in example I, we let yo be the function 
defined only for a = 0, and there having the value 0, and we define y, to be a 
solution of (7.1) in the interval (0, b,), where y,(0) = 1/n and b, is so chosen 
that y.(b,.) = 1/(n +1). Then y, — yo, and S(yo) = 0. We calculate G(y,) 
most easily by using y as the independent variable: 

I/n 


bn 
G(yn) = — ¥sint(r/y)ydr = / — y*sin®*(r/y)y~ sin-*(x/y)dy 
0 1 


/(n+) 
—~171/ am 
= ly Na+) — —1. 


So example I is not l.s.c.,even though the total variation of y, is 1/n — 1/(n + 1), 
which tends to 0. 

The same family of functions is admissible for examples II and III also, and 
so they are also not l.s.c. However, it is interesting to modify the y, somewhat. 
We define yo(u) = 0,0 < u S 1l;andforn = 1, 2, --- we define y,(u) to be the 
solution of (7.1) on (0, b,.) as before, setting y,(u) = 1/(n + 1) forb, <u <1. 
The functions yo, y, thus defined are absolutely continuous on the interval 
(0, 1), and since on (b, , 1) we have g(u, yn, Jn) = g(u, Yn, 0) = O, we still have 
G(y,) = —1. For example III we thus see that the function yo = 0,0 < u < 1, 
furnishes a weak relative minimum for S(y), since if we consider only admissible 
functions y(u) with |y| < 1 and | y| S 1 we obtain only the functions y = 
const., for which $(y) = 0. Of course there is not a strong relative minimum 
at yo, aS Our comparison functions y, show. The function yo is imbedded in 
the field of extremals y = const. The Weierstrass condition holds along yo, 
but not in strengthened form. 

In the usual Lagrange problem the number m of equations is required to be - 
less than the number q of functions y(u), while here m = q = 1. This objection 
is readily disposed of if we interpret our problem as one in (a, y, z)-space in 
which the functions g and ¥ happen to be independent of z and 2Z. 


§8. From §5 we obtain at once a theorem concerning integrals in ordinary 
form, without side equations; for if we suppose that there are no functions yf, 
Theorem 6.2 becomes 

TueoreM 8.1. Jf g(u, y, n) is defined and continuous, together with its first 
partial derivatives gq) , for all (u, y) in a closed set A and all n, and if &(u, y, 2,7) 2 
0 for all (u, y) € A and all n and 4, then for every M > 0 the integral Si(y) is L.s.c. 
on the class of all a.c. functions y = y(u), a < u & b, having total variation < M 
and such that (u, y(u)) lies in A for all u. 

However, in a previous paper I have had need of a semi-continuity theorem 
more general than this. We therefore suppose that g satisfies the conditions 
(8.la) g(u, y, n) is defined, finite, and |.s.c. for all (wu, y) in a closed set A and 
all ; 

(8.1b) g(u, y, 0) is bounded above. 
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From g(u, y, n) we form a new function f as in §4: 
(8.2) flu, y, §&,n) = ég(u, y, n/£), (uy y) €A, E> 0, f(u, y, 0,0) = 0, (u, y) «A, 
and we again introduce the notation 2° = u, z' = y‘,r° = £,r° = n*. We readily 
see that f(z, tr) = U(2, r)ifxeA,andt 20. If mis any constant, the semi- 
continuity of [f(x, )dt is equivalent to that of f(f(x, ) + m2#)dt, since the two 
integrals differ only by the functional fm*dt, which, being m times the differ- 
ence of the final and initial values of x°, is a continuous functional. Hence in- 
stead of (8.1b) we may assume without loss of generality that 
(8.3) f(z, ---, 27%, 1,0,---,0) < 0. 

Let us now assume that 
(8.4) for fixed (u, y) « A, the function g(u, y, n) is convex in ». This implies 
(8.5) for fixed x ¢ A, the function f(z, r) is convex on the set of all r with ° > 0. 
We must show that for any such r; and rz we have 


f(x, 1) + f(x, m2) 2 F(z, (rn + 12); 
that is, 
fig(u, y, m/&) + &g(u, y, m2/k) = (i + &)g(u, y, Cm + m2)/(E + &)). 
Writing kj = &:/(& + &) (¢ = 1, 2), we have k; > 0, ki + ke = 1, and 
kym/&: + kene/E = (m + m2)/(& + &). 
The inequality to be proved is then 
kig(u, y, m/f) + keg(u, y, m2/&) 2 g(u, y, kim/é + Kene/&), 


in which form it is easily seen to follow from (8.4). 

We next prove that if (8.3) and (8.5) hold, f(u, y, & ) is a monotonic de- 
creasing function of & for fixed (u, y, »). Suppose 0 < — << —&+h. Remember- 
ing the homogeneity and continuity of f, we have 


S(u, y, & +h, n) = flu, y, (E + h)/2, 0/2) S flu, y, & 0) + flu, y, h, 0) 
= flu, y, §, n) + hf(u, y, 1, 0) < f(u, y, §, ”), 
which was to be proved. Hence f(u, y, £, 7) tends to a limit, finite or infinite, 


as £ + 0, and we can define 


(8.6) f(u, y, 0, n) = lim f(u, y, & n). 
t—0 
This is easily seen to be consistent with the definition f(u, y, 0,0) = 0. 
Next we prove that f(u, y, § 7) is Ls.c. For & > 0 this is an immediate 
consequence of (8.1a), for if (u,, Yn, En, Mn) — Cu, y, §& 0), then 


~— 


lim inf f(un, Yn, En, mm) = lim inf E,g(tn, Yn, En/mn) 2 Eg(u, y, n/t 

= f(u, y, & 0). 
Hence for every positive integer p the function f(u, y, —§ + 1/p, n) is Ls.c. for 
(u,y)e«Aandé 20. Now let p— ~. For £ > 0 the function f is convex, 
hence continuous, so f(u, y, — + 1/p, n) > f(u, y, §& 7). For & = 0 the same 
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relation holds by definition (8.6). As p increases and 1/p decreases, f(u, y, 
& + 1/p, n) increases, since it is monotonic decreasing in —. Hence f(u, y, &, ) 
is the limit of an increasing sequence of l.s.c. functions, so it is itself l.s.c. More- 
over, f(u, y, — + 1/p, n) is convex in (£, n) for & 2 O, so for fixed (2, y) we see 
that f is the limit of an increasing sequence of convex functions, so it is itself 
convex in (&, 7). 

Thus we have shown that if R is the set of (x, r) with x « A and r® = 0, condi- 
tions (2.2a) and (2.2b) are satisfied by f. Condition (2.2d) is satisfied because 
f(x, r) is convex in r for fixed x. Condition (2.2c) holds, because, first, R is 
closed, and second, if (x, y, &, n) « R, so is (x, y, & + h, n) forallh > 0. From 
Theorem 3.1 we therefore conclude: 

THEOREM 8.2. If g(u, y, n) satisfies conditions (8.1) and (8.4), then for every 
M > 0 the integral ff(x, «)dt is l.s.c. on the class of all curves tying in A, having 
lengths = M, and having x = 0; here f is defined by (8.2) and (8.6). 

If in particular we restrict our attention to curves having a.c. representations 
of the form y = y(u), a S u S b, we obtain as in §5, the following generalization 
of Theorem 8.1: 

TuHEorREM 8.3. If g(u, y, ) satisfies conditions (8.1) and (8.4), then for every 
M > 0 the integral S(y) = Jg(u, y, y)du is L.s.c. on the class of all a.c. functions 
y(u),a < u S b, such that (u, y(u)) lies in A and y(u) is of total variation <= M. 


§9. If in addition to hypothesis (2.1) we assume that the set R consists of all 
(x, r) with z in a closed set A, and if moreover there is an m > 0 such that 
f(z, r) 2 m|r |, it follows readily that in the class K of all curves joining two 
fixed points xz, and 22 there is a curve C for which ‘F(C) is least. In case the 
lower bound 7 of ‘F(C) on the class K is ~, any curve of K will serve. Other- 
wise 0 Si < x. Wechoose a sequence {C,} of curves of K such that i+ 1 2 
F(C;) = F(C2) --- +7. For all of these curves we have £(C,) = f | 4%, | dt s 
mf f(x, , @,)dt = m“F(C,) < (i + 1)/m. Hence by Hilbert’s theorem there 
is a curve of accumulation Cy , which also joins 2; to zz. From the C,, we choose 
a subsequence {C,} with limit Cy. Since Co e K, we see that (Co) 27. On 
the other hand, by Theorem 3.1 


it = lim inf {F(C,) 2 (Co) 2 1, 


so {F(Cy) = 2, and Co is the curve sought. 

We now apply this very simple existence theorem to three special cases. 
As a first example, we consider a number of pieces of glass with index of re- 
fraction p > 1 suspended in a vacuum and consider the path that a light-ray 
would traverse in going from 2; to v2. The reciprocal of the velocity of light 
at any point is p(x)/c, where ¢ is the velocity of light in vacuo and p(x) = p 
if z is in the glass and p(x) = lif cisinthe vacuum. The time of traversal of 
any given path z = z(t) is then ce! fp(x) | a#\dt. Here p(x) |#| =| a), and 
if the glass be regarded as forming an open set p(x) | #| is L.s.c. The others of 
conditions (2.1) clearly hold. Therefore there exists a path for which the time 
of traversal is least, and by Fermat’s principle, this is the path sought. 
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The theory applies equally well if we replace the glass by anisotropic crystals 
and require that the path have a point in common with a given point set (mirror). 

As a second example we consider the Zermelo navigation problem.” A ship 
whose velocity relative to the water is k is to travel from a point x, to a point 22, 
the water being in motion and having at the point z a velocity (v'(x), v°(x)). 
We suppose that the “sea” is a closed set A and that v(x) is continuous. (If v 
were a function of both x and the time 7, this would be the general form of the 
navigation problem, which is a Mayer problem.) For this problem the time 
of traversal is [f(x, #)dt, where f(x, r) is defined as follows. For fixed z, a half 
line from the origin, r = 0, may meet the circle | r — v(z) | = kin 0, 1, or 2 points 
distinct from r = 0. In the first case we leave f(z, r) undefined on the ray. 
In the second case, if the intersection is at r; , we set f(z, i.) = tfort 20. In 
the third case, if r, is the intersection further from the origin we set f(z, tr:) = t 
fort 20. Thus for each z the set Z,,,, on which f(z, r) is defined and f(z, r) S u 
consists of all the points of the circumference | r — uv(x) | = uk and all points 
on the line segments joining this circumference to r = 0. Since |7,/| S 
k + | v(x) | , we find that whenever f(z, r) is defined the inequality 


f(z,r) 2 |r| /(k + | (2) |) 


holds. Thé sets for which f(x, r) is defined will be called “attainable.” 

We now define an auxiliary function F(z, r) which is equal to f(z, r) when 
(x, r) is attainable, and is ~ if x « A and (z, r) is not attainable. This function 
is Ls.c. For if u < F(x, r), then r is not in the set F,,,, so it has a distance 
26 > Ofrom that set. If # remains in a neighborhood N, of 2, the center ur(z) 
of the circle |r — uv(x) | = uk moves less than 4, so for such @ the distance 
from r to E;,, is still less than 6. Thus for # « N, and |? — r'| < 6 we have 
either f(%, 7) defined and F(z, 7) = f(%, 7) > u, or else f(%, 7) undefined and 
F(%,7) = ~ > u. Conditions (2.2b, ¢) clearly hold for F. To verify (2.2d), 
we first dispose of rp = 0. For this, if u < F(xo, ro) = 0, we choose aar* = 0, 
and then F(x», r) 2 aar*forallr. If ro # 0, choose any u < F(x, 70). Since 
F (xo, ro) > 0, we can determine a positive w so that u < w < F(x, 79). Con- 
sider the set E,,,.. This is convex, and ro is not in it, so we can find a line not 
passing through the origin and separating ro from F,,... Let the equation 
of this line be written in the form a,r* — w = 0. The line separates ro from the 
origin, sO darq — w > 0. The set E,,,.. lies on the same side of the line as the 
origin, so for r e E,,,.. we have agr* — w S 0. That is, if P(r, r) = w, then 
dat® S w= F(xo,r). By homogeneity, the inequality aar* S F(xo, r) holds 
for all r, and (2.2d) is satisfied. 

Since F' satisfies all the hypotheses of our existence theorem, there is a curve 
C joining x; to x2 for which fF (x, #)dt is least. If this last integral is finite, 
then F(x, #) must be finite for almost all 4. But F(x, #) = © unless (2, 2) is 
attainable, so for almost all ¢ the set (x, @) is attainable and F(x, 2) = f(a, 2). 
Thus we have shown that if it is possible to travel from x; to 2 in a finite time, 


%® Carathéodory, Variationsrechnung, p. 234. 
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it is possible to make the journey along a path x = zo(t) such that (29, %) is 
attainable for almost all ¢ and such that the time of the voyage along the path 
x = 2)(t) is the least possible. 

For our final example we consider the problem of minimizing an integral 
Sf(z, z)dt in the class (assumed non-vacuous) of curves joining two fixed points 
r,, x2 and satisfying for almost all ¢ a set of equations ci (r)a* = 0, (= 1,---,m< 
q). We assume that f(z, r) is defined and continuous for all z in a closed set A 
and all r, and that f(z, tr) = tf(z, r) fort 2 0. We further assume that the c’;(z) 
are continuous on A, and that there is a number p > 0 such that f(z, r) = p |r| 
whenever the equations c}(x)r* = 0 are satisfied." (A set (x, r) such that c}(x)r@ 
= 0 will be called admissible.) 

As before, we define F(x, r) to be equal to f(z, r) whenever ci (x)r* = 0, and 
F(z, r) = ~ elsewhere. This function is lower semi-continuous. For the ad- 
missible arguments (zx, r) form a closed set, so if (ro, 79) is not admissible, there is 
a neighborhood of (zo , ro) on which F(z, r) = ~. If (xo, ro) is admissible, for 
every « > 0 there is a neighborhood of (z», ro) on which F(z, r) 2 f(z, r) 2 
S(20, 70) — € = F(ao,70) — €. Conditions (2.2b, ¢) clearly hold. With regard 
to (2.2d) we now assume that &,(z, r, 7) = 0 whenever (z, r) and (z, 7) are ad- 
missible. Then if (zo , ro) is admissible, the function rf,.)(19 , To) serves for the 
ar® of (2.2d); for if (zo , r) is admissible we have 0 < &;(z0, 179, 7) = f(z0, 7) — 
Sie)(Zo, To)r* = F(xo, 7) — agr*; otherwise F(z), 7r) = © > agr*. If (xo, 10) 
is not admissible, then the equations cir¢ = 0 are not all satisfied; say c%(z0)r$ > 
0. Choose now any linear function b.r* such that F(x» ,r) = bar* for all admissi- 
ble r; we have just seen that this is possible. Now if u be any number, it is 
possible to find an N large enough so that u < bar§ + Ne*%(xo)r§. Then the 
function a,r* = b,r* + Ne*r@ is the one sought; for if (xo, r) is admissible, we 
have F(zo,r) = f(zo, r) 2 bar* = aar*, while if (xo, r) is not admissible, 
F(zo,7r) = *~ > a,r*. 

Since F(z,r) satisfies all the hypotheses of our existence theorem, there is a 
curve C: 2 = x(t) for which f F(z, Z)dt is least. But this least value is by 
hypothesis finite, so F(z, ¢) < @ for almost all ¢. Therefore for almost all t 
the equations c)(r)z7(t) = 0 must be satisfied, and we have proved 

THeoreM 9.1. Jf the functions ci (x) (j = 1, ++: ,qg,t = 1,-++,m <q) are 
continuous on a closed set A, and f(x, r) is defined and continuous for x « A and 
all r, and f(z, tr) = Uf(x, r) fort = 0, and there is a positive number p such that 
f(z, 7r) 2 p rr whenever c)(x)r* 0, and &(x2, r, 7) 2 O whenever x € A and 
ch (x)r* = c,(x2)F = 0, then in the class of all curves x = x(t) joining two fixed points 
Z;, 2, and such that ch (2)z* = 0 for almost all t, there exists a curve x = xo(t) for 
which the integral § f(x, z)dt assumes its least value. 


UNIVERSITY OF VIRGINIA. 


# Of course this last could be replaced by weaker hypotheses. 























NOTE ON A SINGULAR INTEGRAL. II 
By E. P. Nortrurop 


1. Introduction. This paper is concerned with the convergence in the mean 
to f(x), as m — o~, of the integral 


Tw(x;f) = (2x)-} [ K(x — u;m) f(u)du, 

and is a generalization of results obtained in an earlier note by the author." 
As a point of departure for the present note we shall, after a few preliminary 
remarks, introduce the main theorem (hereafter referred to as Theorem I) of 
the first one. 

Since all of the functions to be considered will be defined over the infinite 
range, we shall denote the Lebesgue class L,(— «, +) by simply L,. We 
write || f(x) ||, for the norm of a function in L,, and define it by means of the 


relation 
i f(x) ||, = il : | f(x) rar | 


The Fourier transform of a function f(x) «L,, r > 1, is defined (provided it 
exists) as the limit in the mean of drder s, 1/r + 1,s = 1, as A > &, of the 
integral 


“A 


(2r)-4 | e~** f(t)dt, 


and will be denoted by 7[z; f(4], or, if there can be no confusion regarding the 
argument, more simply by 7[f(r)]. The inverse Fourier transform of f(x), 
denoted by T“[x; f()] = T-[f(x)], is defined by the same expression, except 
that e~* is replaced by its complex conjugate. 

Tueorem I. Let K(x; m) € Le for every m. Then in order that T(x; f) «Le 
for every m and || T,.(x; f) — f(x) 2 > 0 as m — &, for every f(x) € Le, tt ts 
necessary and sufficient that K(x; m) satisfy the conditions 


(i) eub. | TLK (2; m)] | M,,, and lim M, < M, 
o<crc4aw m-2e0 
om 
(ii) lim | | TIK(2; m)] — 1 dx = 0 


for every finite a and b. 
Remarks. In condition (i), as throughout the rest of the paper, M,. is a 


Received June 12, 1935. 
' Bull. Amer. Math. Soe., vol. 40 (1934), pp. 494-496. 
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finite function of m, M is a constant, and e.l.u.b. denotes essential least upper 
bound, i.e., the least upper bound, for a fixed m, except for a set of measure 
zero. The reader who refers to the first note will find that condition (i) has 
been revised. It was originally stated thus: | T[K(x; m)]| < M for all m and 
almost all z. In this case, Theorem I would not be true, for the purposes of 
necessity, when m is a continuous parameter. This difficulty was not men- 
tioned explicitly by Lebesgue in connection with a theorem? upon which the 
proof of Theorem I is based, and the difficulty was again overlooked by the 
present author. In Theorem I as stated above, however, it is immaterial 
whether m is a continuous or a discrete parameter. The same may be said 
for the remainder of the theorems in this note, with the exception of Theorem ITa. 

It is natural to inquire whether or not this theorem can be generalized so as 
to cover the case where f(z)«L,, 1 < r S ~. This note is an endeavor to 
answer this question, and does so partially, in that sufficient conditions for 
convergence are obtained in the case 2 < r < , and certain necessary condi- 
tions in the case 1 < r < 2. The difficulties involved in the cases 1 < r < 2 
relative to sufficient conditions, and 2 < r < ©@ relative to necessary condi- 
tions, will be discussed later. No attempt has been made here to treat the 
extreme cases r = 1,7 = x. It might be pointed out in this connection that 
H. Hahn has obtained necessary and sufficient conditions for the case r = 1, 
although these are not in terms of the Fourier transform of the kernel.* 

Throughout the paper it will be assumed that p and q are numbers satis- 
fying the relations 1 < p < 2,1/p + 1/q = 1. It follows that 2 <q < ~. 
We shall have occasion to use Hélder’s inequality: if fi(x) « L,, and fe(x) « Le, 
then 








+e l 
[ ful) fal)dx | < || fale) Ip || fox) Iles 





as well as the following known properties of the Fourier transform :* 

(a) If f(x) e L,, then T[f(x)| and T~[f(x)] exist and belong to L,, and 
T\TIif(z)}} = T{T-{f(x)]} = f(x) almost everywhere. 

(b) For every fi(x) and fo(x) belonging to L,, 


(1.1) | fila) Tl fe(x) dx = / f(x) TI f(a) |de. 
(ce) If f(z) e L,, then 
(1.2) TIf(z)) qo S A(p) | f(@@) >, 


where A(p) is a finite quantity depending only upon p. 


2 Ann. de Ja Fac. des Se. de |’Univ. de Toulouse, (3), vol. 1 (1909), p. 52. 

3 Kais. Ak. der Wiss. in Wien, Denkschriften, vol. 93 (1917), p. 667. 

* For property (a) see E. Hille and J. D. Tamarkin, Bull. Amer. Math. Soe., vol. 39 
(1933), pp. 768-774; for properties (b) and (ec), . C. Titchmarsh, Proce. Lond. Math. Soc., 
(2), vol. 23 (1924-25), pp. 288 and 287 resp. 
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2. Sufficient conditions in the case f(x) « L,. We have two main theorems 
in this case; in the first, f(x) belongs to a (dense) subset of L,, and in the 
second, f(x) is an arbitrary function of L,. 

TuHeoreM II. In order that T,,(x; f) « L, for every m and 


\| T(z; f) — f(z) ||; > 0asm— a, 


for every f(x) which is the Fourier transform in L, of some function in Ly, it is 
sufficient that K(x; m) satisfy the conditions 


(i) K(x; m) ¢ L, for every m, 
(ii) e.lu.b. | TIK(x;m)]| = M,, and lim M,, < M, 
—e<c r+ mo 
b 
(iii) lim i | TIK(x;m)] — 1 |? dx = 0 


for every finite a and b. 


Proof. Conditions (ii) and (iii) are obviously equivalent to those obtained 
by replacing T by T-'. In the following we shall use the conditions so revised. 
We show first that 7,,.(7; f) « L,. If in (1.1) we put fi(u) = K(x — u; m), 
T(fe(u)] = f(u), then 


T(z; f) = (2x)-3 , T-[f(u)] Tlu; K(x — t; m)]du 


= (2r)-} | e~z T—[ f(u)] T-(K(u; m)]du, 
since it can be easily verified that T[u; K(a — t; m)| = eT [u; K(t; m)| = 
e*=T-K(u; m)]. Then by condition (ii), | T“[f(w)]T-[K(u; m)]| < 
M | T“{f(u)] |. As the right term of this inequality belongs to L,, so also 
must the left term, and T’,,(27; f) can be regarded as the Fourier transform in 
L, of T—[f(x)|T—[K (x; m)]}. 

Keeping in mind that f(x) can be thought of as the Fourier transform of 
T|f(x)], we can then write, with the aid of (1.2), 


(2.1) || Tm(z3 f) — f(2) |la S A(p) || TUS@IJT LK; m)] — 1} |p. 


The norm on the right of this last relation is the p-th root of the integral 
+0 
/ | T-[f(a~)] |? | TL (x; m)] — 1)? de. 


By the theorem of Lebesgue used in the proof of Theorem I, the conditions 
(ii) and (iii) are sufficient for the convergence of the above integral to zero, 
asm-— > «. This, by (2.1), proves the theorem. 

Tueorem III. Jn order that T(x; f) « Lg for every m and 


| T(x; Sf) — f(z) Ile 
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as m— ~, for every f(x) € La, it is sufficient that K(x; m) satisfy conditions (i) 
and (iii) of Theorem II, and the condition 


+20 
(ii’) i] | K(x; m)| dx < N (a constant) for every m. 


Proof. Note that condition (ii) of Theorem II has been strengthened, as 
(ii’) implies (ii). We first show, as before, that 7',(2; f) «L,. To do so, we 
use (ii) and Hélder’s inequality in a somewhat modified form. We have, 
keeping in mind that 1/p + 1/q = 1, 


| (oe) T(x: f) is [- | K(x — u; m) Pfu) | Ke — u;m) *} du 
+2 1p += 1 
< | | K(x — usm)| du fe] su) |6| K(2 — usm)| du | 
L 4-20 1 
< Ne | / | f(u) |* | K(x — u; m) | auf 
That is, 
(2.2) T m(23f) |\q < (24)-*N || f(x) Iq. 


We now make use of the fact that, given an arbitrary function f(r) «L,, 
and an e > 0, we can find a function g(x) « L, which is the Fourier transform 
of a function in L, and such that 


(2.3) | f(x) — (2) a <e. 


(It would be sufficient to use, as g(x), a step-function; for the Fourier transform 
of such a function is in every L,,r > 1.) With this in mind, we write 


| Tm(@3f) — f(z) le S | ole) — F(x) la + | Tule; f) — Tul; 9) lle 
+ || Tn(z;~) — (2) |\e- 
If we can show that the right side of this inequality can be made arbitrarily 
small by the choice of a sufficiently large m, then the left side will also have 
this property. The first term is arbitrarily small by (2.3), as is the second 
term. For in view of the fact that 7’, is an additive transformation, 
|| Tm(a3f) — T(z; ¢) \la = | Tm(a; f — @) |la < (27) IN If — elles 


by (2.2). Finally, to the third term we can, for a fixed g, apply Theorem II. 
This proves the theorem. 


3. Necessary conditions in the case f(x) « L,. 
Tueorem IV.5 For every m let K(x; m) be the Fourier transform in L, of 
some function in L,. Then in order that T,,(x; f)eL, for every m and 


5 The author wishes to express his indebtedness to the referee for suggesting this theorem, 
and to E. Hille for indicating the argument which replaces the theorem of Lebesgue. 
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| T n(x; f) — f(x) ||) ~ 0 as m — x, for every f(x) «L,, it is necessary that 
K(x; m) satisfy the conditions 


(i) e.lu.b. | T[K(2;m)]|=M,, and lim M,, < M, 
—wczr<c+o m2 
b 
(ii) lim | | T[K(x; m)] — 1\¢dxr = 0 


for every finite a and b. 
Proof. If in (1.1) we put fi(u) = f(u), Tl fe(u)] = K(x — u; m), then 


T(x; f) = (24r)-3 r Tl f(uw)] T[u; K(a — t; m)] du 


II 


(2r)- ig ec“? T[f(u)] TLK(u; m)] du, 


since it can be easily verified that T-'[u; K(a — t; m)] = e™T[u; K(t; m)] = 
e*T[K(u; m)]. From the above it is evident that T’,,(z; f) can be regarded 
as the inverse Fourier transform of T[f(x)]T[K(z; m)]. If now we assume that 
T(x; f) belongs to L,, it follows that its Fourier transform in L, is almost 
everywhere equal to T[f(x)|T[K(x; m)]. We can then write, with the aid 
of (1.2), 


(3.1) Tr(z3f) \» 2 (A) || TIF@I|TIK(@; m)) a, 
(3.2) || Tala; f) — f(z) ||» 2 [AMT || TIF@NTIKG@; m)} — 1} |. 


If the term on the left of (3.2) tends to zero as m — ~, so also must the term 
on the right. Similarly, the boundedness of the left side of (3.1) implies that 
of the right side. We cannot apply the theorem of Lebesgue directly to the 
present situation, as we did in Theorem II, because T[f(x)] is not an arbitrary 
function of L,. This follows from the fact that the Fourier transform maps L, 
on a (dense) subset of L,. This difficulty, however, can be taken care of as 
follows, by an argument of Lebesgue’s type. 
For any choice of a and b, a < b, b — a < «@, the function 





eibt __ giaz 
Pe = ix Vor 
is in L,, and 
| f(2, a, b) lip = a(p)(b — a), 
where a(p) depends only upon p. Furthermore, 
la<2<b, 


TIf(a, a, b)] - «= 4a, b, 


0, elsewhere. 
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For this function, (3.2) becomes 


| Pax; f) — fl) lip 2 Ar] | 


a 


1 


| T[K(x; m)] — 1/2 az. 


Hence condition (ii) is necessary. 
Similarly, for the function f(z, a, b), (3.1) gives 


( b 1 ; 
! Tr(a3f) |p = [A(p)a(p)] "i : : / T (K(x; m)] ear} f(x, a, b) ||. 
-— a } 
As we shall presently prove, the transformation T',,(2; f) is bounded in L, for 
every m; i.e., there exists a finite B(m) such that || T,,.(x;f) ||» S B(m) | f(x) ||p 
for every f(x) « L,. This implies that 


“b 
(3.3) b _ | | T[ K(x; m)]|\¢ dx < [M(m)]:, 


where M(m) is a finite quantity independent of a and b. But if a is fixed, and 
b — a, then, by a well-known theorem of Lebesgue, the left side of (3.3) tends to 
| Tia; K(t; m)] |\¢ for almost all values of a. Hence 


(3.4) T(K(2; m)| | < M(m) 


for almost all x. 
In order to prove the boundedness of T(z; f) in L,, we note first that 
Hélder’s inequality gives 


(3.5) | T(r; f) | S& (2r)-* || K(a; m) ||, || f(z) lp, 
so that the transformation is bounded for every m. Now define the function 


f , 
JT.(z;f) |x| Sn, 
\0, elsewhere. 


Tr(a; f) = 


By assumption, 7',,(x; f) « L,, so that a fortiori T(r; f)e L,. But T(a; f) 
is a bounded transformation in L,, since by (3.5), 


1 
Tre; f) |» S (2e)-4(2n)” || K(w; m) |\q || f(z) Il. 


Furthermore, | T(z; f) — T(x; f) » 7 0 as n — @ for every f(x) e Ly. 
But if a sequence of bounded linear transformations converges at every point 
of L,, then the bounds of the transformations must be uniformly bounded, 
and the limiting transformation is bounded.6 Hence T’,,(x; f) is bounded in 
L, for every m. i 

The same argument shows that the bounds M(m) of (3.4) must be uniformly 
bounded, since by assumption T7’,,(x2; f) converges all over the space L,. Hence 
condition (i) is necessary. This completes the proof of the theorem. 


®S. Banach, Théorie des Opérations Linéaires, Warsaw, 1932, p. 80, Th. 5. 
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4. Special cases of Theorems II, III, and IV. It is perhaps of some interest 
to see what simplifications are brought about in the conditions on the kernel 
in case it is of the form K(x; m) = mk(mzx). We have the following theorems. 

THEOREM IIa. If in Theorem Il, K(x; m) = mk(mz), conditions (ii) and (iii) 
can be replaced, respectively, by the following: 


(iia) T [k(x)]| < M for almost all zx, 
h 
(iiia) lim xf | T[k(x)] — 1|? dx = 0, 
h—0 2h = 


where h is a continuous parameter. 

Remarks. It will be shown that (ii) and (iia) are equivalent in the case under 
consideration, and also that (iii) and (iiia) are equivalent provided m tends 
continuously to #. If, on the other hand, m runs over an arbitrary sequence, 
(ilia) appears to be more stringent than (iii). 

Proof. It is easily verified that 


(4.1) T(x; mk(mt)] = T|x/m; k(d)), 


whereupon (ii) and (iia) are obviously equivalent. 
As for (iii), it becomes, in view of (4.1), 


m—*o 


b 
lim f (Tl[x/m;k()] — 1\" dx = 0 


for every finite a and b. That is, 


b 
lim m f | T{k(x)] — 1\"? dx = 0; 


mo 


or, putting m = 1/n and dividing by 2, 
bn 
(4.2) lim | | T[k(z)] — 1|>dz = 0 
n—0 2n an 


for every finite a and b. We now show that if m (and consequently n) is a 
continuous parameter, (4.2) and (iiia) are equivalent. First, assume (4.2) and 
take a = —1,b = 1. This gives (iiia), if we substitute h for n. Next, (iiia) 
implies 


(4.3) lim va | Tik(x)] — 1\"dx = 0; lim oh [| T\k(x)] — 1\?>dx = 0 


for every finite a and b. But 


bh ah bh —ah 
Ree er ee ie 
—bh —ah ah —bh 
Hence (4.3) implies 
fim ah aL. +{- } | Tik(x)] — 1\/" dx = 0. 
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Since | T[k(x)] — 1 |” is a non-negative function, each of the integrals in the last 
relation is non-negative or non-positive, according as a < b or a > b, and 
consequently both of them must vanish in the limit. This gives (4.2), as we 
wished to show. 


TueoreM IIIa. Jf in Theorem Ill, K(x; m) = mk(mz), conditions (ii’) 
and (iii) can be replaced by 
+20 
(ii’a) | k(x)| dx < N; 
(iiia) as in Theorem Ila. 


The proof is immediate. 
TuHeoreM IIIb. Jf in Theorem Illa we desire the conditions to involve only 
the kernel and not its Fourier transform, we may replace (iiia) by the condition 


(iiib) (2n)-3 [ “we ee 


Proof. We shall show that (ii’a) and (iiib) imply (iiia). We know from 
(ii’a) that k(a) eL,. Hence 


T[k(x)] = (27) [- e~*"k(u) du, 


T(k(0)] = (2n)>3 / oS 


Since T|k(x)] is continuous and has the value 1 at the origin, so also must its 
mean value. That is, the condition (iiia) must hold. 

TueoreM [Va. If in Theorem IV, K(x; m) = mk(mz), conditions (i) and (ii) 
become respectively 


(ia) | T{k(x)]| < M for almost all x, 


(iia) lim 1 | T[k(x)] — 1|\¢dxr = 0, 
n—+0 2n —n 


where n is defined as 1/m. 
For the proof of this theorem the reader is referred to that of Theorem Ila, 
where the essentials are to be found. 


5. Remarks. Superficially it would seem that there should be theorems 
similar to Theorems II and III for the case f(x) e L,, and similar to IV for the 
case f(x) « L,. Whether or not this is true the author is not prepared to say. 
On the other hand, he can say with certainty that the same methods of proof 
would not apply, due to the asymmetrical properties of the Fourier transform, 
as evinced by the relation | T[f(x)] ||, < A(p) | f(x) |», which was so necessary 


to the methods used here. An example, however, of what can be done in this 
direction will be given without proof. It amounts to replacing the inequality 
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just mentioned by a less useful and less known one due to Hardy and Little- 
wood;? to wit, if f(z) « L,, then | T[f(x)]e®/? |, <= A(p) | f(x) ||>. Using 
this relation, the following theorem can be proved. 

TueoreM A. In order that | {T,(x; f) — f(x)}x?/? |, ~0asm— o~, 
for every f(x) which, together with its Fourier transform, belongs to L,, it is suffi- 
cient that K(x; m) satisfy the conditions 

(i) K(x; m) is for every m the Fourier transform in L, of some function in L,, 

(ii) and (iii) as in Theorem II. 


Yas UNIVERSITY AND Tue Horcuerss ScHoo.. 


7 Math. Annalen, vol. 97 (1926-27), p. 203. 














SYMMETRIC FUNCTIONS OF NON-COMMUTATIVE ELEMENTS 
By MarGarReETE C. WoLrF 


Introduction. A study of symmetric polynomials of matrices, for which 
the commutative law of multiplication need not necessarily be valid, led to the 
study of symmetric polynomials of certain abstract elements for which the 
processes of addition and multiplication obey the postulates of a linear asso- 
ciative algebra. This results in a generalization of the definition of the ele- 
mentary symmetric functions. For example, if x; and z2 are such elements, 
let 2:22 symbolize x; multiplied on the right by x2 and let x; + 22 indicate addi- 
tion of x; and x2; then since 2,72 differs in general from x2x7;, the second ele- 
mentary symmetric function of the elements x; and x2 becomes 


Es = 22t_ = 3X2 + 40h; 


but as before, FE; = =x, = x; + x2. The simple symmetric functions of third 
degree of the elements x; and re are 


> > 2 2, 2 
LLL, = XXX, + To Xo, LMX_ = MT. + Mor}, 
net 2 2 > 3 K 
Lrizs = Tits + 232, Sr? = x? + 23. 


These functions cannot be expressed as polynomials in FE; and EF» as in the case 
of commutative elements, but another polynomial, for example Yz,re7,, must 
be defined as an elementary symmetric function in addition to FE; and Ez if 
the fundamental theorem is to be reéstablished. Note also that F,E2 differs 
from F.k, for non-commutative elements. If three elements 2, re, x3 are 
considered, two polynomials of third degree are required to serve as elementary 
symmetric functions instead of the one function Es; = S2,rer3 of the commuta- 
tive elements. Two polynomials which may be used are FE; = Vaxyrer3 and 
Yx227;. This paper shows that as the number of elements and the degree 
are increased, an infinite sequence of symmetric polynomials, consisting of a 
finite set of one or more for each degree can be chosen so that every symmetric 
polynomial may be expressed uniquely in terms of the polynomials of this 
sequence and the coefficients of the original polynomial, with coefficients which 
are integral. This sequence may be chosen in more than one way but the 
number for each degree is unique. 

Since by the Poincaré equivalence theorem every linear associative algebra 
is equivalent to a matric algebra, no generality is lost if the elements are taken 
as matrices. 


1. Simple symmetric polynomials and elements completely non-commuta- 
tive of order m. The usuai definitions and theorems which apply to sym- 


Received October 31, 1935; presented to the American Mathematical Society, Septem- 
ber 10, 1935, under the title Symmetric functions of matrices. 
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metric polynomials of commutative elements must be modified in some 
instances in order to be applicable to symmetric polynomials of non-commuta- 
tive elements. This is done in §§$1, 2, and 3, and several new definitions are in- 
corporated, giving rise to new theorems which facilitate proving in §4 the general- 
ized fundamental theorem on symmetric functions. 

A polynomial in n elements x2;, 22, --- , 2, is said to be symmetric in these 
elements if it is unaltered by any interchange of the elements. A necessary 
and sufficient condition that a polynomial be symmetric is that it be unchanged 
by every interchange of two elements. 

If x1, %2, +--+, 2, are n non-commutative elements, a simple symmetric poly- 
nomial is defined as the sum of all terms obtained from a term xj! x2}! --- 2{ 
by allowing the distinct subscripts to take on all possible permutations chosen 
from the numbers 1, 2, --- , m, where , v2, --- , ve is a fixed set of exponents. 
Let the symbol 8,/’(a,) = Saxy! ait --- xf denote a simple symmetric poly- 
nomial of the n elements 2, 22, --- , Zn Of degree m = » + ve +--+ + m%, 
where j takes on one of the values 1, 2, --- , jm for each different simple sym- 
metric polynomial of degree m. 

The sum, difference, and product of any two symmetric polynomials are sym- 
metric. The degree of every term in the product of two simple symmetric 
polynomials is equal to the sum of the degrees of the two polynomials. 

In general, in the total matric algebra of order n, the terms of a simple 
symmetric polynomial as defined above will not be distinct, because some powers 
and products of the matrices will be commutative and polynomial relationships 
will exist among the products yielding a reduction in the number of different 
terms. 

A set of elements 2, 22, --- , 2, is said to be completely non-commutative 
of order m if no product is commutative with any other product whenever the 
sum of the degrees of the two products is less than or equal to m, where the 
factors of each product need not be distinct. Furthermore, this set is said to 
be independent of order m if a polynomial of degree less than or equal to m 
equals another polynomial of degree less than or equal to m only if the coeffi- 
cients of like terms are equal. 

In this paragraph it will be demonstrated that for every n = 2 and every m 
there always exists a matric algebra from which there can be chosen n matrices 
such that these n matrices are completely non-commutative of order m and 
independent of order m. Form all possible products through degree m of the n 
letters 21, 2, ---, 2, and with unity define these as basal elements of a finite 
linear associative algebra A over a field K. In A let every element of degree 
greater than m be defined as zero. That is, let the basal elements be 

U = l, “m= Xn, Me = Ze, ves, U, = Su; Ung. = Xite, Rin 
where the multiplication table by definition is of the form u,ju; = ux and uu; = 0 
if the degree of uju; is greater than m. This finite linear associative algebra 
with a principal unit is equivalent to a matric algebra with basal elements 
determined by the above multiplication table. With the use of this theorem 
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one can build n matrices x7;, 22, --- , Z, Which are independent of order m and 
are completely non-commutative of order m. 

From the definition of » elements which are independent of order m and 
completely non-commutative of order m, it follows that if any symmetric poly- 
nomial of these n elements contains the term z{!z{?--- x}{, then it contains 
every term obtained from it by the interchange of any two elements, and there- 
fore a term obtained by any permutation of the elements. Consequently every 
symmetric polynomial of degree m can be expressed as a sum of simple sym- 
metric polynomials of degree m. In the remainder of this paper, unless specifi- 
cally stated otherwise, every set of elements studied will be considered inde- 
pendent of order m and completely non-commutative of order m. 

It is necessary and sufficient to have m such elements to express all possible 
simple symmetric polynomials S‘??(zx,) for each degree m, inasmuch as there are 
exactly m positions to fill in each term of every S‘??(z,). 

In order to calculate the number of S),’’(z,) for every m, choose in each of these 
simple symmetric polynomials, S‘?’(z,,), the following typical term x7'z?? -.- xt, 
such that in the sequence of subscripts reading from left to right, each sub- 
script which differs from all those which precede it is the smallest integer which 


differs from those integers which precede it. That is, 7;,, = 1,2, ---,k, +--+ 49, 
or j + 1, but 7;,; = k only if each of the integers 1, 2, --- , k — 1 have occurred 
at least once as a preceding subscript. Hence 4, = 1; 72 = 2; 73 = 1 or 3; 


1, = 1, 2, 3, or 4, but 2, can equal 4 only if 73 = 3; 2, = 1, 2, 3, 4, or 5, but 7; 
can equal 5 only if 7, = 4, and 7; ¥ 4 if, for example, 73 = 1 and 74 = 2. Let 
n»(Z~) be the number of such typical terms of degree m with k different sub- 
scripts in a term, that is, k distinct elements in a term. To calculate the total 
number of S‘?’(z,) for a given m, one need but compute the value of n,,(a,) 
fork = 1, 2,---,m. These numbers can be obtained by means of recursion 
formulas. Assume that all m»:(7%,), k = 1, 2,---,m — 1, are known. A 
term of degree m in one element, that is 27, can be obtained from a term of 
degree m — 1 in one element by taking the m-th factor 7,. A term of degree m 
in two distinct elements z;, 22 can be obtained from a term of degree m — 1 
in one element by taking the m-th factor ze, or from a term of degree m — 1 
in two distinct elements, by taking the m-th factor either 2; or z2. Continuing 
in this manner, a term of degree m in k distinct elements can be obtained from 
a term of degree m — 1 in k — 1 distinct elements, by taking the k-th factor z,, 
or from a term of degree m — 1 in k distinct elements by taking the k-th factor 
any one of the k elements 2, 22, ---, 2. That is, 


Na(Xi) = Nai(M) = 1, 
Nm(Le) = Nm (21) + Zms(T2), 
Nm(L3) = Nm1(L2) + 3nms(zs), 

, 
Na(Le) = Nma(Lea) + knnr(re), 


, 


a Thm (Sn 1) l, 
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where the total number of polynomials S‘?’(z,) is equal to bm Nm(xx). For 
k=1 
m = 1 and m = 2, the formulas are n(7) = 1, me(x1) = 1, and ne(xe) = 1. 


TABLE OF THE NUMBER OF SIMPLE SYMMETRIC POLYNOMIALS FOR THE DEGREES 


























Sey 
Degree 
| } 1 | 2 | Be 2h ei? 8 
aa | —-- 

g| 1 |} 1] 1 | Sask. 1 1 1 
Oe SS eee See an ee Pee, ee Neees 
2| 2 | } 1 | 3 | 7 | 15 | 31 | @8 | 1297 
- | | | 
|). | __i_mc—|i—r_-_—-}—-|\—c-— 
s| 3 | | | 1 | 6 | 2 | 90 | 301 | 966 
=a Se oe eS Ss ee = = et CY: See ae 
= | 
E| 4 | 1 | 10 65 | 350 | 1701 
iv) | | | 
es es ee ee arma ‘iam Mi ee | a e oy —— a a ar 
= 5 | | | | 1 15 | 140 | 1050 
277 ees) eae a Mees eee wee Po Rae A re! OA 
— wf} | | | 1 | 21 | 266 
So | = } eee See | —— _—o ae = es at 
é | | | 
5); 7 | 1 | 2 
| aa. ee aS See ee Sw iee Oe 
=) 
ae : 

ae a a Se = 

ee ee ee ee ee || 

| Total | 1 2 | 5 | 15 | 52 | 203 877 | 4140 








A product of simple symmetric polynomials is equal to a sum of simple 
symmetric polynomials with coefficients of positive unity. To prove this state- 
ment, consider the product of two, S‘?)(x,)-S\(z,), of degree m and n respec- 
tively. Every term in each of the polynomials S‘?’(x,) and S\')(x,) has a coeffi- 


cient of positive unity by definition. If (xj! --- xj) (aij --- aim) is a term in 
the product, it can arise only once, since x;' --- x{* oceurs only once in S\’’'(x,) 
,’ . ’ ) 1 k . m n/? 
; = a" ‘ 
and aj! --- 2}™ occurs only once in S\*?(x,); furthermore, the term can not arise 
i m m n ’ 


from a different factorization because the first factor must always be of degree m 
and that of the second, n. The argument can be extended immediately to the 
product of three or more polynomials. 





2. Fundamental sets of order m. A set of simple symmetric polynomials 
is said to be a fundamental set of order m if every symmetric polynomial of 
degree m can be expressed uniquely as a polynomial in the polynomials of 
this set. 


The existence of fundamental sets will be proved later by the construction of 
particular fundamental sets. 
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If m elements are chosen which are independent of order m and completely 
non-commutative of order m, then a fundamental set of order m is analogous to 
the elementary symmetric functions of commutative elements. Since there 
are a finite number of the polynomials S‘/(zx,) for each degree m, there can be 
only a finite number of fundamental polynomials for each degree m. In general, 
a fundamental polynomial of degree m in the elements 2, 22, --- , 2, Shall be 
denoted by F,;’, where 7 identifies the different polynomials for one degree m. 
To each F,/’ a weight m shall be assigned. Two polynomials of equal weight 
are said to be isobaric. The weight of a product of fundamental polynomials 
is equal to the sum of the weights. Since the representation of the S‘?)(z,) 
in terms of the fundamental polynomials is to be unique, the F‘’’ must be so 
chosen that two polynomials of the F,’’ can only be equal if the coefficients of 
like terms are equal. 


3. An order for symmetric polynomials. In a manner similar to that used 
with commutative elements, the symmetric polynomials are ordered to simplify 
the problem of expressing the polynomials S‘?)(z,) as polynomials in the F‘)’. 

All simple symmetric polynomials of degree m are said to be of higher order 
than all those of degree less than m. 

The terms of a simple symmetric polynomial are ordered in the following 
manner. Theterm z;'z;: --- z;‘is said to be of higher order than xj'x;" -- - xj 
if the first non-zero difference in subscripts ¢ — 7’ is less than zero. 

The simple symmetric polynomials of degree m are ordered in the following 
(a,) and ayia? -.. ate 
is the highest ordered term of S,/’(z,), then S)'’(z,) is said to be of higher order 
than S,; (z,) if the first non-zero difference in exponents » — v’ is greater than 
zero. In case all differences are zero, that is, all exponents are equal, S‘')(z,) 
is of higher order than S/’/(z,) if the first non-zero difference in subseripts i — 7’ 


manner. If z;'z\*--- x;‘is the highest ordered term of S,' 


m 


is less than zero. 

A symmetric polynomial P is said to be of higher order than a symmetric 
polynomial P, if, after those simple symmetric polynomials which occur in 
both have been deleted, the highest ordered remaining simple symmetric poly- 
nomial of P; is of higher order than the highest ordered remaming simple 
-yrametric polynomial of P2. 

From this method of ordering all polynomials and their terms, a theorem, 
essential to later development, may be deduced, namely: 

The highest ordered term of the product S'''(x2,)S‘\)/(2,) is the highest ordered 
term of S(x,) multiplied on the right by that highest ordered term in S‘))(x,) 
of which the first letter is the same as the last letter of the term of S)'’(2,). 


4. A fundamental! set of order m. In the following paragraph a particular 
fundamental set of order m of simple symmetric polynomials will be defined. 
‘Yo distinguish these from other fundamental sets, designate the polynomials 


“” 


as K,’’, where m indicates the degree and 7 takes on one of the values 1,2, --- ,Jm 
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for every distinct fundamental polynomial of a fixed degree m. It is proved 
that the elementary symmetric functions of commutative elements are a special 
case of the polynomials E}’’, and that the fundamental theorem concerning 
elementary symmetric functions may be restated and proved for the poly- 
nomials E‘??. 

(a) Define EZ; = Tx,. 

(b) Every simple symmetric polynomial of degree m whose highest ordered 
term is not the highest ordered term of some product of weight m of E\!’, 
where n = 1, 2,---,m — l andj = 1, 2, --- , jn, is to be defined as an E\?’, 
j - 1, ae ? jm. 

All E};’ through the fourth degree are given in the following set: 


Ey = =n, 


E, = rx17%2, 


1) > 2) > 

} +8 = Lr), E = Txt, 

ual > 2 2) * 73 > 

EY? = Sxnzizs, E'S) = UxFsrits, EX?’ = S2Fsrils, 

EY = T2yrrtsh, ES) = UxreFste, EYE? = Treats. 

TABLE FOR THE NuMBER OF E\!’ ror m = 1,2, --- ,6 
Degree 

= 1 2 3 1 5 6 
E | ' 
Qo |- - 
= | | 
8 2 1 1 1 1 1 
© 3 l 4 12 33 
S 
& { 1 S i4 
7 
= | 
os 5 l 13 
& | 
= 6 \ 
3 
“| Total l 1 2 ti 22 92 


In constructing the EY!" for a given degree m, the followimg facts are useful 


The first is that every simple symmetric polynomial of the form Sx. 2 Pin 


such that ¢) # oj.) is among the EF, These polynomials are the direct gen- 
eralization of the elementary symmetric functions but they are not sufficient 





to form a fundamental set of order m. ‘The second useful fact is that the 


exponent of the first and last letter of every BS!” is unity. For suppose the 
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first exponent is not unity but ». One can then factor the highest ordered 
vy —l 


term into xz} ' and xmxr}:--- 2,*. But xj" is the highest ordered term in 
(E,)"*"'. Therefore the highest ordered term of E‘?? is the highest ordered 
term in a product (£,)"'~' times some other product of E‘!”’s. Similarly, a 
contradiction is reached if the last letter is assumed to have an exponent not 
unity. 

THeorEM. If the elements 2,, 22, ---, Xn are commutative, the definition of 
the fundamental set of order m which gives the E‘}? yields the first n elementary 
symmetric functions. 

As in the case of commutative elements, E; = 22, FE. = Taye. If the 
elements are commutative, the highest ordered term of a product is equal to 
the product of the highest ordered terms. Therefore the highest ordered term 


can be written as r/'x3*.-- ri*, where hy = he => --- = hy and 


hi t+ he t+ --- +h, = m. 


If h; > 1, the highest ordered term can always be expressed as the highest 
ordered term of the product of elementary symmetric functions, each of weight 
less than m, in the following manner: E}* . ERt"** .... 

Example. The highest ordered term of LTriz$x$z4z5 is the highest ordered 
term of E,-E}-E}. If hi = 1, then he = h3 = --- = hy = 1, where 


hy + ha + +++ + he = m. 


This cannot be expressed as the highest ordered term of a product of elementary 
symmetric functions of weight less than m, because in every such product h; 2 2. 
Then Y2,272 --- Z, would be, according to the above definition, an elementary 
symmetric function. This is in accordance with the usual definition of ele- 
mentary symmetric functions. 

THEeorEeM. Any polynomial of degree m symmetric in the n elements 


Ti, te, +*+5yTn 


is equal to an isobaric polynomial of weight m, with integral coefficients, in the E\?? 
and the coefficients of the polynomial. 

The proof is based on induction on the order of the symmetric polynomials. 
The theorem need only be proved for simple symmetric polynomials, since 
other symmetric polynomials are sums of the simple symmetric polynomials. 
Given any simple symmetric polynomial, assume all simple symmetric poly- 
nomials of lower order capable of being expressed as polynomials in the EY’. 
Since the F,’’ are defined so that they contain all simple symmetric polynomials 
which are not of the highest order in some product of E\?’, every other simple 


symmetric polynomial is the highest ordered polynomial in some product 


BY’. BY... BY, or in other words, 
SU'(r,) = BY. BY» ... B® — (lower ordered terms). 














ll 
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But lower ordered terms are expressible as polynomials in E\/’; hence S\?’(z,) 
is also expressible as a polynomial in the E\’’. The theorem is true for the 
lowest ordered simple symmetric polynomials of every degree, because each 
such 22122 --- 2 is defined as an E\’). The induction proof is then complete, 
and furthermore the coefficients are obviously integral, and the terms are iso- 
barie of weight m. 

In order to establish that the set of polynomials E\’’, k = 1, 2, --- , m, form 
a fundamental set of order m, it remains to prove that every S‘/’(zx,) can be 
represented uniquely as a polynomial in the E‘’’. Before this can be accom- 
plished it must be proved that the highest ordered terms in products of EF‘? 
are distinct. It has been proved that in the highest ordered term of a product 
the last letter of the left factor must be the same letter as the first in the right 
factor; it has also been proved that the exponents of the first and last letters 
of any E\!) are unity. Let hi = 2, --- 24, be the highest ordered term in E\?* 
and let hj..., = 2% --- i; be the highest ordered term beginning with ry 
in the product EY? ... EY”. Then 


hyh;... + = [xi te zi) [rx eee xij) 


is the highest ordered term in E\’* . [E\’? -.- EY’”]. Now suppose it is also 
the highest ordered term in E}?? . [E\i™ ... E\??], where 1 > i. Let 


hi = [i +++ Lig Vig ++ Lin 


be the highest ordered term in E}’? and let hm... . = [xi, «++ 2;;] be the highest 


ordered term beginning with x;, in E{i" ... EY*. Then 
hihi ... s = [xi, ce Din Tepe ° Link (Tin ‘ee zi,] 
is the highest ordered term in E,!? [E\i~ ... EY*]. But 
he r= Vig + Lins Lig es Vij 
is the highest ordered term beginning with x;, in EY’? --. EY”, and ry +++ 2iq 


must be the highest ordered term beginning with rx; in some S\/'(2,) = 
Yr, +--+: ti,. By a previous theorem 


SS(x,) = EVM EG ... EG” — (lower ordered terms). 
Consequently the highest ordered term in FE’? is the highest ordered term in 
EY). EQW EO... EG”. This is a contradiction. If [x,, --- ra) [rm «++ 24] 
is the highest ordered term in E\!" . E\}? ... EY? and the highest ordered 
term also in EY*{EY~ ... EY°], where one can assume E\!? # EY”, the 
first part of the proof can be reapplied to the second two factors [Ev? ... Eve] 
and [HOm ... BG®), and a contradiction is reached. 

Tueorem. Every simple symmetric polynomial of degree m can be represented 
uniquely as a polynomial in the set of polynomials E\!’, k = 1, 2, «+, m. 
Let F(R’) be a polynomial in Ey, --- , EY? and let one term be 


P = MEVR ... EY 


te 
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(all like terms being combined into one term P). If now the E‘’ be replaced 
by the z,’s, a symmetric polynomial ¢(7;) of the x,’s is obtained with the 
highest term 


Yn 


A = Mzj'z;2 +--+ 23". 


The terms P of F(E!}’) are ordered so that P is of higher order than P’ if 
after substitution of the z,’s the corresponding term A has a higher order 
than A’. For different terms P, the highest ordered terms A will be different 
by the ordering and the preceding theorem. Consequently the highest term P 
of F(E,;’) has the same coefficient as the highest term A of ¢(z,). Hence if 
every coefficient of F(E‘!’) is not zero, every coefficient of ¢(x;) is not zero. 
If ¥(z,) = F(E'’) and Y(x;) = F,(E\?’), where every coefficient of F — F, is 
not zero, upon substitution of the z,’s a contradiction is reached. Hence the 
representation of S,/’(z,) in terms of the E’’’ is unique. Consequently there 
can be no polynomial relationship among the E’’. 

This completes the proof that the E,’’,k = 1, 2, ---,m, form a fundamental 
set of order m. 


5. A second ordering and a second fundamental set. The terms of a 
simple symmetric polynomial may be ordered as in the first order. me 
term xjirj}--- x}, is said to be of higher order than the term xj},x7?, --- x{4,, if 
the first non-zero difference in the subscripts 7 — 7’ is less than zero. 

The simple symmetric polynomials of degree m are ordered in the following 


manner. Let xjjrj}--- 25; be the highest ordered term of Sj;)(z,) and 
rz? --. £2 be the highest ordered term of S{?’(z,). Write these highest 
erde red terms in the form iti, +++ Li, and Fry, +--+ Fe, 80 that every exponent 
is unity. The simple symmetric polynomials may be ordered according to the 
order of the highest ordered terms.  z; igs * Lim is said to be of higher order 
than zy,2,--- ty, fa < i, Ifi= w€. then r,7;,--- 2%, is of higher order 
if ig < iz, and if 1; = 7;, i2 = ig, then z:2;, --- 2;,, is of higher order if i3 < 74, 


ete. This process is continued until all simple symmetric polynomials of 
degree m are ordered. The polynomials of degree m are considered to be of 
higher order than all those of lower degree. 

Under this ordering it can be proved that the highest ordered term of the 
product of two simple symmetric polynomials S!')(2,)S')?(a2,) is equal to the 
highest ordered term of S{'’(x,) multiplied on the vight by the highest ordered 
term of S,’’(z,). 

Define a set of functions as follows. (a) Hy, = XYa,. (b) very simple 
symmetric polynomial of degree m of which the highest ordered term is not the 
highest ordered term of some product of weight m of I)/’, n 1,2,---,m—1 
and j 1,2, ---, Jn, is to be defined as an H,,?’, 7 San 

All 12? through the fourth degree are in the following set: 
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Hy, = 2n1, 


Hz = rxi7t2, 


1 > 2 (2) > 
HS) =<Ixnzi, HY = Sarees, 
1 - ) . 3 ‘ 9 
HY = T2zrerizs, H®?? = 32273, HY = Uxzrizs, 
. (5) _ 2 . 
H\® = Txzersre, H'® = S277, HS) = Uxirersrs. 


It can be proved that the above set forms a fundamental set which is also a 
direct generalization of the elementary symmetric functions. The proofs are 
almost identical with those of the preceding paragraphs. 


6. General properties of fundamental sets. The choice of simple sym- 
metric functions which can be used for fundamental polynomials of order m 
is not unique, but there are several properties which are common to all, which 
can now be stated and proved. Clearly, every fundamental set of order m 
must have 22, as the first polynomial. It can be proved that every simple 
symmetric function of degree m is contained in (Zx,)". The statement is easily 
verified for degrees one and two, and the proof can be completed by induction. 
Assume (22;)"~! contains every simple symmetric polynomial of degree m — 1. 
Multiply (22)""(221) = (2x,)". The m-th position is filled im every possi- 
ble way, since Sx, contains every x;. Hence the above conclusion is true. 

THeorEeM. The simple symmetric polynomials of degree m which involve two 
elements x1, %_ require for representation as polynomials in a fundamental set 
of order m one and only one fundamental polynomial of every degree 1, 2, --- , m 

The theorem is proved by induction. It can be demonstrated for degrees 
one and two that 


Fy 


+ 22, Fy, = Wye + Met, (F,)? - F. = z* a z3, 
or 

’ >’ 2 2 x’ yo , 

Fi, =%1+ 22, Foe =27,+ 23, (F,)? — Fe = x%2 + Ten. 


Assume that all simple symmetric polynomials through degree m — 1 can be 
expressed as polynomials in a set of fundamental polynomials Ff), F2, --- , Pea, 
where there is one and only one F, of each weight k Ss m — 1. There is a 
totality of 2"~' simple symmetric polynomials of degree m involving two letters, 
since m positions can be filled by two letters in 2" ways, but those two permuta- 
tions belong to one simple symmetric polynomial which differ only in the inter- 
change of 2, and a,. From the above assumption the total number of products 
of the Fy of weight m is 2""' — 1 beeause the total number of compositions! 
of m from 1, 2, +... , mis 2" ', and if m is excluded, the number is 2"! a 
The representation of the simple symmetric polynomials of degree m, the Syy (2), 


'Macmahon, Combinatorial Analysis, vol. 1, p. 151. 
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as polynomials in the F; must be unique; hence the F; must be so chosen that a 
polynomial P;(F;, F2, --- , Fm) can equal another polynomial 


P(F;, F2, er » Fa), 


if and only if the coefficients of like terms are equal. Consider the products of 
the F, of weight m as 2”-' — 1 equations in the 2” unknowns S‘?)(z,). It 
has been proved that every S,/’(z,) is present in at least one product, namely, 
(2z,)", and it has been proved that a product of simple symmetric polynomials, 
and therefore a product of the F;, is a sum of simple symmetric polynomials 
with coefficients positive unity. The rank of the matrix of the coefficients of 
the S)/’(z,) is equal to 2"~' — 1, which is the number of equations. That is, 
the rank is equal to the number of products of the F; which are of weight m, 
because there is no polynomial relationship between the products. Since there 
are 2”-' unknown S}/’(z,), it is necessary that one S‘?)(z,,) be assigned arbitrarily 
so that one can solve for the other S}?’(z,) in terms of the products of the F;,, 
k = 1,2, ---,m—1,0f weight mand the fixed S{))(x,). Let this S{7)(z,,) = Fm. 

From this theorem it is clear that the fundamental polynomials of order m 
form an infinite sequence as m increases indefinitely. 

Tueorem. For every fundamental set of order m the number of polynomials 
for each degree n <= m is unique, and is equal to the total number of simple sym- 
metric polynomials of degree n minus the number of products of weight n of the 
fundamental set F,?’, k = 1, 2,---,n — 1. 

Let the number of products of weight n of the F,’’ be t. Let the number of 
S,/ (z,) of degree n be s. The t products are t equations in the s unknowns 
S,//(z,) as in the preceding theorem. Since there is no polynomial relationship 
between the ¢ products, the rank of the matrix of the coefficients of the S‘/)(z,) 
ist. There is then a solution in which the S,’’(z,) are expressed as polynomials 
of the F'’’, k = 1, 2,---,n — 1, and s — tof the S‘(z,). Assign these s — t 
of the S'/'(z,) as F.?’,j7 =1,2,---,s—t. The preceding theorem necessitates 
that t < s, because at least one F,’’ exists for every m. 

7. Remarks. As the degree m increases, the number of fundamental poly- 
nomials increases by a finite number for each m. This distinguishes them from 
elementary syrametric functions. Because of this difference it cannot be ex- 
pected that all the theorems on symmetric functions of commutative elements 
can be generalized to non-commutative elements. For example, the theorem 
which states that every symmetric polynomial can be expressed as a polynomial 
of the S, = Zz}, which are sums of like powers of the elements, does not hold 
in the case of non-commutative elements. Another theorem which does not 
hold is the following. 

A symmetric polynomial in 2, Le, +++, £, when written in terms of the ele- 


mentary symmetric functions hy, hy, ---, My, will be of the same degree in the 


hs as it was in any one of the z's. 
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An example to show this is not true is given by E\°’ = Sx,r3 2s, of first degree 
in E{’, but of second in 22. 

It has been proved, in general, that every simple symmetric polynomial can 
be expressed with integral coefficients as a polynomial in the fundamental poly- 
nomials called E{). It can be exhibited that all simple symmetric polynomials 
in the five non-commutative elements 2, 22, 3, 24, 25 Of degrees 1, 2, 3, 4, 5 
can be expressed as polynomials in the H‘?) with coefficients which are not only 
integral but are positive or negative unity. It is conjectured that every simple 
symmetric polynomial of degree m can be expressed as a polynomial in the 
fundamental polynomials with coefficients equal to positive or negative unity. 
For all the specific cases computed it was found that every simple symmetric 
polynomial of degree m when expressed as a polynomial in a fundamental set 
includes as a term an F‘?? of weight m for at least one value of j. These and 
several other properties of fundamental sets are open for consideration. 


UNIVERSITY OF WISCONSIN. 














FUNCTIONS ARISING FROM DIFFERENTIAL EQUATIONS AND 
SERVING TO GENERALIZE A THEOREM OF LANDAU 
AND CARATHEODORY 


By Joun W. CELL 


Introduction. The hypergeometric linear differential equation which has 
for solutions the quarter periods of elliptic functions has been studied extensively 
by Fuchs' and Tannery.2 By the use of a particular quotient of two of its 
solutions Picard and Landau’® proved their remarkable theorems on analytic 
functions. If a certain transformation is made so that the exponents at the 
singular points (0, 1, ~) of this hypergeometric equation are all equal to each 
other, the equation so obtained is invariant with respect to the linear fractional 
dihedral group of order six, generated by z’ = 1 — z and 2’ = 1/z, where zisa 
complex variable. 

The cyclic, dihedral, tetrahedral, octahedral and icosahedral groups are the 
only groups of finite order which are representable on linear fractional substitu- 
tions of a complex variable.‘ We shall specialize the exponent differences at 
the three singular points of the hypergeometric equation and then make the 
substitution z = z" on the independent variable. For each of the four specializa- 
tions to be made we shall obtain an equation which is invariant with respect to 
some one of the first four groups named above, which is such that the exponent 
difference at each singular point is zero, and which has the property that an 
appropriate quotient function of two of its solutions has properties quite similar 
to those of the quotient function already mentioned. 

We shall thus obtain four quotient functions, and by their use we shall obtain 
specific formulas for the radius of the circle in which every function of the form 
F(z) = a9 + aye + aor*® + --- (a; ~ O) must either have a singularity or assume 
one of a certain set of values as, for example, the n n-th roots of unity. More- 
over, these radii will depend only on do, a;, and this set of values. 


1. Specializations of the hypergeometric equation. In the hypergeometric 


equation 
(1) 42(z — 1jo’’ + 44 (z —1)(1 — A) + 2(1 — pw) fo’ + [C1 — A_— pp)? — Jv =0 


the singular points are at z = 0, 1, and « with exponents 0, A; 0, 4; 
41 —A —p—v),41—A — yw 4 v), respectively. 


Received August 26, 1935. 

’ LL. Fuchs, Journal fir Mathematik, vol. 71 (1870), pp. 91-127. 

2 J. Tannery, Annales de |’Keole Normale, (2), vol. 15 (1879), pp. 169-194. 

*h. Landau, Vierteljahrechrift der Natur. Gesellschaft, vol. 51 (1906), pp. 252-318 
*}. Klein, Lectures on the Icosahedron, p. 126. 
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If we set \ = » = v = O and denote by V(z) the solution which is regular in 
the neighborhood of the origin and which is such that V(0) = 1, then V(1 — z) 
is also a solution of equation (1) and is regular in the neighborhood of z = 1. 
We define 


T(z) = iV(1 — z)/V(2). 


This is the quotient function which was used by Picard and Landau. T(z) 
possesses the following properties if z stays away from 0, 1 and =, then (1) T(z) 
is regular, (2) ¥ [T(z)] > 0, (3) T’(z) ¥ 0; if z = 2(T) is the inverse function and 
if $ (T) > 0, then (4) 2(T) is single-valued and regular, (5) 2(T) stays away from 0 
and 1; moreover, (6) the axis of reals is a natural boundary for 2(T), and (7) 2(T) 
is a fuchsian function. 

In a linear differential equation of the second order whose singular points are 
all regular singular points, the inverse of the quotient of two solutions is a single- 
valued function if and only if the exponent difference at each singular point is 
individually either zero, the reciprocal of an integer greater than 1, or 1 with the 
condition that a transformation can be made so that this singular point becomes 
an ordinary point in the resulting differential equation. Moreover, the inverse 
function of the quotient of two such linearly independent solutions will stay 
away from those values which correspond to singular points with exponent 
difference zero.’ 

We make the transformation z = f(x), v(z) = u(r) on equation (1) and require 
f(x) to be single-valued and such that the resulting second order differential 
equation has the following properties: (A) the coefficients are rational functions, 
(B) the singular points are all regular singular points, (C) the exponent difference 
at each singular point is either zero or the reciprocal of a positive integer. (If 
this integer is 1, a further condition is necessary as before.) Moreover, \, 4 
and y are to be either zero or the reciprocal of an integer greater than 1. A 
straightforward computation shows that if is a positive integer and ¢ is a non- 
zero constant, only the following six transformations possess the requisite 
properties: 2 = czr*,z = 1 — er*,z = c/z*,2 = 1/(1 — cr"), 2 = (cr* — 1)/(cr"), 
z = ex"/(cx™" — 1). But these are all essentially z = cr*, since we may obtain 
the others by first permuting the singular points of the hypergeometric equation 
and then making this transformation. We shall work out the case for ¢ = 1, 
since the general case is obtainable from this by a transformation of the form 
xr’ = kx, where k # 0. 

We make the transformation z = 2", (2) = u(r) on equation (1) and obtain 

4r(r" — lu’? + 441 An)(2" 1) + nl — p)rrtu’ 


(2) 
+ nt(l —A — wp)? — Fiz" u = 0. 


® Tannery, loc. cit.; Klein, Vorlesungen ber die Hypergeometrische Funktion, 1933, p. 291. 
*L.R. Ford, Automorphic Functions, pp. 298 and 304. 
’ Klein, see footnote 5, p. 292; H. Poincaré, Oeuvres, vol. 2, p. 14. 
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If ¢ = e™/", the exponent differences at the singular points of equation (2) 
are respectively: An at z = 0, vat az = 1,o,o°,---,o" ,andynatr = o. If 
the quotient of two solutions of equation (2) is to have properties similar to those 
of T(z), we need to consider only the following cases: 

CaseI. } = 1/n,p =v = 0, 

Case II. AX =v» = 1/n,p = 0, 

Case HI. AX =u=v=0. 

The case of X = » = 0, v = 1/n is a transformation of case I by the trans- 
formation x’ = 1/r. We shall study the three cases in $§2—4. 

For convenience we introduce the following notation. We use F(a, b, c, x) 
to denote the hypergeometric series 


a-b a(a + 1)(b)(b+ 1), 
+754 + “Te 





and its analytic continuations. In each of the three cases to be considered we 
cut the plane by joining each singular point of the corresponding differential 
equation toz = « by straight lines which, if continued, pass through the origin. 
If c = Ois a singular point, we join it tox = ~ along the negative axis of reals. 
We use the term “principal determination” to describe the function defined by 
the hypergeometric series and its analytic continuations in this cut plane and 
denote the function, so defined, by F*(a, b, c, x). Similar definitions will pertain 
to the other solutions of equation (2) which are to be considered. We denote 
by D(a, b, c, z) the function defined for | z | < 1 by 

lim e"{F(a + ¢€,b + €,c + 2c, x) — e-*'**F(a, b, c, x)}, 

e=0 
(where the principal determination is assigned to log x) and its analytic continua- 
tions. We observe that D*(a, b, c, 2), which is this function in the cut plane, is 
expansible in a series whose general term is the corresponding (7 + 1)-th term 
in the hypergeometric series multiplied by 


iyla + j) + ¥(b + j) — Ale + 7) — Ya) — l(b) + BWic) + log z}, 
where (a) is the classic function I'’(a)/T(a). 
2. Casel. A= 1/n,n=v=0. The hypergeometric equation (1) for this 
case has the fundamental system of solutions F(a, a, 2a, z) and e@*?/" F(L — a, 


1 — a, 2 — 2a, z), where a = (n 1)/(2n). Moreover, equation (2) for this 
case becomes 


(3) A(x” L)u’’ + 4nz™"* u’ + (n — 1)? 2*?u = 0. 
The results for n = 1 are trivial, so we shall suppose that n 2 2. We sub- 
stitute ulz) = (1 — 2) m/@ns) y(z7) and obtain a differential equation which 


has regular singular points at the n n-th roots of unity and at # with equal 
exponents (n 1)/(2n 4+ 2), and which is invariant with respect to the cyclic 


group of order n, generated by 2’ = or. Forthe special case n = 3 it is invariant 
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with respect to the tetrahedral group of order 12, generated by x’ = ox and 
= (x + 20*)/(ox — 1). Equation (3) for this case is also invariant with 
respect to this same cyclic group. 

We refer to the identities connecting the several solutions of the hypergeo- 
metric equation,® specialize them by using standard limiting processes, make our 
substitution z = 2", and obtain the following four identities which are valid for 
zx in the sector S, bounded by the rays (0, 1, <) and (0, «, «): 


ae 


F*(a, a, 2a, x") = Pa) 


{[2y(1) — 2y(a)] F*(a, aw, 1,1 — x”) 


— D*(a,a,1,1— 2x")}, 


2F*(1— a,1—a,2 —2a,2*) = a, (2y(1) — 2p(1 — ad] 
(4) F*(a, a, 1,1 — 2%) — D*(a,a,1,1— 2"), 
F*(a, a, 2a, 2") = me) eaten tons) {[29(1) — 2y(a) — wi] F*(a, 1 — a, 1,2-*) 

— D*(a,1—a,1,2-")}, 
2F*(1— a, 1 — a,2 — 2a, 2") = nas si(t—a)~an lot ={[2y(1) 


— 2y(1 — a) — ri] F*(a, 1 — a, 1,2-) — D*(a, 1 — a, 1,2")}. 


We observe that the hypergeometric series in the definition of F*(a, a, 2a, x") 
converges for | x | < 1, F*(a, 1 — a, 1,27") for |x| > 1,and F*(a, a, 1,1 — 2x") 
for | 1 — a2*| < 1 and hence in the interior of the rose p = 2 cos né@ (in polar 
coordinates with pole at the origin). We use F*(a, a, 1, 1 — 2") to denote 
the function defined by the corresponding hypergeometric series for x in the leaf 
of the rose about x = 1 and its analytic continuations in the cut plane. Corre- 
sponding remarks apply to the other three solutions which occur in these identi- 
ties. Other identities, valid for x in the other n — 1 sectors of the plane, are 
easily obtainable from the group invariance property. These identities make 
evident the fact that the hypergeometric series in the definition of F*(a@, a, 1, 
1 — x") defines n distinct functions. 

THeoreM 1. The quotient function 


h(x) = KywF(1 — a, 1 — a, 2 — 2a, x") F(a, a, 2a, x"), 


where Ky = V(2a)(1 — a@)/T(2 — 2a)I*(a), has the following properties: if 
a" # Land-x # ~, then (1) h(x) is regular, (2) | h(x) | < 1, (3) ty (2) = 0; of 
x = x(t) is the inverse function, for | t| <1, (4) x(t) ts single-valued and regular, 
(5) {a(t)}" # 1; moreover, (6) | t| = 1 ts a natural boundary for x(t), and (7) x(t) 
ts a fuchsian function. 


* E. Goursat, Annales de l’Ecole Normale, (2), vol. 17 (1881), appendix; especially pages 
20-21, 28-30, 34-37. 
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We shall first establish the second proposition as follows. Because of the man- 
ner in which the two solutions in the definition of t;(7) have been defined, we may 
consider N(x") = (2) = N(z). But N*(z) maps the upper half z-plane upon 
a circular are triangle with vertices N = 0 (corresponding to z = 0), N = 

z = 1) and N = e*™/"(z = ~) and with interior angles +/n, 0, 0, respectively.’ 
Then ¢j(x) maps the sector S,; upon this same circular are triangle. We use 
the principle of reflection” to see that ¢{(x) maps the cut x-plane upon the 
circular are 2n-gon R which is such that if we join each vertex to the origin the 
2n triangles so formed are all congruent (with respect to the powers of the sub- 
stitution t’ = ot) to the circular are triangle described above or to its reflection 
in the axis of reals. 

We define f(x)],-< as the result of taking f(z) about a simple closed contour 
which encircles the singular point x = a in a counterclockwise direction and 
which encircles no other singular point of f(z). 

We make use of the identities (4) and similar identities to show that ¢,(x) 
possesses the following circuit properties: 


ul + i cot (x 2n) (x) — — io cot (/2n) 


(5) ti(r)lemt = > cot (x 2n)ti(r) + {1 — 7 cot (x/2n)}’ 


(k = 0,1,2,---,n — 1). 


To obtain the general map, we apply to the region R the group of substitutions 
G generated by the substitutions (5) and their inverses. We make use of the 
idea of isometric circles" of these substitutions to see that the map so obtained 
is interior to ¢| = 1 except for the vertices of R and points congruent to these 
vertices (these points are the images of the singular points 1, a, ¢*, --- , "1, ©) 
which all lie on the above circle. Each vertex and the points congruent to it 
form an everywhere dense set of points on this circle. 

Proposition (1) of our theorem can now be established from the observation 
that the numerator and denominator of &4(2) together form a fundamental 
system of solutions of equation (3), and hence 4,(z) is regular for x away from the 
singular points of that equation, except possibly for simple poles. But from 
the map already described we see that 4(2) has no poles. As a corollary we 
observe that since these two solutions cannot vanish simultaneously for x away 
from the singular points of (3), the denominator of (27) and hence F(a, a, 2a, x") 
has no zeros and no poles for x away from these singular points. But 


ti(z) = Ki(1 — 2") [F(a, a, 2a, x"), 


and hence proposition (3) of the theorem is evident. 
Proposition (4) follows from the fact that the general map fills up, without 
overlapping, the interior of |f| = 1. From our observations concerning the 


* See Klein, Hypergeometrische Funktion, p. 196; Ford, loc. cit., pp. 304-305. 
10 See Bieberbach, Lehrbuch der Funktionentheorie, (3rd ed.) vol. 1, p. 225. 
! Ford, loc. cit., pp. 26-29 and Chapter III. 
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images of the singular points of equation (3), the propositions (5) and (6) follow. 

x(t) is a fuchsian function because it is automorphic with respect to the group G, 

which has |¢| = 1 as its fixed circle. Hence the theorem is established. We 

observe that these properties may also be established from a direct study of 

equation (3) without reference to its relation to the hypergeometric equation. 
Let w = e**38, We define 


(6) g(x) = por = 


for k = 1 here and for other k later as we introduce new functions &(7). Then 
g(x) has properties similar to those of t,(r) except that |¢| < 1 becomes 
Sg) 2 0. 

We make the transformation 2’ = 1/zr and obtain the following 

Coro.tiary. The quotient function t(x) = (1/2) has the following properties: 
if x" # land x # 0, then (1) te(x) is regular, (2) | te(x) | < 1, (3) to(x) ¥ 0; of 
x = x(t) is the inverse function and if |t| < 1, then (4) x(t) is single-valued and 
regular except for simple poles, (5) x(t) # 0 and {x(t)}" # 1; moreover, (6) |t| = 1 
is a natural boundary for x(t), and (7) x(t) is a fuchsian function. 


3. Case II. 3 =v» = 1/n, yn = 0. Equation (2) for this case becomes 
(7) 4(2" — 1)w” + 4nz™—u’ + n(n — 2)2"7 u = 0. 


We obtain trivial results if n = 1 or n = 2, so we shall suppose that n = 3. 
If we make the transformation u(r) = (1 — 2")@™/@™ y(x), we obtain an 
equation which is invariant with respect to the dihedral group of order 2n, 
generated by 2’ = ox and 2’ = 1/z. 

As in the preceding section we obtain the following identities, valid for x in S;, 
with a = 1/2, 8 = (n — 2)/(2n) and conventions as before for the solutions 
involved: 


. »\ _ la + B) = 
F*(a, B,a + B,x )= Ia a) T(g) 2vY) — Wha) — ¥(8)] F (a, B, 1,1 — Zz ) 
— D*(a, B,1,1 — 2")}, 
rF*(1 — a, 1 — 8,2 —a— 8,2") = i-«— {(2y(1) — ¥(1 — a) 
' r(1 — a)P(i — B) | 
(8) — ¥(1 — B)] F*(a, 8, 1,1 — x") — D*(a, 8, 1,1 — x")}, 
F*(a, B, a + B, x”) = Meta poi 1) ea(ri—nlogz) R*(] — a, Se 8, 


Pa + 8)P(1 — a — 8) 


2—a-— 8,2") + ra) 


ebcri-ntoe2) P*(a, B, a + B,x-*), 
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I(2 — a — B)I(a + B — 1) 
(1 — a) 
PL — a1 —8,2—a— Br) + 8A ao) 


e8(ri—n log z) F*(a, B, a + B, rh. 





ea(ri-n log z) 


2F*(1 — a, 1 — 8,2 — a — 8,2") = enn 





THEOREM 2. The quotient function 
ts(x) = K3xF(1 — a@, 1 — B, 2-—a— B, x")/F(a, B, a + B, z*), 


where K; = T(a + B)F(1 — B)/T(2 — a — B)T(B) has the following properties: 
if x" ~ 1, then (1) ts(x) is regular, (2) | ts(x)| < 1, (3) t3(x) ¥ 0; ef x = x(t) is 
the inverse function and if \t| < 1, then (4) x(t) is single-valued and regular except 
for simple poles, (5) {x(t)\" # 1; moreover, (6) |t| = 1 is a natural boundary for 
x(t), and (7) x(t) is a fuchsian function. 

The proof of this theorem is so similar to the proof of the preceding theorem 
that we shall give only the following relations which are necessary in this proof: 


t;(x)] _ (1+ 2icot(r/n))ts(x) — 2io* cot(r/n) 
Meeene® ~~ Qia-* cot(x/n)ts(x) + (1 — 2i cot(x/n))’ 


K =0,1,---,n—1, 
t3(x) = K;(1 — 2") [F(a, B, a + B, x")}°. 


t;(2) maps the cut plane upon the circular are 2n-gon which may be described 
thus: its vertices are at ¢ = 1, t = e*/" cos (x/n), t = e~*/™ cos (x/n) and the 
points congruent to these three points with respect to the powers of t/ = ot. If 
we join the first two points by a circular are which makes a zero angle at t = 1 
with the line segment joining ¢ = 1 to the origin and which makes an angle /n 
at the other point with the line segment joining this point to the origin, the 
other ares of this 2n-gon are congruent either to this are or to its image in the 
axis of reals. The general map, as before, is obtained by the use of the isometric 
circle of the substitutions (9) and their inverses. 

We have here the additional relation, valid for x in S,, 





(9) 


{1 — icot (r/n) \ts(1/x) + i cot (m/n) 
—i cot (r/n)t3(1/x) + {1 + 7 cot (x/n)}’ 
where t;(1/x) = K3F(1 — a, 1 — B, 2 — a — B, x") /rF (a, B, a + B, 2"). 


s(x) = 





4. Case lll. AX =p =v=0. Equation (2) becomes 
(10) 4x(x" — 1)u” + 4{(n + Iz” — Lhu’ + neu = 0. 


If we make the transformation u(x) = (4 — x**)-"/@n+® y(7), we obtain an 
equation which is invariant with respect to the dihedral group of order 2n, 
generated by 2’ = oz, x’ = 1/x. For the particular case n = 4 it is invariant 
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with respect to the octahedral group of order 24, generated by x’ = oz, x’ = 
(x + 1)/(@@ — }). 

In this case, where we use the principal determination for the logarithm and 
where F*(3, 3, 1, 2") = f*(a") and D*(3, 3, 1, x") = d*(x"), the four identities, 
valid for x in S,, become 

af*(a") = 4 log 2f*(1 — 2") — d*(1 — 2"), 

nd*(x") = (16 log? 2 — m*)f*(1 — x") — 4 log 2d*(1 — x"), 
mf*(xr)ehos= = (47 log 2 + w)f*(a-™) — id*(a-"), 
rd*(x")et ez = 16% log? 2 f*(a-") + (4 — 44 log 2)d*(2-*). 


(11) 


THEOREM 3. The quotient function 
gs(x) = iF (3, 3, 1, 1 — 2")/F(3, 3, 1, 2") 


has the following properties: if x # 0,2" # 1,2 # ~, then (1) gs(x) ts regular, (2) 
S(gs(xz)) > 0, (3) g(x) ¥ 0; if x = x(q) is the inverse function and if ¥(g) > 0, 
then (4) x(g) is single-valued and regular, (5) x(g) # 0 and {x(g)\}* # 1; moreover, 
(6) the axis of reals is a natural boundary for x(g), and (7) x(q) is a fuchsian function. 

The proof of this theorem may be accomplished by the same means as before. 
For that purpose we need the relations 


g(x) Jeno = gal) + 2n, 


_ (4k + Igu(x) + 8k? Bs 
(12) gi(x)),* = “Sis - a (k = 0,1,2,---,n—1), 





g3(xz) = n[rix(1 — x")}[F(3, 3,1, 2%). 


The theorem may easily be established, however, by using the properties of the 
function T(z) defined at the beginning of the first section. The propositions are 
obtainable by the direct substitution of z = x" in T(z). The general map is 
essentially the map of the cut z-plane by T(z)" except that it here takes 2n of 
the triangular regions to constitute a fundamental region for the group generated 
by the substitutions (12a) and (12b) and their inverses. 


5. Extensions of the Landau theorem. Let a and a, # 0 be two given 
constants. In this section we shall use F(x) to denote any function which is 
regular in the neighborhood of the origin and which has there the form F(x) = 
Q + ar + ax? 4+ .---, 

Landau’s theorem" states that there exists a number depending only on do 
and a, say R(ao, a;), such that F(x) has in or on the circle |z| = R either a 
singularity, a zero, or assumes the value 1. Carathéodory™ showed that the 


12 See Klein-Fricke, Theorie der elliptischen Modulfunktionen, vol. 1, p. 273. 
13 Landau, footnote 3. 
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least possible value of R(a9, a:) for this theorem is ¢(a9, a;) = 0 if a7 = 0 or 
a = 1; otherwise 


23 (T(ao)) 


(40, 01) = ae) | 


where T(z) is an arbitrary branch of the quotient function defined in §1 of this 
paper. Landau’s theorem implies the existence of a number M depending only 
ON do, 4), a1, @2, --- , ae, Where a; are distinct finite numbers and k = 2, such that 
F(x) in or on the circle |2| = M either has a singularity or assumes a value a; . 

Let g = g(x) be a general symbol for any function having the following proper- 
ties: if x = a; (¢ = 1, 2,---, m), where m = 3 and the a; are distinct finite 
numbers except in the case of a, which may be ~, then (1) g(x) is regular, (2) 
S(g(x)) > 0, (3) g’(x) ¥ 0; if x = Ag) is the inverse function and if ¥(g) > 0, 
then (4) h(g) is single-valued and regular except for simple poles [if a,» = ~, 
then h(g) is to have no poles in {(g) > 0], (5) h(g) # ai; moreover, $(g) = 0 
is a natural boundary for h(g). 

If an = *, we shall use g = g(x); otherwise we put g = g*(xz). gi(x) isa 
special case of g = g(x), where the points a; are the n n-th roots of unity and ~. 
The same is true of g = gs(x), where the points a; are the n n-th roots of unity, 
O and «©. ge(x) and g3(x) are special cases of g = g*(x), where the points a; 
are the n n-th roots of unity and 0 in the first case, and the n n-th roots of unity 
in the second. 

If we set 





u*g(x) + w 
aa (a) = Ga) + w’ 
where w = e¢***/3, then ¢(x) has properties similar to those of g(x) except that 
S(g) = Ois replaced by | ¢| < 1. 
Let (a, @1, a1, a2, «++ , @m), or more briefly ¢(a9, a,, m), be the least possible 
value of M(ao, a, a1, a2, --- , @m) for the extended Landau theorem. 
THEOREM 4. Let ao and a, # 0 be two given constants. Let $(a), a,, m) = 0 
if ag = a; (i = 1, 2, --- , m); otherwise let 


23 fg(ao)} _ 1 — | tay) |? 
}ai|-|g’(ao)| | a| = | (ao) | 








(a>, a, m) = 


Every function F(x), regular in the neighborhood of the origin and there defined by 
F(x) = ao + az + --- ,in or on the circle | x | = o(a9, a1, m) either has a singu- 
larity or assumes one of the o; values. Moreover, the above formula for 
(ao, a,, m) ts independent of the branch used for g(a). 

CorROLLARY 1. (ao, 1, m) = | a, | (a9, a, m). 

If a9 = a; (i = 1, 2, --- , m), the theorem is obvious, so we shall henceforth 
suppose that this is not true. The equality in the theorem is readily established 
by the use of the relation (13). The independence of the branch is established 











t 
t 
| 
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rq 
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by the supposition that K(x) and L(x) are two branches of g(x). Then they are 
related by K = (aL + b)/(cL + d), where ad — be = 1, a, b, c, d are real and 
S(L) 2 0 maps upon 3(K) 2 0. 

We choose an arbitrary branch of g(x) and form G(x) = g(y), where y = F(z). 
From the hypothesis on F(x) there exists a positive number y such that for 
|a2| < n, F(z) is regular and F(x) # a;. Hence G(z) is regular and $(G(z)) > 0 
for |x| <». Hence 


(14) G(x) = g(ao) + ag’(ao)a + ---. 
We define 
_ G(x) — g(ao) 
G(x) — g(ao)’ 
where §(ao) is the conjugate imaginary of g(a). It is easy to show that H(z) is 
regular and | H(x) | < lfor| xz! < y. Substituting and expanding, we obtain 


(15) , H(x) 


a, g’ (a) 
(16) H(z) = —~——, t+:::. 

i Slo(as)) ” 
Applying the Cauchy inequality for the first derivative and simplifying, we 
obtain 
(17) < 2Slola0)] 

| a1 | | g’(ao) | 

We observe that in the proof thus far we have made use of only the first three 
properties of g(x) and hence the result (17) is true for any function possessing 
those properties. 

To complete the proof, we shall exhibit a function F(x) of the required form, 
which is regular for | x | < ¢, and which there stays away from a, ag, --- , @m. 
We do this by defining H(x) by the first term of the series (16) and then G(x) 
by (15). We use the inverse function of g = g(x) and form 


(18) F(x) = A[GQ)]. 


This function F(x) is easily shown to have the required properties and hence 
the theorem is established. The same theorem is true if we use g = g(x), with 
the exception that when we form equation (18), F(z) will have simple poles in 


| x | = ¢, and hence the radius in this case is the least possible in a restrictive 
sense. 

Similar theorems are obtainable by the use of ao, a, --- , @g as g + 1 given 
constants. 


Coro.uary 2. Let (a9, a1, n) = Oif aj ¥ 1; otherwise let 





1 — | t a \2 
giao, 4, n) = — Jt 0) | . 
| | + | ty (ao) | 
Then F(x) in or on the circle | x | = ¢, either has a singularity or takes on an n-th 


root of unity value. 
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Coro.uary 3. Let ¢2(ao, a1, n) = Oif ao = Oor a} = 1; otherwise let 


1 — | f(a) |? 
| ay | - | to(ao) | 


g2(d, a1, n) = 





Then F(x) in or on the circle | x | = ¢2 either has a singularity, a zero, or assumes an 
n-th root of unity value. 
Corouuary 4. Let $3(a0, a1, n) = 0 if a} = 1; otherwise let 





1 — | t3(ao) |* 
$3(do, a1, n) = ; a( 0) . i° 
| ay | - | t3(a) | 
Then F(x) in or on the circle | x | = $3 either has a singularity or assumes an n-th 


root of unity value. , 
We observe that in the third and fourth corollaries the radii are not necessarily 
the definitive radii, since these two corollaries come under g = g*(z). 
Corotuary 5. Let s(a9, a1, n) = 0 tf ag = O or ay = 1, and otherwise let 





2S {gal 
da(do, a, n) = —~'% a} 
| ai | = | g4(@o) | 
Then F(x) in or on the circle | x | = oy either has a singularity, a zero, or assumes an 


n-th root of unity value. 
Moreover, in this last corollary it is easy to show that 


— 
$4(do, G1, N) = Gulag, nag” ay, 1). 


If we use the identities given in the preceding three sections we obtain the 
following 
THeoreM 5. LetO0 <p <1. Then 


lim ¢i(p, 1, n) = lim ¢3(p, 1, n) = 1 — p’, 


lim ¢e(p, 1, n) = lim ¢,(p, 1, n) = — 2p log p. 


Letl1<p<o. Then 


lim ¢i(p, 1, n) = lim ¢(p, 1, n) = 2p log p, 
lim ¢2(p, 1, n) = lim ¢;(p,1,) = p? — 1. 


no n—-2 


Let lim $,(ao, a;, n) = Wi(ao, a:),7 = 1, 2,3, 4. Then in the above theorem 


no 


we observe that 


vip, 1) S vilp, 1) S a(p, 1). 


TueoreM 6. Let | a9| # 1. Then for |x| S lao, a1), F(x) either has a 
singularity or a point where | F(x) | = 1. 














FUNCTIONS ARISING FROM DIFFERENTIAL EQUATIONS 649 


The following numerical values are for purposes of comparison. 


(0, 1,3) = 2.58,  ¢3(0,1,3) = 3.25, (0, 1, 3) = 0, 


¥(0, 1) = 1, ¥3(0, 1) = 1, ¥.(0, 1) = 0, 
:(3, 1,3) = 2.27, — $3(3, 1,3) = 2.87,  o4(3, 1, 3) = 1.49, 
v(3, 1) = 3/4, ¥3(3, 1) = 3/4, ¥4(3, 1) = log 2. 


UNIVERSITY OF ILLINOIs. 











ON CERTAIN EQUATIONS IN RELATIVE-CYCLIC FIELDS 
By LEoNaRD CARLITZ 


1. Introduction. Let F be a quite arbitrary field—the characteristic may be 
0 or some prime p. Let W be a field containing F such that W/F is cyclic of 
relative degree k. The group of W/F is generated by the substitution S: if a 
is some quantity in W, we shall use the notation a to denote the result of oper- 
ating on a with S. If then a, 8 are assigned elements of W, the equations 
which we shall study are 


(1.1) gS = at 
and 
(1.2) nS = an + B; 


it is of course supposed that £ and 7 also are in W. 

Suppose W = F(#), that is, W is generated by adjoining 3 to F, where 3 
is a root of f(#) = 0, and f(z) is a polynomial with coefficients in F and irreducible 
in F. It is convenient to assume that the coefficient of the highest power of x 
in f(x) is unity. Let a = g(#), where g(x) is a polynomial with coefficients in F. 
Then we show that (1.1) has a non-trivial solution if and only if 


RQ, f) = 1; 


here R(g, f) is the resultant of the polynomials g and f, and may be calculated 
by means of the division algorithm. If g satisfies certain conditions, a theorem 
of reciprocity for (1.1) may be stated; in particular, if F is a finite field, this 
reduces to a known theorem (see §5). 

As for equation (1.2), if a is such that (1.1) is not satisfied, then (1.2) has a 
unique solution. If, however, (1.1) does admit of a non-trivial solution, then 
we may assume a = 1, and our equation becomes 


(1.3) s&= & + B. 


If now we put 8 = A(#), where h(x) is a properly chosen polynomial in F, then 
we prove that (1.3) is solvable if and only if the coefficient of x*~' in h(x) f’(x), 
reduced modulo f(x), is zero; here f’(x) denotes the derivative of f(x). 

In §4 some properties of the solutions of (1.3) are derived. Finally in §5 
we assume F to be a finite field and the result for (1.3) as well as for (1.1) is seen 
to reduce to a known theorem. 


Received October 6, 1936; presented to the American Mathematical Society, April 11, 
1936. 
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2. The equation (1.1). For a’ as defined above, we note that 
(aB)s = ass, (a + B)S = a + BY 


where a, 8 are arbitrary elements of W. For \ in F, \*° = \; conversely, if the 
substitution S leaves some element \ of W unchanged, then \ must be in F. 
For arbitrary a in W we may say only 


(2.1) a* = a. 


We may evidently assume in (1.1) that a = 0. We now define the following 
function of a: 


(2.2) xX = x(a) = aaS +++ a * = gitst--- +8", 
Thus it is evident that 


1+S+--.+s*7 


s B+s'+--- +8" _ a , 


=a 


by (2.1), so that x8 = x, and therefore x isin F. Clearly if a # 0, then x # 0. 
Note also that 


x(a@B8) = x(a)x(8), 
x(1) = 1, x(A) = A* 


for a, 8in W, Xin F. If now we assume that (1.1) has a non-trivial solution 
(¢ # 0), then 


& = oSt = q!t8z, 

i = oS = gitsts's 

vata qits+ +++ +8 = xé, 

so that x = 1; that is, a necessary condition that (1.1) be solvable in W is 


x(a) = 1. 
To show that this condition is also sufficient, consider for arbitrary 8 the sum 


k-1 
2.3 A= s? .—(14+8+ +--+ +8? >: 
(2.3) du Ba 
Then applying S: 
AS = > BE g-(S48"+ - «+ +8) 
0 


k-1 
OPO: ass +e +8! 
a : BS” gg 148+ + ++ +87) 
0 


k 
_ er. 4 
a>, Ba (14+S++-- +8774) 
1 


k-1 J ; " 1 ) 
2 —(1+S8++- + +87") jak a 2 ; 
” X is * (25 
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that is, 


(2.4) AS— aA =8 (— - 1), 
x(a) 
Hence for x(a) = 1, A satisfies our equation (1.1). It remains to show that 


for properly chosen 8, A # 0. Now if A = 0 for all 8, in particular it vanishes 
for all 


B* (¢=0,---,k—1); 
substitution in (2.3) leads to 
k-1 
: (Bi)S? gq A+S8+ --- +87 = 0 (¢=0,---,k— 1). 
j=0 


In other words, the set of linear equations 
k-1 
> (6)* y =0 (¢=0,---,k—1) 
7=0 
has a non-trivial solution, and therefore the determinant 
| (6*)s"| = 0 (i,j = 0,---,k— 1); 


that is to say, the relative discriminant of 8 vanishes. Thus by properly choos- 
ing 8, A as defined by (2.3) is different from zero. We have therefore proved! 
THEOREM 2.1. A necessary and sufficient condition that (1.1) have a solution 
other than & = 0 is furnished by x(a) = 1, where x(a) is defined by (2.1). 
We may now derive the criterion mentioned in the Introduction. For f and 
q as defined in $1, we have 


a=g(d), aS = g(d5), ---, 
so that 
x(a) = a@it8t-- +8 
= g(d)g(9) «++ g(9*"). 
Now assume the factorization 


g(x) = AJ] (x - »), din F, 


(2.5) 


in some field W, containing W. Then by (2.5) we have the factorization 
x(a) = A*ITIN( — w) 
= R(g(2), f(2)), 
where R(g, f) is the resultant of g and f. Applying Theorem (2.1), we now have 


(2.6) 


1 Cf. D. Hilbert, Jahresbericht der Deutschen Mathematiker Vereinigung, vol. 4 (1894-95) , 
pp. 271-272. 
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THEOREM 2.2. Equation (1.1) has a solution in W other than & = 0 if and 
only tf the resultant of g and f is unity: 


Rg, f) = 1. 


If — ¥ 0 is a particular solution of (1.1), then the general solution is XE, \ arbitrary 
in F. 

We may suppose R(g, f) defined by (2.5) and (2.6). It is assumed that f is 
primary, that is, the coefficient of the highest power of x is unity. More gener- 
ally for f = II(x — #), we write 


(2.7) Rg, f) = I g(8). 


From (2.7) it is clear that g = h (mod f) implies R(g, f) = R(h, f). Also for 
\ “constant”, R(A, f) = A* by (2.7), where k is the degree of f. For arbitrary 
g, h we have R(gh, f) = R(g, f)R(h, f). Thus for f, g primary of degree k, | 
respectively, it follows that 


(2.8) RG, f) = (—1)"RG, 9). 


It is now clear how R(g, f) may be calculated by means of the division al- 
gorithm, and thus by (2.6) x may be evaluated. The question arises whether 
(2.8) may be interpreted as a reciprocity relation for x. Apparently this cannot 
be done in general. If, however, the roots of g (assumed irreducible in F) ad- 
joined to F generate a cyclic super-field of F, then clearly R(f, g) has the same 
interpretation as R(g, f) and a reciprocity theorem may be stated for x. (Cf. §5.) 


3. The equation (1.2). Assume first that x(a) # 1; then (2.3) implies 


: x(a) A y = x(a) A 
(3 —1 “x(@) — 1 +6, 





so that in this case (1.2) has the solution 


Further, this is the only solution, for 
m=an+B, 1 =amn+8 
leads to 
(m — m2)S = a(m — m2), 


and since x(a) # 1, Theorem 2.1 implies m = 1. 
We may therefore suppose that x(a) = 1; then there exists a y # 0 such that 
a= 7", 
In (1.2) put » = yf: the equation becomes 


=o + Br. 


ff 
i 
| 
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We therefore consider in the following the equation (1.3): 
“= + B. 


In place of x(a) we now define the quantity p = p(8) by 


k-a 


(3.1) e(8) = B+ A5+---+ 8 ; 
then 
ps = BS + pS +--+ + BS =p, 
and (8) lies in F. Evidently p(8) has the properties 
e(B + y) = p(B) + p(y), 
p(B) = Xp(8), p(r) = ha, 


(3.2) 


for \ a quantity in F. 
From (3.1) it is easily seen that a necessary condition that (1.3) have a solu- 
tion is p(8) = 0. To show that this condition is also sufficient, consider the sum 


k-1 : 
B= > a B;, 
where 
(3.3) Bj = B+ BS + --- + 8°, Bo = B. 


Then applying 4, 


k-1 
BS = Zz. as" (Bis: — B) 


= DF a (6; — 8) + a (& — 8) 


= B — Bola) + ap(8), 
for by (3.3), 


B. — B = BS + --- + BY = p(B) = p(8). 
Hence the identity for arbitrary a, 
(3.4) Bs — B = ap(8) — Bp(a). 
Let us now assume p(8) = 0; then we have 


Bs — B = Bp(—a). 


If then we can find an a such that p(—a) = 1, B will satisfy (1.3). We shall 
prove slightly more: 


saan 
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THEOREM 3.1. For arbitrary \ in F, there exists ay in W such that 
p(y) = 
Assume that p(y) = O forall yin W. Then in particular for the quantities 
y' (¢ = 0,---,k— 1), 


we have p(y‘) = 0, that is, 
ys’ = 0 (§=0,---,k—1), 


whence exactly as in §2, the relative discriminant 
| y8"| =0 (i,j = 0,---,k — 1). 
Therefore for some y in W, p(y) = » # 0. By (3.2), 


r(a)=n o()=> 


which proves the theorem. As we have already noticed, this implies the fol- 
lowing 

THEOREM 3.2. <A necessary and sufficient condition for the solvability of (1.3) 
is furnished by p(8) = 0, where p(B) is defined by (3.1). 

To derive the criterion stated in the Introduction, we need the following easily 
proved identity: 


(3.5) f(z) _ >> i 


Since 


1 1 Os 4 ent on 1 
z-—-v8 2«' 2x 2” z™r — 9’ 


f'(z) ems 
=~ = ; PAL) “ 0(@) vee = 0" 7 —; 
T@) + tot += =>- a", 
and therefore 
af'(x) = or’ me. (mod f(.x)) 
i zr— 
= , pms? (kt + ---) (mod f(x)), 
d 
the dots indicating powers of x of exponent < k — 1. Comparing coefficients 


of x*' on both sides of this congruence, we see that p(d”) is the coefficient of 
z*— in the product x”f’(x) reduced modulo f(x). If, then, h(x) is a polynomial 
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with coefficients in F, it follows upon application of (3.2) that p(h(#)) is the 
coefficient of x*~' in the residue of h(x)f’(x) (mod f(x)). Thus we have 

THEOREM 3.3. Jf 8 = h(#), where h(x) is a polynomial in F, then p(8) is the 
coefficient of x*— in h(x)f'(x) reduced modulo f(x). In particular, (1.3) is solvable 
in W if and only if that coefficient is zero. If n is a particular solution of (1.3), 
then the general solution is » + X, \ arbitrary in F. 


4. Some properties of the sum B. We shall use the fuller notation (a, 8) 
in place of B: 
k-1 
(4.1) (a, B) = Do a’B;, 
0 
where 8; is defined by (3.3). It is evident that 
(a + 8B, 7) = (a, y) + (8, ¥), 
(4.2) (a, B+ 7) = (a, B) + (a, Y), 
(Aq, B) = Aa, 8) = (a, AB), 


for \ in F. We derive a formula connecting (a, 8) and (8, a). If in (3.4) we 
interchange @ and 8 and add, we have 


(a, B)s + (8, a)s = (a, B) + (8, a), 


so that the sum (a, 8) + (8, a) isin F. To get an explicit expression we proceed 
as follows. The product 


p(a) p(B) = = a 3° 
(4.3) = fe + fe ~ fe 
But ; ; 
(4.4) > a’ p* = x B*. z a’ = (8,a), 
by (4.1) and (3.3). Similarly 
(4.5) X as = Lak Y 8 = (8), 
while : | ; 
(4.6) Xo 8" = & (a8) = was). 


Combining (4.3), --- , (4.6), we have at once 


(4.7) (a, 8) + (8, ~) = p(aB) + p(a)p(8), 


which indicates explicitly that the sum in the left member is in F. 
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Again from (3.4), 
(4.8) (a, 8)S — (a, B) = ap(8) — Be(a), 
for arbitrary a, 8. In particular for p(8) = 0, 
(a, 8)S — (a, B) = —Bp(a). 
Indeed, for 8 = y5 — y, by (3.3), 


(a, 8) = Sa*(ys"" — y) = p(ay5)-— ye(a); 


that is, 

(4.9) (a, ¥8 — y) = play’) — pla). 
If also p(a) = 0, then 

(4.10) (a, 8) = play). 


To determine (a, 8) for p(a) = 0, 8 arbitrary, we apply (4.7) and (4.9). For 
p(a) = 0, (4.7) becomes 


(a, 8) = p(aB) — (8, a). 
Put a = y5 — y; then by (4.9) 
(8, a) = p(Bys) — ype(8), 
so that 
(4.11) (a, 8B) = —p(By) + ve(8). 
If in (4.10) we put a = 8, we have 
(4.12) (8, 8) = p(By‘), 
while from (4.11), 
(4.13) (8, 8) = —p(6y). 
However, 
p(Bys) = p((v5 — y)y5) 
a 6 — 7) 
= p(y? — 7'**) 


= p((y — v5)y) = —p(6y), 


so that (4.12) and (4.13) are identical. 

We may now evaluate (8, 8) for arbitrary 8. Since the case p(8) = 0 has 
been disposed of, we assume p(8) ¥ 0; in particular, let p(8) = 1. Suppose also 
that p(6) = 1. Then by (4.8), 
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put y = (8,5) sothat 8 — 6 = yS — y. We have on the one hand 
(8 — 6, 6) = (v8 — v, 6) = y + p(y). 
But on the other hand, 
(8 — 6,6) = (8, 6) — (6,5) = y — (6, 4). 
Comparing the two expressions for (6 — 6, 6), we see that 
(4.14) (6, 6) = —p(y8), 


where y = (8, 4) and 8 is arbitrary except for p(8) = 1. 
Finally we consider the expression 


(4.15) I = p(a)-(8, y) + p(8)-(y, a) + ply): (a, 8). 
By (4.7), 
p(B) (y, a) + ply)(a, B) 
(4.16) ; 
= p(a)p(8)p(y) + p(aB)p(y) + (8)(y, a) — (8, a)p(y); 

but 

p(B) (y, a) sit p(y) (6, a) = (yp(8) ai Bo(y), a) 
(4.17) =o ((B, y)s _ (6, ¥), c) by (4.8), 


pia(B, vy)} — pla)(B, v). 

Substituting from (4.17) in (4.16), we have 

p(a)(8, y) + p(8)(y, a) + ply)(a, B) 

= p(a)p(B)p(y) + plaB)p(y) + pla(s, v)}, 


(4.18) 


so that in particular I as defined by (4.15) is in F. 


5. Application to finite field /. Let F be a Galois field GF(p") of order p", 
where p is an arbitrary prime. Then for f(x) an irreducible polynomial in 
GF (p"), a cyclic extension W of GF(p") is generated by adjoining a root 8 of 
f(8) = 0 to the Galois field. We remark that W may also be defined as the 
field formed by the complete set of residues modulo f(x). For the present case 
it is convenient to use the latter interpretation so that we shall speak of con- 
gruences (mod f(x)) rather than equations in W. Note that for F = GF(p"), 
W is “absolute” cyclic, that is, W is cyclic relative to GF(p) as well as relative 
to GF(p"). Clearly W is also a finite field. The substitution S that generates 
the cyclic group of W/F may be identified with the operation of taking the 
p"-th power: 


as — qq?” 





et ne A me me 
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however, other interpretations are possible. For example, if as above the 
relative degree of W/F is k, we may take 

as am gre 
or generally 


s pr 
e= € ’ 


where r is prime to k. 
For brevity we limit ourselves to the first definition of S; then our equations 
(1.1) and (1.3) become 


(5.1) X? =gX (mod f) 
and 
(5.2) xX” =X+h (mod f), 


respectively, where g, h, X, Y are polynomials with coefficients in GF(p"). 
For the equation (5.1) it is customary to use the notation {g/f} in place of x, 
so that we have 


(5.3) {8\ = gitp™+ +++ + pm(k-) (mod f). 
Then by the proof of Theorem 2.2, 
{8} = Rg, f). 
/ 


By the remark at the end of §2, a reciprocity theorem for {g/f} may be stated. 
Indeed, for g irreducible in GF(p"), the set of residues (mod g) form a field that 
is cyclic relative to GF(p"). Hence we have the following? 

THeoreM 5.1. If f and g are primary irreducible polynomials in GF(p") of 
degree k and l, respectively, then 


ta = 


where {g/f} is defined by (5.3). 
For the equation (5.2) we have 


(5.4) p(h) mhth 4... nro” (mod f); 


then Theorem 3.3 implies* 

THEOREM 5.2. p(h), as defined by (5.4), is equal to the coefficient of x*-' in 
the product hf’ reduced (mod f). The equation (5.2) is solvable—in polynomials 
with coefficients in GF (p")—af and only if that coefficient vanishes. 

INSTITUTE FOR ADVANCED Stupy AND DuKE UNIVpRsITY. 
2? F. K. Schmidt, Erlanger Sitzungsberichte, vols. 58-59 (1928), pp. 159-172. 


3 L. Carlitz, this journal, vol. 1 (1935), pp. 164-168; also Bulletin of the American Mathe- 
matical Society, vol. 41 (1935), pp. 844-846. 














ON FACTORABLE POLYNOMIALS IN SEVERAL INDETERMINATES 
By LEronarp CARLITz 


1. Introduction. In this paper we consider a class of polynomials in several 
indeterminates with coefficients in a Galois field GF(p"), such that each poly- 
nomial may be completely factored into a product of linear factors in some Galois 
field GF(p™"’), say. For the case of a single indeterminate a body of theorems" 
exists, and the purpose of this paper is to extend these theorems, whenever pos- 
sible, to the case of several indeterminates. As will be seen in several cases, 
certain theorems are capable of extension, but the proof for the case of a single 
indeterminate is no longer applicable, and new methods become necessary. 
This is true in particular of the formula for the product of all (factorable) poly- 
nomials of fixed degree. Again, in the case of a single indeterminate, the form 
of a polynomial is known explicitly; in the case of several indeterminates, the 
definition is in terms of an intrinsic property, and thus it seems necessary to 
deal first with irreducible polynomials and from them go on to arbitrary poly- 
nomials. 

In the case of polynomials in a single indeterminate 2, as is well known, the 
quantity 


(1.1) re — 2 


is fundamental.? In the extended case this is replaced by a certain determinant. 
Thus for example, for two and three indeterminates, we have 








, \1 r y z | 
0 U | } 
Y 1 gms yP"* gens 
(1.2) 1 gr”? yr , |, 
1 yPns yrns gpmne | 
1 pms yrne : | 
‘ 1 Pn" yrs gprs | 


respectively. Certain formulas in the case of a single indeterminate carry over 
to the extended case by merely substituting the proper expression (1.2) for (1.1). 
In particular this is true for the product of irreducible polynomials and the 
product of all polynomials of fixed degree. 

The number of (primary) irreducible polynomials of degree s in a single 
indeterminate, with coefficients in the GF(p"), is determined by the familiar 
expression 


v(s,p") = Do wld) p™, 
s=dé 
Received October 12, 1936. 
1 For the classic theorems, see L. E. Dickson, Linear Groups, 1901, pp. 3-54. 
2 Dickson, loc. cit. 
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where u(6) is the Mébius u-function. Now the number of irreducible poly- 
nomials in x, y (y actually appearing) will be shown to be ¥(s, p*"); for k indeter- 
minates (a fixed one appearing in each polynomial) the number is ¥(s, p*"). 
Similarly, the number of primary polynomials of degree m in a single indeter- 
minate is p""; for k indeterminates the number of factorable polynomials (in 
which 27 actually occurs) is p"™. 


2. Some definitions. According to the definition above, a polynomial 
M = M(x, -:- , xx) is factorable provided it can be written in the form 


(2.1) M(m, +--+ ,%) = II (aio + ain 21 + +++ + x Xe), 

i=1 
where the a;; lie in some Galois field, GF(p""’), say. The coefficients of M are 
all assumed to be in a fixed GF(p"). The degree of M is evidently m: we write 
m = deg M. If 27 actually appears in M, then M is called primary provided 
the coefficient of x? is unity. More generally, let us write in place of (2.1) 


m 


M= II (aio + +++ + ain; Tui), Il ain; F 0; 

i=1 i=1 

then M is primary if and only if the product Ta; = 1. From the definition 

it follows readily that the product of two primary polynomials is itself primary.* 
It is occasionally convenient to introduce an additional indeterminate xo. 

Our polynomials then become homogeneous; (2.1) now becomes 


M(x, mee * » Xx) = T1(eioxo + aati + -°+ + in). 


However, in counting the number of polynomials of a certain set, or in reckoning 
the degree of a polynomial, we shall always suppose xo = 1; thus 2ox;22 is of 
degree two. 

In the second place it is convenient to distinguish those polynomials of degree 
m that actually contain the term x7. We shall generally denote such a poly- 
nomial by M*. Similarly if g(m) denotes the number of polynomials M of 
degree m that have a certain property, then g*(m) will denote the number of 
M* of degree m having the property in question. 


3. Irreducible polynomials. Assume P = P(x, --~ , 2), a factorable poly- 
nomial of degree s with coefficients in GF(p") and irreducible in GF(p"). Then 
by (2.1) 


(3.1) P = (ao + mati + +++ + cnri)A, 


3 An equivalent definition is the following: M is primary if and only if the coefficient 
of the monomial of highest rank occurring in M is unity. A monomial 2}! --- 2)* is of 
higher rank than ait tee a}k provided the first non-vanishing difference in the sequence 


tp — Jey *t* yt: — ji, is positive. 
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where a; are in some GF (p""’), and A is a polynomial with coefficients in GF (p™”’). 
If we replace each coefficient in (3.1) by its p"-th power, P remains unchanged, 
from which it follows that 


»” »™ »” 
P=(ah +afamt-++ tat a)A’, 
so that P is divisible by a?” + -++ + a?"x,. Similarly P is divisible by each of 
« nd ni ni aa 
(3.2) ap + a} r} + =o + a}. Ti; (j = 0, FE ith -). 


Now there is only a finite number of distinct linear forms (3.2); let the number 
bet. Clearly the product 


t-1 : 
G = Th er" +... + ef") 
7=0 
is in GF(p"). On the other hand, P is evidently a multiple of G. Thus P = G 
ands = t. Hence we have the factorization 


e—1 - 


(3.3) P= [J (af" +... + a?'x). 
j=0 
Note therefore that if P contains 2; at all, it must contain xj, so that by the 
definition at the end of §2, P = P*. 
We now determine s in terms of the a; appearing in (3.3). If a is contained 
in GF(p”) but no GF(p™) for 1 S e < f, we shall call f the degree of a relative 
to GF(p"): f = deg a. Then for f; = deg a; (j = 0, --- , k), we prove that 


(3.4) s = Ifo, fis eo Sul, 


the least common multiple of fo, fi, --: ,fz. Indeed this follows immediately 
from the fact that s is the number of distinct forms (3.2), that is, s is the smallest 
positive ¢ such that 


(3.5) an” = a (j =0,---,k). 


Since for fixed j, f; is the smallest positive ¢ for which (3.5) holds, it is evident 
that s is the least common multiple of fo, fi, --- , Sx. 

If now we start with k + 1 quantities ap, a, +++ , a, of degree fo, fi, «+ , Sx, 
respectively; define s by (3.4), and form the product in the right member of 
(3.3), the polynomial thus formed is in GF(p*), and further is irreducible in 
GF(p"). For if we assume that it equals AB, A and B polynomials in GF(p"), 
we may show that either A or B coincides with P: Assume A divisible by 
ao + +++ + cya, ; then by the argument used at the beginning of this section, 
A is divisible by each of the linear forms (3.2), and therefore A is identical with 
P. Thus we have proved the following 

THEorEM 3.1. A (factorable) polynomial P of degree s is irreducible if and 
only if it satisfies (3.3) and s satisfies (3.4). 

In order to determine the product of all irreducible polynomials of fixed degree, 
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we make use of a formula due to E. H. Moore. It is convenient to bring x» 
in at this point. In the (homogeneous) linear form 


A = aoXo + anti + +++ + arr, 


where all the a; are in GF(p™), assume the non-vanishing a; of greatest subscript 
is equal to unity; that is, assume that one of the following mutually exclusive 
cases obtains: 


a, = 1; 

a, = 0; ou. =1; 

On, = a, = 0, a2 = 1; 
ar= = a, = 0, ao=1 


Then the product of all \ satisfying one of these conditions is 
(3.6) I], -_ D*(xox1 eee Xx) = | 2" | (i, j iad 0, see , k), 


by the formula referred to. On the other hand, in the product of the linear 
forms \ group together all forms for which the least common multiple of 
fo, fi, «++ 5 Se = t, some fixed divisor of s. By Theorem 3.1, it is evident that 
the product of these forms is identical with the product of all irreducible poly- 
nomials of degree ¢: this product will be denoted by @(t). Comparison with 
(3.6) leads to the fundamental relation 


(3.7) D(xor, --- xx) = T] OC). 
tis 
From this @(s) is determined by means of the well-known inversion formula: 
(3.8) O(s) = J] {De(xor: --- xx)}*. 
s=efsf 


As for @*(s), the product of the irreducible polynomials of degree s actually 
containing x, (and therefore, by the remark following (3.3), necessarily con- 
taining r;), we have the formulas 





D*(xox1 eee rE) > m 
(3.9) rege, II @*(t) 
and 
ie co [De(xoxr «+ - is 
(3.10) O*(s) = IL \ DG al ; 


This proves the 
THEOREM 3.2. If O(s) = O(s; ror --+ 2%) denotes the product of the primary 
irreducible (factorable) polynomials of degree s, and O*(s) = O*(s; ror, +++ re) 
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the product of those containing x,, then O(s) and ©*(s) are determined by (3.8) 
and (3.10), respectively. Further, 


: G ao a nee : 
(3.11) a oe Hs: xox rx) 





O(s; 2 --- te) 


To determine ¥,(s, p"), the number of irreducible P of degree s, it is only 
necessary to compare the degree of the two members of (3.8). Thus we have 


immediately 
‘ 1 ' vee 
(3.12) vis, p") = = > ule) (p™! + pmo 4... 4 pre), 
s=ef 
and by (3.10) or (3.11), 
(3.13) vi(s, p") = ¥ ule) p. 
$8 


s=ef 


Therefore it follows that 
(3.14) vi(s, p") = ¥i(s, p™*) = ¥(s, p™), 


the well known expression for the number of irreducible P in a single indeter- 
minate, but with coefficients in GF(p™). We may state 

THEOREM 3.3. The number of irreducible polynomials is determined by (3.12); 
the number of those containing x, by (3.13); the latter is identical with the number 
of irreducible polynomials in a single indeterminate, with coefficients from the 
larger field GF (p™). 


4. An identity for ¥. We now count the irreducible P in a different way. 
For simplicity suppose k = 2: the reasoning is quite general and applies without 
change for arbitrary k. We consider those P = P*(z, y) actually containing y; 
then by Theorem 3.1, 


s-—l 
(4.1) P= JI (a + pr x + y), 


j=0 
where 
e = deg a, f = deg £, s = fe, f]. 
We next determine, in the GF(p™), the number of @ of degree e relative to 
GF(p"). Since each such @ gives rise to a polynomial* 


e-1 
Q = Qy) = IT (@™ + y), 
j=0 
irreducible in GF (p”), it follows that the number of such @ is e times the number 
of Q; but the number of Q of degree e is Y(e, p"), and therefore the number of 
ain GF(p™) of degree e relative to GF(p") 


* Dickson, loc. cit., p. 21. 
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(4.2) =ev (ep) = D wld) pm. 


e= 


Similarly the number of 8 = fy(f, p"). Finally, since the s pairs 


ar”, pe Gj _ 0, i 1) 
give rise to the same P of (4.1), we conclude that 
(4.3) s¥2(s, p") = sHsp") = > ewe, yp”) - SWS, vp”), 

s=le,f 


the summation on the right extending over all e, f whose least common multiple 
equals s. 


THEOREM 4.1. The function V(s, p") = s¥(s, p") satisfies the identity (4.3). 
It is not difficult to prove this result directly; we need merely (4.2). By the 
Dedekind inversion formula, an identity f(s) = g(s) is equivalent to 


> f(d) = » g(a). 


ds 


Thus it will suffice to prove—in place of (4.3)—the following: 


(4.4) usp) => YL We, p)W(f, p”). 


sm sm s=[e,f] 


Now the summation conditions on the right of (4.4) 
s/m, le, f] =s 
are easily seen to be equivalent to the simpler conditions 
e/ m, f/m, 


e and f independently ranging over the divisors of m. Thus the right side of 
(4.4) equals 


(4.5) De (ec, p") DE WS, p”). 
em fim 
But since > W(e, p") = p””, it follows that (4.4) is an identity, and this in turn 


implies the truth of (4.3). 
As will be shown elsewhere, the identity (4.3) is easily extended—in several 
directions. For the present we note the generalization 
(4.6) W(s, p”) = 2 We, : p”) a We, p™*), 
s=([¢, ek] 


where f = fi + --+ +f, , and the summation extends over all sets @ , -+- 
with least common multiple equal to s. 


» Ck 


5. The total number of polynomials of degree s. In the case k = 1, the result 
is immediate: every (primary) polynomial has the form 


(5.1) M =2x°+ qmart'+ --- + as, a; in GF(p*), 
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so that the number is evidently p”. For k > 1, unfortunately an explicit 
formula (5.1) is not available, and therefore other methods must be used. One 
simple method is the following. 

Define the zeta-function ¢*(w) by means of the infinite product 


(5.2) ew) = [] (1+ pe +t) 
extended over irreducible P*. Here the “absolute value” 
| Pe| = pw for s = deg P; 
and for arbitrary M, N, 
|MN|=|M|-|Nj, 


so that generally 





(5.3) |M| = p™ for m = deg M. 
Expanding the product on the right side of (5.2), we have 
2 f*(m) 
(5.4) i*(w) = >) = S 
> |M) w fond gto 


where f*(m) denotes the number of M* of degree m. We determine f*(m) by 
evaluating ¢*(w). Now (5.2) may be put in the form 


c*(w) = I] (1 _ ae) 


Il 
eam, 
—_ 
| 

.~) 
>| — 
:| 
| 
€ 


for by (3.14), Vils, p") = ¥(s, p“). Thus 





* — nk 
log ¢*(w) = > ¥(s, p™) De na 

om S = 2 » sy(s, p”*) 
foo mpome m=st 
a 2 r 1 

— > ae nkm — Ss sae — 

oa p> mpm P aad mp (aw) ? 
m=1 m=1 


and therefore 
(5.5) g*(w) = {1 — prt jt. 
Comparing (4.4) with (4.5), we have immediately 
TueoreM 5.1. The number of (factorable) polynomials of degree m containing 
xr; is 


(5.6) f*(m) = p™™. 


For k = 1, this reduces to the familiar p”™™. 








a a 
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The determination of f(m), the total number of M of degree m (not necessarily 
containing x;) is somewhat more elaborate. In place of (5.2) we now have 


1 1 
(5.7) ¢(w) = (1 + —-+-— + ), 
I] | P | tay 
the product now extending over all irreducible P. If we use the fuller notation 
¢:(w) for the ¢*(w) of (5.2), it is readily seen that (5.7) implies 

(5.8) o(w) = oi(w)st_y(w) +++ oT (w). 


On the other hand, (5.4) becomes 


aae a Sv sm) 
(5.9) f(w) = ya iM a pm 


Therefore by (5.5), (5.8), (5.9), we have 


(5.10) > am = {(1— prt) — pre)... (1 — pry 





m=0 


Now by a familiar expansion, 


=~ (gt — 1)... (gttit — 
1a-@0-@...0-qy7 = FP 


ff @—1)---¢ =} 





ge. 


In this identity put ¢g = p",t = p~"”; thus we have 
TuHeoreM 5.2. The total number of (primary factorable) polynomials of degree 
m is determined by 


nk a n(k+m—1) __ 
(5.11) f(m) = (en = 2) -(p wid 1) Pm. 


(p” = 1) Pe (p™™ as 1) 
Here again for k = 1, the formula reduces to the familiar p"”. 
By use of the functions ¢ and ¢*, it is easy to carry over to the case of k indeter- 
minates a number of theorems’ on arithmetic functions known in the case k = 1. 
However, we shall not take the space to develop these formulas. 








6. The product of polynomials of fixed degree.’ We now seek an expression 
for 


(6.1) Fim) = [J M, 

deg M=m 
the product of all (primary factorable) M of degree m. Here again the simple 
method used in the case k = 1 is not applicable. 


® See L. Carlitz, American Journal of Mathematics, vol. 54 (1932), pp. 39-50; Bulletin 
of the American Mathematical Society, vol. 37 (1932), pp. 736-744. The latter paper will 
be referred to as B. 

° For k = 1, see B, p. 742. 
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If P denotes a typical irreducible of degree s, put 
(6.2) M = P*A, P jA, 


so that P* is the highest power of P dividing M. For fixed P and e, what is the 
number of polynomials A of degree m — es not divisible by P? It is easily 
seen that this number is 

({f(m — es) for m — es < 8, 

\ flm — es) — f(m — (e + 1)s) for m — es 2 8, 


where f(m) is defined by (5.11). But by (6.2) it is evident that 
- II M = II Prem?) 


deg M=m Pye 


(6.3) Pm—es(P) = 


the product on the right extending over all irreducible P of degree s, and all 
positive e such that es S m. But the right member of (6.4) equals 


[I Perna” 
sd ’ 
> 


the summation in the exponent extending over alle S m/s. By (6.3), 


> Com—es(P) = {f(m — s) — f(m — 2s)} 
(6.5) + 2{f(m — 2s) — f(m — 3s)} + --- + 7f(m — rs) 
= f(m — s) + f(m — 2s) + --- + f(m — rs), 


where r = [m/s], the greatest integer < m/s. Thus by (6.4) and the definition 
of O(s), 


(6.6) II M= I {O(s) }4, 


deg M=m s=1 


where FE denotes the sum (6.5). 
Now on the other hand, by (3.7) 


Il (D2) fm) Il {Q(s) } J(m—en) 


j=1 essm 
I] {O(s)}* 


Comparison with (6.6) and (6.1) shows at once that 


m—es) 


m 


(6.7) F(m) = [J (Di)f, 


eu 


THEOREM 6.1. The product of all primary factorable polynomials of degree m 
is given by (6.7). 

As for ILM™*, it is not difficult to prove, in exactly the same manner that (6.7) 
was derived, the following 
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THEOREM 6.2. The product of those polynomials of degree m that contain x7 is 


rs D*(x9 21 eee re) pnk(m—j) 
° —_ 
(6.8) F*(m) II ee a5 at , 





Comparison of the degree of the two members of (6.8) leads directly to (5.6). 
Similarly from (6.7) it is possible to derive (5.11), but this is somewhat less 
immediate. The derivation depends on logarithmic differentiation of the 
identity (5.10). 


7. The least common multiple of polynomials of fixed degree.’ Here the 
method used in the case k = 1 carries over. Thus from 


D* = II O(s), 
it follows that 
D'p? ... D» = TT TI Os), 
(7.1) sie tani " 
- O(s) = [] {O(s)} i. 
esim s=1 


On the other hand, if L(m) is the least common multiple of all polynomials of 
degree m, it is evident that, for P irreducible of degree s, the highest power of 
P dividing L(m) is ne a /s|. Therefore 


L(m) = ae II {9(s)}&". 
I deg P=s s=1 
Comparison with (6:1) leads one to 
THEOREM 7.1. The least common multiple of all factorable polynomials of 
degree m is 
(7.2) Lim) = D'D* .-- D™. 


Similar reasoning yields 
THEOREM 7.2. The least common multiple of polynomials of degree m that 
contain x is 


D(x x1 eee Xe) 
L L* 
sales I D(x coe Mp ae 





8. Concluding remark. According to Theorem 5.1, f*(m) = p™™; in other 
words, the number: of polynomials M*(x, --- 2,) with coefficients in GF(p") is 
identical with the number of M(x) with coefficients in GF(p™). According to 
the last part of Theorem 3.3, ¥;(s, p") = ¥(s, p™), that is, the number of ir- 
reducible P*(x ---+ x) with coefficients in GF(p") is equal to the number of P(x) 
with coefficients in GF(P™). Thus the question arises whether it is possi- 


7 For k = 1, see B, p. 742. 
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ble to set up a correspondence between M(x) in GF(p™) and M*(a, +--+ xx) 
in GF(p") in such a way that an irreducible P(x) will correspond to an irreduci- 
ble P*(x; «++ x,), and so that all our theorems will follow immediately from the 
case k = 1. As we shall now indicate, this does not seem to be the case. 

An irreducible P*(x, y) according to Theorem 3.1 is of the form 


s—l 


(8.1) [] (a + px + y), 


where e = deg a, f = deg 8, s = [e, f]. Similarly an irreducible P(y) with co- 
efficients in GF(p*") is of the form 


(8.2) II (vy + y) + in GF (p?"*), 
where now s equals the degree of y relative to GF(p*"). 
We now attempt to identify (8.1) with (8.2)... Thus the two sets of quantities 


qr + Bp” x, yen (j a 1, rae , 8) 


must be identical, except for order. But it is easily seen in particular cases 
that this is impossible. For example, for p" = 2 = s, let 


P*(x,y) = (8 + Kx + y)(H + Hr + y) 
y+ (z+ lyt+e2+2+i1, 


where # + 3 + 1 = 0, so that # defines the GF(2*). Then P(y) = (vy + y) 
(y* + y), whence 


y= Py + Fr, > = a or. 


But these equations imply 
0+ rt = & + Vz. 


Throwing out x = 1, since it leads to P(y) = (1 + y)?, we have 


(8.3) (2+1 =9 +1. 
On the other hand, z is in GF(2*), so that 
(x +1)5 =1; 


combining this with (8.3) leads to 3} = 0. 


INSTITUTE FOR ADVANCED Stupy AND DUKE UNIVERSITY. 


8 I am indebted to B. P. Gill for this suggestion. 














EQUIVALENCE OF MULTILINEAR FORMS SINGULAR ON ONE INDEX 
By Rurus OLDENBURGER 


1. Introduction. Any p-way matrix A = (aj;;...,) of order n can be “fac- 
tored” in the form 


h 
(1’) 4=(> tus bai~-+ dat (i,j, ---,k=1,---,m), 
a=1 


where h S n”“!. Hitcheock,! using the polyadic point of view, has determined 
minimum values of A for some given numerical values of n and p. The repre- 
sentation (1’) implies that any multilinear form 


F = @jj...4 TiYj +++ & (i, ---,k =1,---,n) 


(repeated indices indicate summation) is equivalent under transformations 


(21) La = AX; , 
(22) Ys <5 bs Yj ’ 
(2,) z, = dy. 2% 9 


to the form 

R = riya +++ Za (a= 1,---,h), 
where h S n?~' and the transformations (2;), --- , (2,) are not necessarily non- 
singular. 

We shall say that the matrix (a;;..., @ai) of the form F’ obtained from F 
by applying the transformation x; = a,;7, to F, where (a,;:) is non-singular, 
is equivalent to (a;;....); we shall also say that F’ is equivalent to F. If the 
2-way matrices (dai), ---, (dax) of (1’) are all singular on their columns (a 
being taken as the row index in these matrices), the matrix A is equivalent 
to a matrix of lower order of the form (1’), where at least one of the matrices 
(Gai), --- , (dax) is non-singular on its columns. The number h of (1’) is then 
between the limits n < h < n?-'. In another paper? the author treated the 
special case where h takes the minimum value n. He obtained necessary and 
sufficient conditions for the factorability of a matrix A into the form (1’), 
where the matrices (a.;), ---, (dax) are all non-singular. The method of 


Received January 24, 1936; in revised form, June 11, 1936. 

'F, L. Hitchcock, A new method in the theory of quantics, Journal of Mathematics and 
Physics, vol. 8 (1929), p. 83. 

? R. Oldenburger, Non-singular multilinear forms and certain p-way matrix factorizations, 
Transactions of the American Mathematical Society, vol. 39 (1936), pp. 422-455. This 
paper will be denoted by N.S. 
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treatment extended at once to the case where all but one of the matrices 
(dai), --:, (dax) are non-singular. In the case where these matrices are all 
non-singular the matrices of type A are at once equivalent to each other for 
given n, p, and their associated multilinear forms are equivalent to the unique 
‘anonical form R. 

In the present paper, the treatment of equivalence of matrices of type A, 
where h = n, and one of the matrices (a@.;), --- , (da) is singular, is completed. 
Necessary and sufficient conditions are obtained for the equivalence of two such 
matrices for given n, p, and some of the associated canonical forms are derived. 
In contrast to the 2-way case the number of canonical forms for given n, p, 
where p = 3, n 2 4, is infinite. 

In the author’s Transactions paper the matrices of type (1’), where h = n, 
and (dai), ---, (dex) are non-singular, were said to be non-singular. The 
associated forms were also said to be non-singular. The matrix (1’) and its 
associated multilinear form is said to be singular and of rank’ r on the index 27, 
and non-singular on j, ---, k if (@a:i) is of rank r, and (ba;), ---, (dax) are 
non-singular. 

The development of the present paper is based on the property that two 
given singular forms, having the same p, n, and r, are equivalent if and only 
if invariant factors associated with reduced forms are equivalent under certain 
non-singular linear transformations. These invariant factors are generaliza- 
tions of the invariant factors‘ of (eA + o¢B), where A, B are two-way matrices, 
and p, o are parameters. It will be no restriction on the generality of the 
method to take p = 3. The associated forms are then trilinear. 

The theory developed here holds for any field of numbers. 


2. Invariant factors. Let a given 3-way matrix singular on one index 7 be 
denoted by 


n 
D= (> auibaiat); (@=1,---,r<n;j,k=1,---,n). 
a=l1 
The range chosen for 7 does not restrict the generality, since (a,i), a,i = 1, --+,n, 


of rank r < n can be reduced by multiplication on the right with a non-singular 
matrix to a matrix (a,;) as given in D bordered by zeros. The matrix (ai) 
is equivalent under multiplication on the right and rearrangement of the rows to 


where J is a Kronecker delta of order r. Let rearrangement of the rows in 
(ai) be simultaneously accompanied by a similar rearrangement of the rows 


3 For a treatment of ranks of this type see the author’s paper Composition and rank of 
n-way matrices and multilinear forms, Annals of Mathematics, vol. 35 (1934), pp. 622-657. 

‘For the definition of these invariant factors see L. E. Dickson, Modern Algebraic 
Theories, 1926, p. 104. 
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of (baj), (Car). The matrices (ba;), (cax) are equivalent under multiplication 
on the right with reciprocals to (da), (daa), respectively, where (Sea), (daa 
are Kronecker deltas of order n. The matrix D is hence equivalent to 


E = (> taiBna’ Baa’), 


a=l 


where (a.;) is of the form G. Let A’ = (a’,). The form associated with F is 


r n r 
. , 
(1) F= 2) taYatat Dy Dy O5:2iyj2;- 
a=l1 j=r+l i=l 
Elsewhere the author has discussed® the t-invariant factors of a matrix (aij,) 
of order » and its associated form, which are quotients of the type 
G; 
In-i50 = A > 
¥j-1 
where, except for a constant factor, G; is the g.c.d. of the j-th order determinant 
minors of the i-characteristic matrix 


M(t) = (piain +--+ + Pn Anjx). 


The p’s in M(2) are parameters. To obtain the G;, it is necessary to factor the 
j-th order determinant minors of this matrix in the given field. It is to be 
noted that since these determinants are homogeneous polynomials in p;, --~ , pn, 
if n > 2, they cannot be broken up in general into linear factors in any field 
or even when the coefficients belong to a linear algebra.6 The author proved 
that transformations 


, 
rt; = b..2%5 


tm 
On 4.2; y;2, correspond to the transformations 
, 
= Dim Pm 


on the p’s. The author also proved that under non-singular linear transforma- 
tions on y; and z, the G; (hence G;/G;_:) are relative invariants. The 7-charac- 
teristic matrix M(2) of F is the diagonal matrix 


Pi 
Po 0 


p 


M y= . , , 
® | 0 (014,411 +---+ PrOy4y,7) 





/ Ul 
(p, An + +++ + p,a,,). 
5N.S., §6. 
® Footnote p. 432, N. S., is incorrect. 
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Since M(z) is diagonal, we have 
Lemma 1. The i-invariant factors of the form F given in (1) different from 
constants are products of the linear forms 


‘ - , = , 
(2) Pry *** 9 Pry Lia, _ z Gr 41 i Pi» a 7 = | Ani Pi 


t=1 i=1 

each of these expressions occurring exactly once in the invariant factors. 

Since the j- and k-ranks of F are equal to n, none of these linear forms vanishes 
identically. The matrix of these linear forms’ is exactly G. 

In the following we shall use J,, ---, 7, to denote 7-invariant factors of F 
which are not constants. 

THEOREM 1. Two forms F and F’ of type (1) are equivalent if and only if they 
have the same number of invariant factors distinct from constants, and the i-invariant 


. m " , , 

factors I,, --- , Is of F are simultaneously equivalent to k,I,, --- , k,I,, where 
, , . «-. . 

ki, --+ , ks are constants, and I,, --- , I, are the corresponding i-invariant factors 

of F’. 


The necessity of the conditions of the theorem follow from the above remarks 
on the manner in which 7-invariant factors of a form F are changed when 
transformations are made on F. 

To prove the sufficiency of the conditions, let 


rf si Mi(o,, oe > ee Mia, 2+ Dy)s 
where M§(p,, ---,p,)) ---; Mi (a, --+, p,) are linear forms in pi, --+, pr; 
similarly let 
, , , , , , 
I, = M§'(oi, ---, p,) -++ Mi (1, 1+ py). 


Assume that there exist non-singular linear transformations 


(3) Pm = Omi p. (¢,m=1,.---,7r), 
and k;, € = 1, --- , s, such that 

(4) Ty = kM (bus pi, +++, Bi oy) «++ ME (ii i, +>, bri 9): 

Now (4) implies that there exist constants Cu, ---, Cerys where Cy --- Cu, = 


ky, such that the linear forms 
Ce Mi (bis pi, ++ >, bi vids +++ » Cee ME (Ou 0; , --+ 5 bri pi) 
are identically equal in some order to 
ME'(o1, -°+ 5s) °° » Mii(o1, +++) p,). 
By the lemma, 
M§(pr, --+ » pry «°° »Mi a, -+* 5 pr) (¢ = 1, ---,8) 


‘ 
, 
7 That is, the matrix of the bilinear form o; p: + +--+ +-or pr + oat » i Gr4ispit-: 
i=l 


Tr 
, 
+ on > ani pi is G. 
‘=: 
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are equal in some order to the diagonal elements of the 7-characteristic matrix 
M(i) of F given above. Let F’ be given by 
= , , , = : , , , , 
, - , 
Fi = Do teYatet DY DL 95 iY; 2;- 
a=1 j=rt+l i=l 
—e ie # , gl, ? , . 
Now Mj (p,, ig »Pr)s he hs » Mj. (or, or Pr), = = 1, ‘s+ , 8, are equal in 
some order to the diagonal elements of the i-characteristic matrix of F, which 
are given by 
r r 
, / , ” , , ” , 
Fis *** s Bos Mes - >» Gr+1.i Pir ee 7; = > Qni Pi- 
i=1 i=1 


It follows by (2) and (3) that there exist constants C;, 7 = 1, --- , n, such that 


- , = , 
>. Cybim Pms a > Crbre Pm» 
m=1 


n=1 


r r 
, , / , 
Cr41 > is B41, mn Ond Paa-**? »C. > an wm Dmi Pi 


m, i=1 m, i=1 


are equal in some order to 


, , , , 
Sis *** crease *** olgs 
This implies that there exist non-singular transformations (3) and 


= , ° . . 
(5) Cio; = ¢; (i,j = 1, --- ,n;i not summed), 


where there is one j for every 7, such that the bilinear form 


(6) Opi + o-e+ + Ope + Orarbrns + +++ + Onlin 
transforms into the form 
- », e* , ’ tel 
(7) O1P1 + ch ta + o,Pr + Ogi b,41 + ** + o,L,. 
Now (6) and (7) become F and F’ if we make the substitutions 
, , , , , » 
pi = %, Pp; =2;, Os = Yiri, o; = ¥;2; (¢ not summed). 


By (3) and (5) F becomes F’ if we let 

(8) Yt = Ciy jz; (i,j = 1, --- ,m, not summed), 
(9) Im = Dini, (m,i=1,---,7r). 
Equation (8) is equivalent to 


(10) ¥i= Cw;; a= zy (7,7 = 1, --- , mn, not summed). 


Hence F is equivalent to F’ under the non-singular linear transformations (9) 
and (10). 
Since the 7-invariant factor J; of F contains the factors p;, --- , p-, the 
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matrix of the linear factors of 7,, which will be called the matrix of J;, can be 
written in the form 

I 

K,/’ 


where J is a Kronecker delta of order r. Let matrices of the 7-invariant factors 
I,, --- , I, of F be denoted by 


I 
11 oe Wi os, 2 
(11) tS) 
respectively, and matrices of the corresponding 7-invariant factors eres of 
of form F’ by 
rf = = 
(12) (x:) K,, ce ae K, 


respectively. Let a non-singular matrix J which permutes the rows or columns 
of a 2-way matrix A under the operation JA or AJ be called a permutation 
matrix. 

Equations (3) and (5) imply the 

Coro.iary. Two forms F and F’ of type (1), which have the same number of 
i-invariant factors distinct from constants and corresponding i-invariant factors 
are of the same degree, are equivalent if and only if there exist non-singular per- 
mutation and diagonal matrices J;, a:, and a non-singular matrix X such that 


I ‘i I 
ai (x) x= (x) 
a; J; KiX = K; (¢ = 2,--- , 8), 


wee 8, Ki, ---, K., Ki, «+ K! are as given in (11) and (12). 
We have now 
Lemma 2. [f there exists a diagonal matrix 


0 ae 


where a, is a minor of order r, and a non-singular 2-way matrix X such that 
a,IX = I, where I is a Kronecker delta, then X is a diagonal matrix and 


Pin I 
" (4) -* (.. x) 


If 
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and a,JX = I, then 


‘1 0 

ry {[a@ 

= 1 
0 a, 


Evidently, 
a) 0 I " al IX 
0 ae A ae AX 


Let a and 8 denote non-singular diagonal 2-way matrices, and let X be a non- 
singular 2-way matrix. If elements in two matrices are in corresponding 
position, these elements will be said to correspond. We similarly define corre- 
sponding rows and columns. We now have the following results. 

Lemma 3. If aAB = A’, the vanishing elements of A and A’ correspond. 

TuHeorEeM 2. If AX = A’, the minors composed of corresponding rows of A 
and A’ are of the same rank. 

Theorem 2 implies the 

Coroutiary. The matrix 














I 
Ky 
K, 
K, 
Ki) 
of the i-invariant factors I,, I,, I.,---,I, of F has the same number of non- 
singular minors of maximum order r as the matrix 
I 
ad; } Ky 
ay, OO K, 
sd 5 K, : 
0 . : 
ard t RK; 
where a1, @,---,a@ and J,, Jp, --- , Je are diagonal and permutation matrices 


mF | > . : 
of the same orders as there are rows in ( K.) K,, «++ , Ke respectively. 
1 


3. The case r = n — 1. Using the results of §2 we shall prove 
THEorEM 3. If r = n — 1, the form F is equivalent to a canonical form of 
the type 


n—-1 g 
C= DY raYota + nen ( r.), 


a=1 a=1 


where o is unique and <= n — 1. 
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The form F given in (1) is now 


n—1 a-i 
, 

(13) F = y TaYaro + Zz Ang LiYnEn- 

a=1 i=1 

/ , , . 
If the o elements a,,,,, @n,95 *** » @n,» # 0, and all other elements in the array 
, , . . 

@,.15°** » @n,,n—-1 are zero, make the substitutions on the 2’s, y’s and 2’s 


in (13) to give a new form of type (13), where 

(14) Oasis *9* liga WOR okas **< eG ens OO 

Assume that F is of type (13), where (14) is satisfied. Making the trans- 
formations 


, , 
Ty Le , , , , 
(15) Ty = Hy ttt Le = Hy Yl = Gn Yip +++ » Yo = Bn Yo 
ni no 
on F, we obtain C. The transformations used to obtain C are non-singular. 
To prove that o is unique, we consider the case where C has only one i- 
invariant factor, J,, distinct from a constant. By Lemma 1 


h=pi-::- pal > ) 


a=1 


] ae 0 
( A =|{0 "+ wn 
oo, Pee 
where there are ¢ = 2 unit elements in the last row, and there are n — 1 columns. 
There are ¢ + 1 non-singular (n — 1) order minors in the above matrix, whence 
o is invariant by the corollaries to Theorems 1 and 2. 


If C has more than one 7-invariant factor distinct from a constant, by the 
lemma these invariant factors are 


IT, = pi +++ Par» I, = > pe; 


whose matrix is 


whence, since J, divides [;, we have o = 1. In this case there is only one 
non-singular minor of order (n — 1) of the matrix of 7; , which property is 
invariant by the corollaries of Theorems 1 and 2. 


4. The equivalence of forms with r = n — 2. For forms with 2 or 3 
i-invariant factors we have 

THEOREM 4. A trilinear form F with r = n — 2, and 

(a) two i-invariant factors distinct from constants, one of which is of degree 2, 
and the other of degree = 2, is equivalent to 


n—2 


> 
> LaYazZa a U1 Yn-1 2n-1 + T2YnZn;5 
1 


a 
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(b) two i-invariant factors distinct from constants, one of which is of degree 1 
and the other of degree = 1, ts equivalent to 


n—2 Pp 
>» Ta Yara + (> re) (Yn—12n—1 + Yn Zn) (2 -- p > died 2), 
a=l a=1 
or 
n—2 o 
> La Yara + nate re) + Yn2nXo4i (2 = o s > = 3), 
a=l1 a=l 


(c) three i-invariant factors distinct from constants are equivalent to 


i 


a— 


De TaYa2a + Li(Yn—1Zn—1 + Yn2n)- 
a=l 
These canonical forms are not equivalent. 

For the sake of brevity we omit the proof which involves the use of lemmas 
and theorems proved in the preceding section. 

For the case not treated in Theorem 4 the form F has one ?-invariant factor 
distinct from a constant. It can be proved that two such forms are equivalent 
if and only if certain rational functions of the coefficients of one form are equal 
to corresponding rational functions of the coefficients of the other. For a given 
n = 4, the canonical forms are unlimited in number so that there are an un- 
limited number of sets of equivalent forms. 


5. Necessary and sufficient condition for equivalence. To distinguish be- 
tween the canonical forms of part (b) of Theorem 4 we note that the matrix 
of the 7-invariant factors of the first form is 


I I 
K,})=[1.---10--- 0], 
a fee eer 


where J is a Kronecker delta of order n — 2, and there are p unit elements in 
each of the last two rows. The corresponding matrix for the second form is 


I 
1---100--- 0], 
Po 


where there are o unit elements in the next to the last row. Assume that the 
necessary equivalence condition p = ¢ is satisfied. In the first matrix there 
are 2p + 1 non-singular minors of order n — 2. In the second there are p + 2 
such minors. Since p > 1, we have 2 p + 1 # p + 2. By the corollaries to 
Theorems 1 and 2 the associated forms are not equivalent. We have proved 

THEorEM 5. Let F and F’ be two trilinear forms of type (1) with r = n — 1, 
or with r = n — 2 but 2 or 3 i-invariant factors. The forms F and F’ are equiva- 
lent if and only if 
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(a) they have the same number of t-invariant factors distinct from constants, 

(b) corresponding t-invariant factors are of the same degree, 

(c) the matrix of the i-invariant factor I, of F and the matrix of the i-invariant 
jactors I, , Iz of F have the same numbers of non-singular minors of order r as the 
corresponding matrices for F’. 


6. Conclusion. We have considered all possible cases of trilinear forms 
where r = n — lorn — 2. The case r = n — 2 is typical of the cases where 
r<n— 2. 

Let F denote a singular multilinear form 


(1) (p) 
Ri, viele af, 
lays vip=l 
singular on one index % and of rank r = n — 1 or n — 2 0n this index. To 
obtain the canonical forms to which F is equivalent from the canonical forms 
s ° 1 (2 ( 
of this paper simply replace x, by x and y,z, by x2? --- x? for every a. The 


i-invariant factors for a general F are defined in terms of the space determinant 
minors of a matrix associated with F. 


ArMowUR INSTITUTE OF TECHNOLOGY. 
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THE QUADRATIC SUBFIELDS OF A GENERALIZED QUATERNION 
ALGEBRA 


By C.uarBporNeE G. LATIMER 


1. Introduction. Let % be a rational generalized quaternion algebra with 
the fundamental number d, as defined by Brandt.! Every element of %, not 
rational, is a root of a quadratic equation with rational coefficients, and hence 
defines a quadratic field. The question arises as to what quadratic fields are 
contained in %. The purpose of this note is to prove the following 

THEoREM. Let % be a rational generalized quaternion algebra, with the funda- 
mental number d, and let F be a quadratic field. % contains a field equivalent to F 
if and only if 

(a) F is imaginary when d > 0; 

(b) no rational prime factor of d is the product of two distinct prime ideals in F. 

Hasse proved a theorem on the splitting fields of an algebra which, when 
properly specialized, is equivalent to the above theorem, his results being in 
terms of the p-adic extensions of % and of F.2 Our proof is independent of 
Hasse’s and is short and elementary. 


2. Proof of necessary conditions. Suppose % contains Ff. Let F be defined 
by (— a@)!, a being an integer with no square factor > 1. If d > 0, by the 
definition of d, % contains no element with a negative norm. Hence F is 
imaginary. 

% contains an element 7 such that ? = — a. Then the trace, or double the 
sealar part, of 7 is zero. It may be shown that & also contains a non-singular 
element j, such that the trace of j and the trace of ¢j are zero. Then 1, 1, j, tj 
are linearly independent, and hence form a basis of U1, 7? = —8 # 0, where 8 
is rational, and ji = —ij. We shall assume, without loss of generality, that 8 
is a rational integer with no square factor >1. 

Let a = m6, 8 = 68,6, where 6 is the positive g.c.d. of @ and 8. Then 
d = + ABA ord = + 2ABaA, where A, B, A are certain positive odd divisors 
of a, 8, 6 respectively.2 By the same reference, d is even if and only if 


(1) (a; + B:) (8: + 6) (6 + a1) (a1 + B81 + 6) = 8 (mod 16). 


Received June 8, 1936. 

' Brandt, Jdealtheorie in Quaternionenalgebren, Mathematische Annalen, vol. 99 (1928), 
p. 9. 

2 Hasse, Die Struktur der R. Brauerschen algebrenklassengruppe tiber einem algebraischen 
Zahlkérper, Mathematische Annalen, vol. 107 (1933), pp. 731-760; Deuring, Algebren, 
p. 118. 

>On the fundamental number of a rational generalized quaternion algebra, this Journal, 
vol. 1 (1935), pp. 433-435. This paper will be referred to hereafter as FN. 
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Let p be a rational prime divisor of d and consider the principal ideal {p} 
in F. If p divides AA, it divides the discriminant, —a or —4a, of F and {p} 
is the square of a prime ideal in F. If p divides B, by the definition of B in 
FN, —a is a quadratic non-residue of p. Hence {p} is a prime ideal in F. 
Suppose p = 2. If a = 1 or 2 (mod 4), 2 is a divisor of the discriminant of F 
and hence {2} is the square of a prime ideal in F. Suppose a = 3 (mod 4), 
and hence a; + 6 = 0 (mod 4). By (1), 8 is even and a; + 6 = 4 (mod 8). 
Hence a = 3 (mod 8). Since the discriminant of F is —a, it follows that {2} 
isa prime in Ff. This proves that (b) is a necessary condition. 


3. Proof of sufficient conditions. Suppose the conditions (a) and (b) are 
satisfied. Let F be defined by (—a)!, as before. 

We shall show that there is an integer 8, such that if %, is the algebra with 
the basis 1, 7, j, ij, where ?? = —a, ? = —8, ij = —ji, then the fundamental 
number of %, is d. It follows that %; is equivalent‘ to A. Since %, obviously 
contains a subfield equivalent to F, the same is true of %. 

By the theorem of FN, d contains no square factor >1. Then 


a = 2*ua'p, d = 2vDp, 


where e = Oor 1, f = Oorl, » = +1,v = +1, and a’, D, pare positive odd 
integers, relatively prime in pairs, which we shall assume for the present are 
all >1. Let 

a’ = Pip2-- Dey D = M92 +++ Ge; Pp = Pip2 --> Pt, 


where the p;, qi, p, are primes. By (b) of the theorem, employing Legendre 
symbols, we have 


(2) (=*) - -1 (i = 1,2, --- , 8). 


Let 8; = d/p = 24vD. By Dirichlet’s theorem on the primes in an arithmetic 
progression, there is an odd prime P, prime to ad, such that 


(3) (*) = (=#), (*) = (=) (i - 1, 2, esl k3j = 1, 2, -++,), 
Pi Pi Pj Pj 


and such that the residue of P, mod 8, is an arbitrarily chosen odd integer. 
This residue will be specified in one case later on. 
By (3), employing Jacobi symbols, we have 


© OG) O-Go 


Setting a, = a’p, employing the quadratic reciprocity theorem, and noting 
that (u/P) = (—1)',h = }(u — 1) (P — 1), we have 








* Brandt, loc. cit., p. 12. 
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(=) Ce) Ce) Ge) = Go 


“rom and the last equation, we have 
I 4) and the last equation, we hav 


. G#)- 2) cae 


Suppose e = 1 or a; + uw» = 2(mod 4). Then the residue of P, mod 8, may 
be chosen so that K is even or odd, the choice being made so that the right 
member of (5) is unity. Then —a is a quadratic residue of P. 

Suppose e = 0 and a; + wp = 0 (mod 4). Then a = ay = 3 (mod 4), K is 
even and by (5) 


© (F)-(G)eus, ve tpt 2 afta. 


By (2), 


n= NG) 


Then by (6), 


a) _ (_yyorm a @=tet! 
(t)=(-p, wa Ps _ 


($2) = (or, T=s+t+L+M. 
As noted in the last paragraph of FN, d is positive or negative according as 
it is the product of an odd or an even number of primes. Hence s + t = }(» + 1) 
(mod 2) or s + t = 3(v — 1) (mod 2) according as f = 0 or f = 1. Suppose 
f=1. Thenby (b) of the theorem, a = 3 (mod 8) and 1 = }(aj — 1) (mod 2). 
Hence for f = 0 or f = 1,s + t+ f(aj — 1) = }(v + 1) (mod 2). Since 
a, = —u (mod 4), it follows that 
+lil-u, 
ee Soe wx en y ihe OE ay Sate cco , 
T 9 9 5 +M 5 5 (mod 2) 
If vy = 1, by (a) of the theorem, » = 1. Hence in every case T is even. Then 
—a is a quadratic residue of P. 
Let 8 = B:\P = 2%vDP. Then, employing (2) and (3), we have 


(i) —8 is a quadratic residue of every prime factor of a’, 

(ii) —8 is a quadratic non-residue of every prime factor of p, 
(iii) —@ is a quadratic non-residue of every prime factor of D, 
(iv) —@ is a quadratic residue of P. 


We have assumed heretofore that a’, p, D are greater than 1. If a’ = 1 
or p = 1, our former definition of P is without meaning. If a’ = p = 1, ie., 
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a = +2 or 1, let P be any prime in the form 8n + 1. If a’ = 1,p > 1, let P 
be defined as above, except that the symbols in (3) involving the p; are ignored; 
similarly for the case a’ > 1, p = 1. If D = 1, let P be defined as above. 
Then P is defined in every case and the conditions (i) to (iv) are satisfied, some 
perhaps vacuously. 

Let %, be the algebra with the basis 1, 7, 7, 77, where 7? = —a,j? = —68,1j = —Ji. 
a and 8 have no common odd prime factor. Hence, by the theorem of FN, 
the fundamental number of % is d; = + AB or d; = + 2AB, where A, B 
are the least positive odd divisors of a, 6 respectively such that —8, —qa are 
quadratic residues of a/A, B/B respectively. By the conditions (i) to (iv), 
A = p,B =D. By the theorem of FN, d, > 0 if and only if a > 0, 8 > 0. 
8 has the same sign as d and by (a) of the theorem, if d > 0,thena > 0. Hence 
d, and d have the same sign and they are equal or one of them is the double 
of the other. But we have seen that the sign of a fundamental number is 
determined by the parity of the number of primes dividing it. Hence d; = d. 
By the fourth and fifth sentences of this paragraph, it follows that & contains 
a field equivalent to F and the theorem is proved. 


UNIVERSITY OF KENTUCKY, 




















SEMI-CLOSED SETS AND COLLECTIONS 
By G. T. WuyBurN 


1. A set K in a metric space S will be said to be semi-closed provided each 
component of K is closed and any convergent sequence of components of K 
whose limit set intersects S — K converges to a single point of S — K. 

Similarly, a collection G of disjoint sets is said to be semi-closed if each set 
of G is closed and any convergent sequence of sets of G whose limit set inter- 
sects S — G* converges to a single point of S — G*, where G* denotes the point 
set which is the sum of all the sets of the collection G. 

For example, any closed set is semi-closed, as is also any totally discon- 
nected set or the sum of any closed set and any set of dimension zero. Any 
null collection of disjoint closed sets (i.e., a collection having only a finite num- 
ber of elements of diameter greater than any preassigned ¢ > 0) is semi-closed. 
The collection of components of any closed set K is semi-closed, as is also this 
collection together with an arbitrary null collection of disjoint closed sets, no 
one of which intersects K. 

The principal object of the present paper will be to develop conditions under 
which the complements of semi-closed sets and collections in various continuum 
spaces will be connected and locally connected. 


2. We begin with some results giving fundamental relations hetween these 
sets and collections and upper semi-continuous collections.! 

(2.1) THeorEem. If acollection G of disjointclosed setsis wpper semi-continuous, 
then in order that G be semi-closed it is necessary and sufficient that the decom- 
position of S into the sets of G and the individual points of S — G* be upper semi- 
continuous. 

This theorem follows immediately from the definitions of semi-closed and 
of upper semi-continuous collections. 

(2.11) Corotiary. If G is any upper semi-continuous collection of disjoint 
closed sets filling up S, and if Go is the set of all non-degenerate elements of the 
collection G, then any subcollection G, of G such that Gyo C G, C G is semi-closed. 

(2.2) The collection G of all components of any semi-closed set K in a compact 
space S is upper semi-continuous. 

For if this were not so, there would exist a convergent sequence g;, g2, °*- of 
sets of G such that if L = lim (g;), then for some g eG we have 


L-q a 0 a L-(S _ q). 


Received August 24, 1936. 

1 See R. L. Moore, Transactions of the American Mathematical Society, vol. 27 (1925), 
pp. 416-428. As used in the present paper, a collection G is upper semi-continuous pro- 
vided that for every convergent sequence of elements (g;) of G whose limit set L intersects 
g¢«Gwehave L Cg. For compact spaces this is equivalent to Moore’s original definition. 
See ref. 3. 
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Since L is connected and g is a component of K, it follows that L-(S — K) # 0. 
But then (g;) would necessarily converge to a single point of S — K, contrary 
to L-qg # 0. 

(2.3) THeorEM. Jf K is a semi-closed subset of a compact space S, there 
exists a monotone transformation? T(S) = S’ of S into a compact space S’ such 
that for each y « S’, T~(y) is either a component of K or a single point of S — K. 
Thus T is a homeomorphism on S — K. 

For by (2.2) and (2.1) the decomposition of S into the components of K 
and the individual points of S — K is upper semi-continuous. Hence by well 
known results*® this decomposition is equivalent to a monotone transformation 
T(S) = S’ satisfying all the requirements of (2.3). 

(2.4) THEoremM. In order that a set K be semi-closed it is sufficient, and in 
case the space is compact it is also necessary, that for each « > 0 the sum of all the 
components of K of diameter = « be a closed set. 

Proof. To prove the sufficiency, let gi, gz, --+ be any convergent sequence 
of components of K whose limit set Z intersects S — K. Then this sequence 
must be a null sequence. For if it contains an infinite subsequence (g,,), each 
element of which is of diameter greater than some preassigned « > 0, it would 
follow from our hypothesis that Yg,, is contained in a closed set which in 
turn is contained in K, and this is impossible since (g,,) converges to L and 
L-(S — K) #0. Thus (g;) is a null sequence and hence L must reduce to a 
single point of S — K. 

To prove the necessity, let us suppose S is compact, let « > 0 be given, and 
let K, be the sum of all components of K of diameter 2 «. Then if K is not 
closed, it readily follows that there exists in K, a convergent sequence of com- 
ponents g1, gz, --: of K whose limit set Z is not contained in K,. But this is 
impossible because 6(g;) 2 ¢«, i = 1, 2,---, gives 6(L) 2 «. Also L is con- 
nected, and since K is semi-closed, we must have L C K. Thus L is con- 
tained in some component g of K which in turn belongs to K,. 

(2.41) Corotitary. If S is compact and K C S is semi-closed, the sum of 
all non-degenerate components of K is an F,. 


3. We shall develop next certain notions of separation which will be needed 
in what follows. 

If K is any set, a continuum N in K is said to separate K provided there 
exists a separation K — N = K, + Ke, where both K,; and Kg intersect the 
component of K containing N; N is said to separate K locally provided N sepa- 
rates some open subset of K containing N. 

(3.1) THetorem. Jf T(A) = B is monotone, where A is a compact continuum, 


2 A single-valued continuous transformation 7(A) = B is said to be monotone provided 
that for each b e B, T-'(b) is a connected set. See C. B. Morrey, American Journal of 
Mathematics, vol. 57 (1935), pp. 17-50. 

3See Alexandroff, Mathematische Annalen, vol. 96 (1926), pp. 555-571; Kuratowski, 
Fundamenta Mathematicae, vol. 11 (1928), pp. 169-185. 
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a subcontinuum K of B will locally separate B if and only if T-(K) locally sepa- 
rates A. 

Proof. Suppose T~'(K) locally separates A. Then there exists a neighbor- 
hood U of T-'(K) in A and a separation 


(i) U — T-(K) = Ui + Uz, 


where both U; and U,» intersect the component C of U containing T~'(K). 
Since T-'(K) C U, there exists a neighborhood V of K such that 


(ii) T-“(V) CU. 
Since T-'(V) is open in A, by (i) we have a separation 
(iii) T-(V) — T-"(K) = Vi + Ve, 


where V; = U;-T-(V) (i = 1, 2) and clearly both V; and V3 intersect the 
component C’ of T-'(V) containing T-'(K). Applying T to (iii), we get 


(iv) V —K = T(V;) + T(V3). 


Now since T is monotone, it follows that T-'T(V;) = V; (i = 1, 2), and hence, 
as A is compact, the sets T(V;) and T(V;) are mutually separated. Further- 
more, each of these sets intersects the component of V containing K, since 
both V{ and V; intersect C’. Thus K separates V and accordingly locally 
separates B. 

To prove the converse, let us assume that K locally separates B. Then 
there exists a neighborhood V of K in B and a separation 


V—-K=V,.+t+ Vs, 


where both V; and V¢ intersect the component C of V containing K. Now if 
we set 


T-(V) = U, T-(V,) = U; (i = 1, 2), 
U is open in B and we have a separation 
U — T-(K) = Ui + Us, 


since JT is continuous. Now since T is monotone, it follows that T-'(C) is the 
component of U containing T-'(K). Thus T~'(AK) separates U’ and locally 
separates A, since U;-T-\(C) ¥ 0 (¢ = 1, 2). 

(3.2) THrorem. Let F be a totally disconnected set of non-local-separating- 
points of a locally connected continuum S such that only a countable number of 
components of any irreducible cutting of S between two points a and b intersect F. 
Then S — F is connected and locally connected. 

Proof. Let R be any region in S. We shall prove that R-(S — F) is con- 
nected. Clearly our theorem results at once from this. Suppose, on the con- 
trary, that R-(S — F) is separated between two of its points a and b. Then 
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F + S — R cuts S between a and b. Accordingly, it contains a closed irre- 
ducible cutting X of S between a and b. By hypothesis only a countable 
number of the components of X intersect F. 

Now since R is connected and R Da + b, we have R-X # 0, and R-X cuts R 
between a and b. But since R-X C F, it follows that R-X is countable; 
since R-X is countable and closed in R, it must contain an isolated point, and 
this must be a local separating point. This contradicts the hypothesis that 
no point of F is a local separating point. 

Thus R-(S — F) is connected and our theorem follows. 


4. We now apply the preceding results to obtain conclusions concerning the 
connectivity and local connectivity of the complement of a semi-closed set. 

(4.1) THrorem. Let F be a semi-closed subset of a compact locally connected 
continuum S such that no component of F separates S locally and such that for any 
irreducible cutting K of S between two points, K-F is contained either in a countable 
number of components of K or in a countable number of components of F. Then 
S — F is connected and locally connected. 

Proof. By (2.3) there exists a monotone transformation T(S) = W such 
that for each we W, T~'(w) is either a component of F or a single point of 
S — F. Then W is a locally connected continuum and T(F) is a totally dis- 
connected set of non-local-separating-points of W [by (3.1), since no T~'(w) 
locally separates S for w e T(F)]. 

Now let X be any irreducible cutting of W between two points @ and b. 
Then T-'(X) separates S between points a’ « T~(a) and b’« T-(b). Let ¥ 
be a subset of T-'(X) separating S irreducibly between a’ and b’. Then, by 
hypothesis, either only a countable number of components Y;, Ye, --- of Y 
can intersect F or only a countable number of components of F intersect Y. 

Now we must have T(Y) = X. For if not, since T(Y) C X and X cuts W 
irreducibly between a and b, there would exist a connected set N such that 
a+bCN CW — T(Y), and this is impossible, since from the fact that T 
is monotone, T~'(N) is connected and contains a’ + b’ but does not intersect Y. 

Thus T(¥Y) = X. Now in either of the two possible cases it is clear that 
there exists a countable sequence X,, X2, --- of components of X such that 
T(F-Y) CX,+X2+---. Butit follows at once that T(F-Y) = T(F)-T(Y), 
since z e T(F)-T(Y) gives T(z) = acomponent of F. Thus 2X; > T(F)-T(Y) 
= T(F)-X. Hence only a countable number of the components of any irre- 
ducible cutting of W between two points can intersect T(F). 

Thus by (3.2), W — T(F) is connected and locally connected. But by the 
definition of T we have T(S — F) = W — T(F) and T is a homeomorphism 
on the set S — F. Therefore S — F is connected and locally connected. 

(4.2) THeorem. Let F be any semi-closed subset of a compact uni-coherent* 


‘For definitions of these terms, see Kuratowski, Fundamenta Mathematicae, vol. 12 
(1928), p. 24. 
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(n-coherent, No-coherent)* locally connected continuum S such that no component of F 
separates S (separates S locally). Then S — F is connected and locally connected.* 

Proof. By uni-coherence of S and the fact that no component of F separates 
S, it follows that in any case no component of F separates S locally. 

Now if X is any irreducible cutting of S between two points a and b and 
if S, and S; are the components of S — X containing a and b respectively, then, 
since we can express S as the sum of two continua in the form 

S = (S. + X) + (S — S.), 
where (S, + X)-(S — S.) = X, it follows that X can have at most one, n, No 
components according as S is uni-, n-, No-coherent. Thus surely X can in 
any case have only a countable number of components intersecting F. There- 
fore, by (4.1), S — F is connected and locally connected. 

(4.3) THeoreM. Let M be a subcontinuum of a compact uni-coherent locally 
connected continuum S such that the complementary domain boundaries of M in S 
are disjoint, and form a semi-closed collection, but no one of them separates M. 
Then if B denotes the sum of these boundaries, M — B is connected and locally 
connected. Furthermore, M — B is homeomorphic with the complement of a 
countable set of points on a cyclic monotone image = of S, and thus if S is a topo- 
logical sphere, so is =; hence if M is non-dense on S, M — B is homeomorphic 
with the set of all irrational points in a plane. 


Proof. Let the domain boundaries be B,, Bz, ---. Then by hypothesis® 
B,, Bz, --+ are the components of B and B is semi-closed. Now for each i, 


let F; denote the set B; plus all complementary domains of B; except the one 
containing M — B;. (Note that the set M — B; is connected by hypothesis.) 
Let F = =F;. Then for each 7, F; is a component of F and it readily follows 
that F is semi-closed. Since, for each i, S — F; is a single complementary 
domain of B;, no set F; separates S. Therefore, by (4.2), S — F is connected 
and locally connected. 

NowS—-F=M-—B. ForletreS—F. ThenzeM. Forif not, zliesin 
a complementary domain D, of M, and this gives r C D, CD, + B, CF, CF, 
an impossible result. Thus x2 ¢M, and since B C F, we have re M — B, 
whence S — F CM — B. On the other hand, F > BandF CB+S—M 
gives S — F D> M — B, whence S — F = M — B. Accordingly, M — B is 
connected and locally connected. 

Now by (2.3) there exists a monotone transformation T(S) = = such that 
for each x eZ, T~(x) is either a component of F or a single point of S — F, 
because F is semi-closed. Thus M — B (= S — F) is homeomorphic with 
> — T(F). Also, > is cyclic, since no set T-(x), x €X, separates S. 

5A result closely related to this theorem has been found by R. L. Wilder. See his 


abstract in the Bulletin of the American Mathematical Society, vol. 34 (1928), p. 426, 
no. 22. 

° It is readily seen that, when added together, the elements of any countable semi- 
closed collection of disjoint continuua (X;) in a compact space form a semi-closed set 
whose components are the Xj. 
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Now since if M is non-dense in S, F is dense in S and T(F) is dense in &, 
and since 7(F) is countable, it follows that in this case M — B is homeomorphic 
with the complement on = of a countable dense set. 

If S is a topological sphere, so is =, since = is the cyclic monotone image! 
of S. Thus if M is non-dense in S, T(F) is countable and dense in 2, and 
hence it is isotopic with the set of rational points on a sphere or plane. Thus 
M — Bis homeomorphic with the set of irrational points on a sphere or plane. 

As a simple application, let S be a sphere and let M be a locally connected 
continuum on S having no local separating point. The conditions of (4.3) are 
then satisfied and accordingly M — B is homeomorphic with the complement 
of a countable set of points on a sphere, and if M is non-dense, M — B is homeo- 
morphic with the set of all irrational points on a plane. 


Tue UNIVERSITY OF VIRGINIA. 
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CRITERIA FOR THE COMPOSITENESS OF FINITE GROUPS 
By Louis WEISNER 


1. Introduction. Tchounikhin has recently established a criterion for the 
compositeness of a finite group' of which the following is a modification. 

THEOREM 1. Let G be a group of order g = p*m, where p is a prime that does 
not divide m, and let P be a subgroup of order p* of G. G has an invariant sub- 
group of index p that does not include a particular element S of P if and only if 
P has a maximal subgroup P, which does not include S, such that every conjugate 
of S' (l any integer) under G that is contained in P has the form S'V, where V is 
an element of P,. 

I propose to show in the present paper that the condition of the theorem is 
satisfied under fairly general assumptions concerning the relation of P to G, 
thus deducing new compositeness criteria. 


2. Notation. The notations of Theorem 1 for G and P will be employed 
throughout this paper. The normalizer in G of an element A or subgroup A 
of G will be denoted by N(A). The cross-cut of two groups I; and TI, will be 
denoted by (T;, T2) and the group they generate by {T, Te}. 


3. Proof of Theorem 1. Let G’ be an invariant subgroup of index p in G 
which does not include S. P; = (P, G’) is clearly a maximal subgroup of P 
and a Sylow subgroup of G’. The commutator of S' and any element of G 
is an element of G’; hence, if this commutator is an element of P, it is an ele- 
ment of P;. The condition of the theorem is therefore necessary. 

In proving the condition sufficient we shall suppose, without introducing 
any change of notation, that G is a regular permutation group on the symbols 
1, -+-+,2%,. By asuitable choice of the notation we may suppose that 


You % + --> + In (n = ps) 
belongs to P;. The conjugates of yo under G are linearly independent, since 
no two of them have a term in common. The permutations of P transform 
Yo into 

yx = S*yo (k = 0,1,---,p—1), 
each of which is an invariant of P;, since P; is an invariant subgroup of P. 
The function 


Received June 3, 1936; presented to the American Mathematical Society, October 26, 1935. 
1 Serge Tchounikhin, Uber einige Sdtze der Gruppentheorie, Mathematische Annalen, 
vol. 112 (1935), p. 92. 
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where ¢€ is a primitive p-th root of unity, is therefore an absolute invariant of P;. 
It is, however, a relative invariant of P, since 


So, = €¢1 (ec = 0,1,---,p—1). 


Moreover, since the only permutations of G that permute yo, --- , yp». among 
themselves are those of P, every permutation of G that transforms ¢g; into a 
numerical multiple of itself is contained in P. 

Let 


G = PT, + .-. + PT, (T; = 1) 
be a decomposition of G into cosets as regards P; and let 
gi = Tig (i = 1, --- , m), 


These functions are distinct. If B is a permutation of G and 7;B is in the j-th 
coset, T;B = UT;, where U is a permutation of P. Therefore’ 


Bos = (T.B)gr = (UTe: = Tier) = 9) Ov = 1). 
Every permutation of G therefore transforms each of the functions ¢), --- , Om 
into one of these functions multiplied by a power of «8 Their product 

@ = Gi +++ Om 
is therefore an invariant of G. We proceed to prove that @ is a relative 
invariant of G. 
Suppose that, for a certain index 7, 
(1) Sei = €¢: (0ScSp-1}). 
Then 
(T:S)¢1 ™ €(T 1), 

(2) (TST;")e: = €¢r. 
We have seen that the only permutations of G that transform ¢; into a numerical 
multiple of itself are those of P; therefore T;ST;' is contained in P. By 
hypothesis T7;ST;* = SV, where V is a permutation of P;. Therefore, since 
Ve = #1, 
(3) (TST; "gr = (SV)gi = er. 
Comparing (2) with (3), we infer that (1) is possible only if c = 1. Supposing 
the notation chosen so that 


2 In the equations which follow the permutations operate from left to right. 

3A representation of G as a monomial group therefore arises, and the proof of the 
theorem may be completed by employing the theory of monomial groups, following W. 
Burnside, Theory of Groups, second edition, 1911, p. 325. This is the method employed 
by Tchounikhin, loc. cit. I prefer, however, to use the more elementary concept of rela- 
tive invariant of a permutation group, following H. F. Blichfeldt, Theorems on simple 
groups, Transactions of the American Mathematical Society, vol. 11 (1910), pp. 1-14. 
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Se; = (a = a tee y r), 
Soi ¥ &; (@=r+1,---,m), 
we have 
S(gi --+ gr) = (gi «+ or), 


where r 2 1, since Sg; = «eg. If r = m, ois a relative invariant of G, since 
(m, P) = 1. 

If r < m, each of the functions ¢,4:, --- , @m is transformed by S into another 
of the set, multiplied by a power of «. Aside from these multipliers, S permutes 
these functions according to a permutation whose order is a power of p. Their 
number m — r is therefore a multiple of p, so that (r, p) = 1 in any case. If 


(Grsiy *** » Grepe) iS a cycle of the permutation in question, 
SGrit = 4Gr425 S¢rs2 = 6743, °° » S¢,4 pe = EpePriiy 
where €|, €&, «-- , €pe are powers of «. Denoting their product by 
€ (0 S¢=p-— }), 
we have 


S”o,41 = €Orii- 
Hence 
sperm 
(T4:S”"T 741) ¢1 = €¢%). 


T..S”T;, is therefore a permutation of P and consequently, by hypothesis, 
is a permutation of P,, since S” is a permutation of P,. It follows that c = 0, 
so that 


S(¢r41 <2 Gripe) = Gr+1 *** Orape- 


Treating the remaining functions in the same way we conclude that S¢@ = €¢. 
Therefore, since (r, p) = 1, ¢is a relative invariant of G. Those permutations 
of G which transform ¢ into itself form an invariant subgroup of G of index p 
which does not include S. The proof of the theorem is complete. 


4. Theorems involving primitive elements. An element of a group is im- 
primitive or primitive according as it is or is not contained in every maximal 
subgroup of the group. The appropriateness of these terms may be realized 
by comparison with the concept of a primitive element of a field, considering 
that a primitive element of a group is included in at least one set of independent 
generators of the group, while an imprimitive element is not.‘ 

THEOREM 2. If a primitive element of S of P is commutative with every element 
of G whose order is prime to p, G has an invariant subgroup of index p that does 
not include S. 


‘See Miller, Blichfeldt and Dickson, Finite Groups, 1916, p. 71. 
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This theorem is an immediate consequence of Theorem 1, as the conditions 
of Theorem 1 are fulfilled by S and any maximal subgroup P; of P that does 
not include S. 

Repeated applications of Theorem 2 yield the following theorem: If every 
element of G whose order is prime to p is commutative with each element whose 
order is a power of p, then G is the direct product of P and a group of order m.5 

If, in addition to the assumptions of Theorem 2, we assume that P is invariant 
under G, we may conclude that a maximal subgroup of P is invariant under G. 
For if G’ is the invariant subgroup of G, whose existence is asserted by Theorem 
2, then (P, G’) is a maximal subgroup of P which is invariant under G. 

TueoreM 3. If P is invariant under G and a primitive element of P is com- 
mutative with every element of G whose order is prime to p, then P has a maximal 
subgroup which is invariant under G. 

In particular, an automorphism of P, of order prime to p, which is commutative 
with a primitive element of P is commutative with some maximal subgroup of P. 

TueoreM 4. If a primitive element of P is invariant under N(P) and under 
every Sylow subgroup of G in which it enters, then G has an invariant subgroup 
of index p. 

Let S be the primitive element in question. Since S is invariant under every 
Sylow subgroup of G in which it enters, the same is true of every conjugate of S. 
Let S; be a conjugate of S under G, and suppose that S; is contained in P. 
Since S and S; are conjugates under G, they are conjugates under N(P).° But 
S is invariant under N(P); hence S = S,;. Therefore no conjugate of S under 
G, except S itself, is contained in P. The same being true of every power of S, 
the conditions of Theorem 1 are fulfilled by S and any maximal subgroup P; of P 
that does not include S. Therefore G has an invariant subgroup of index p 
that does not include S. 


5. General theorems. When the distribution of the elements of P into 
conjugate sets with respect to G is known, all invariant subgroups of G of index 
p may be determined by Theorem 1, if any exist. In cases where the informa- 
tion concerning this distribution is inadequate, but sufficient information is 
available concerning the distribution into conjugate sets of the elements of a 
subgroup I of G that includes P, the following theorem may be found useful. 

TueoreM 5. Let S be an element of P and YT a subgroup of G that includes P. 
If every two conjugates of S' (l any integer) under G that are contained in T are 
conjugates under T, and if T has an invariant subgroup of index p that does not 
include S, the same is true of G.7 


5 Burnside, Theory of Groups, p. 327, Corollary 1. 

* Burnside, Theory of Groups, p. 155. . 

7 Proved by W. K. Turkin, Ein neues Kriterium der Einfachheit einer endlichen Gruppe, 
Mathematische Annalen, vol. 111 (1935), p. 281, subject to the assumption that the order 
of T and the index of T in G are relatively prime. Other criteria of this type are given 
by G. Frobenius, Ueber auflisbare Gruppen, III and V, Sitzungsberichte Berlin, 1901, 
pp. 865 and 1324. 
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If I’ is the invariant subgroup of index p of T, P; = (P, I’) does not contain 
S and is a maximal subgroup of P. By Theorem 1, applied to I’, every conjugate 
of S' under I that is contained in P has the form S'V, where V is an element 
of P,. All conjugates of S' under G that are elements of P are accounted for, 
since two conjugates of S' under G that are contained in P are conjugates under 
lr. The theorem now follows from Theorem 1, applied to G. 

To apply this theorem effectively, it is desirable to know under what cir- 
cumstances all the conjugates under G of an element of I’, that are contained 
in I, are conjugates under T. The next theorem will be found useful in this 
connection. 

TuHEeorEM 6. Let I be a subgroup of G, and C an element or subgroup of Y. 
All the conjugates of C under G which are contained in T are conjugates under 
N(T) tf and only if every two conjugates of T under G that contain C are trans- 
formable into one another by an element of G that is commutative with C. 

To prove the condition necessary, we suppose that C is contained in two 
conjugates T; and T, of Tf. We do not assume that I’, I; and I, are distinct. 
If A is an element of G that transforms IT into Tz and A transforms C into Co, 
C and Cy» are contained in I; and are conjugates under N(T2) since, by hy- 
pothesis, all the conjugates of C under G that are contained in T are conjugates 
under N(T). An element B of N(T:) therefore exists which transforms C 
into Cy. We now have 


A“T,A = r., ACA = Co, BT.B = Is, BCB = Co. 


It follows that AB is commutative with C and transforms I, into Tr. 

To prove the condition sufficient, let C; and C2 be two conjugates of C under 
G that are contained in T. Let T be an element of G that transforms C, into 
C,, and suppose that 7 transforms T into To. Since C2 is contained in T 
and in Ip, an element U of G exists which is commutative with C, and trans- 
forms Tinto Tp. Therefore 


T=“TT = To, TCT = C:, U-TU =T1o, U-""CLU = Cy. 


It follows that TU! is an element of N(T). that transforms C, into C.. 

It will be noticed that this theorem provides ‘a condition that the conjugates 
of C under G which are contained in T be conjugates under N(T) rather than 
under [. It is, however, possible to choose T so that every two elements or 
subgroups of I which are conjugates under N(T) are conjugates under I. 
This will surely be the case if ! = N(T), that is, if T is its own normalizer in G, 
or if every element of N(T) which is not contained in [ is commutative with 
each element of I. 


6. Case in which P is abelian. If G’ is an invariant subgroup of G of 
index p, (N(P),G’) is an invariant subgroup of N(P) of index p. Therefore, 
if G has an invariant subgroup of index p, the same is true of N(P). 

The converse is true when P is an abelian group. For in this case two ele- 
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ments of P which are conjugates under G are conjugates under N(P). It follows 
from Theorem 5, with T = N(P), that if N(P) has an invariant subgroup of 
index p, the same is true of G. 

THEorREM 7. If P is an abelian group, G has an invariant subgroup of index p 
if and only if N(P) has an invariant subgroup of index p. 

If G has an invariant subgroup of index p, and P’ is a subgroup of order 
p*' of the invariant subgroup of N(P) of index p whose existence we have 
proved, P’ is an invariant subgroup of N(P), and the corresponding quotient 
group has only one subgroup of order p, this subgroup being an invariant 
Sylow subgroup. Since, on the other hand, N(P)/P’ has an invariant subgroup 
of index p, an element of order p of N(P)/P’ is invariant under this group. 
Therefore, if G has an invariant subgroup of index p, P has a maximal sub- 
group P’ which is invariant under N(P), such that N(P)/P’ includes an in- 
variant element of order p. 

The converse is true when P is abelian. For if a maximal subgroup P’ of P 
is invariant under N(P), and N(P)/P’ has an invariant element of order p, 
N(P) has an invariant subgroup of index p. It follows from Theorem 7 that G 
has an invariant subgroup of index p. 

TueoreM 8. [f P is an abelian group, G has an invariant subgroup of index p 
if and only if P has a maximal subgroup P’ which is invariant under N(P), such 
that N(P)/P’ includes an invariant element of order p. 

If (g, p — 1) = 1, an element of order p of N(P)/P’ cannot be a conjugate 
of any of its powers, except the first power, and is therefore invariant under 
N(P)/P’. We therefore have the following 

THEOREM 9. If P is an abelian group, if (g, p — 1) = 1, and if a maximal 
subgroup of P is invariant under N(P), then G has an invariant subgroup of index p. 

Let p™, p™, --- , p" be the invariants of the abelian group P, arranged in 
descending order of magnitude. Those elements of P whose orders divide 
p™ form a characteristic subgroup of P whose order is p* if the largest 
invariant of P is unrepeated; that is, if nm. > ne. From the preceding theorem 
we now have 

THEOREM 10. Jf P is an abelian group of type (nm, ne. --- , Mr), fm > Ne 
=>n3 2--- 2n,, and if (g, p — 1) = 1, then G has an invariant subgroup of 
index p.® 


7. T = N(L), where L is a subgroup of the central of P. We proceed 
to prove 

THEOREM 11. Jf a subgroup L of the central of P is invariant under N(P) 
and under every subgroup of order p* of G that contains L, and if N(L) has an 
invariant subgroup of index p, then G has an invariant subgroup of index p. 

Since L is invariant under every Sylow subgroup of G into which it enters, 


8 If r = 1 or 2, G has an invariant subgroup of index p*. Burnside, Theory of Groups, 
p. 327, Corollary 2. 
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every conjugate of L has the same property. If a conjugate L, of L were con- 
tained in P, L and L,; would be conjugates under N(P), since they are invariant 
subgroups of P. But L is invariant under N(P); hence L = L,. Therefore, 
two distinct conjugates of L cannot be contained in the same Sylow subgroup of G. 
Again, since L is a subgroup of the central of P, and every conjugate of P 
under G contains a conjugate of L, L is a subgroup of the central of every 
Sylow subgroup of G into which it enters. The only subgroups of order p* of G 
that contain L are those of N(L). If U is an element of G that transforms 
N(L) into itself, U-'LU is invariant under every subgroup of order p* of N(L); 
_ hence U“LU = L. It follows that N(L) is its own normalizer in G. 

Every conjugate of N(L) under G is the normalizer of some conjugate of L. 
Suppose that an element 7, of order a power of p, is contained in N(L) and in 
N(L,), where L, is a conjugate of L. Since T is commutative with every ele- 
ment of L and L,, L and L, are subgroups of N(T). Two Sylow subgroups 
J and J; of N(T) that contain L and L, respectively are conjugates under N(T). 
If A is an element of N(T) that transforms J into J,;, A must transform L 
into L, and N(L) into N(L,); otherwise, two distinct conjugates of L would 
be contained in the same Sylow subgroup of G. It follows from Theorem 6 
that two elements of N(L), of order a power of p, that are conjugates under G 
are conjugates under N(L). By hypothesis N(L) has an invariant subgroup 
of index p. If S is an element of P that is not contained in this invariant sub- 
group, the conditions of Theorem 5 are fulfilled by Sand f = N(L). Therefore 
G has an invariant subgroup of index p that does not include S. 


Hunter COLLEGE OF THE City oF New York. 











TRIPLES OF CONJUGATE HARMONIC FUNCTIONS AND 
MINIMAL SURFACES 


By J. W. Hawn anv E. F. BecKENBACH 
A surface S is said to be given in terms of isothermic parameters u, v, provided 
the representation 
(1) S: 2; = 2;(u, v), j = 1, 2, 3, (u, v) in D, 
where D is some finite domain of definition, is such that 
(2) E=G = X(u, v), F = 0, 
where 
E= Fin + Sin + Sens P= Xyu2iy + TeuTev + %s,u%2,v9 
G = 25, +225 + L55 


the second subscripts denoting differentiation. Such a representation is con- 
formal except where A(u, v) = 0. 

A theorem of Weierstrass states that a necessary and sufficient condition 
that a surface S, given in terms of isothermic parameters, be minimal is that the 
coordinate functions be harmonic. Then in any simply connected part of D, 
the functions x; are the real parts of analytic functions, 


zr; = Rf (w), w=utw, 


and (2) is equivalent to 


3 
42 ‘ 
(3) , x Sf; (w) = 0. 
j=1 
If an isothermic representation (1) of the minimal surface S is such that one 
of the coérdinate functions is identically zero, say x3(u, v) = 0, then either 


x\(u, v) + txe(u, v) or re(u, v) + i2,(u, v) is an analytic function of the complex 
variable w = u + iv, and 2x,(u, v) and x2(u, v) are said to form a couple of con- 
jugate harmonic functions. By analogy, the codérdinate functions of any minimal 
surface in isothermic representation have been called a triple of conjugate har- 
monic functions.' 

The analogy here indicated between analytic functions of a complex variable 
and isothermic representations of minimal surfaces has often been noted, and 
since the time of Weierstrass has served as a guiding principle in the study of 
minimal surfaces. It is the purpose of the present paper to pursue this analogy 
from the coefficients viewpoint. 


Received March 30, 1936. 
1E. F. Beckenbach and T. Rad6, Subharmonic functions and minimal surfaces, Trans. 
Amer. Math. Soc., vol. 35 (1933), pp. 648-661. 
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If x;(u, v) is harmonic for (u, v) in the domain D, and if P is an interior point 
of D, then x;(u, v) can be represented in the neighborhood of P by a Fourier 
series, 


(4) zi =a; + > r*(ajx cos k0 + b,x sin ke), 
k=1 


where r and @ are polar coérdinates with pole at P. Two such functions z,(u, v) 
and x2(u, v) form a couple of conjugate harmonic functions if and only if 


(5) Hi. = the, Qn = +i. k = 1,2,3,.---. 


In Lemma 1 we shall determine necessary and sufficient conditions, analogous 
to (5), on the coefficients in (4), in order that three such functions x,(u, v), 
xo(u, v), 3(u, v) shall form a triple of conjugate harmonic functions. The dis- 
cussion, while restricted to three functions, holds equally well for sets of n 
conjugate harmonic functions, as coérdinate functions in isothermic representa- 
tions of minimal surfaces in euclidean n-space. 

Lemma 1. In order that the harmonic functions 


r= a;+ 7 r*(a;,. cos k@ + 6b; sin ké) j=1,2,3, 
k=1 


form a triple of conjugate harmonic functions, it is necessary and sufficient that 


&k—-1 3 


(6) D Uk DY (ai24;.01 — Ojrbj1a) = 0 k= 2,3,4,---, 
1=1 j=l 
k—1 3 

(7) DY Uk —D YS (ayrbjeu0 + baju) = 0 k= 2,3,4,---. 
t=} 7=1 


Proof. We have 


f(w) = a; — ib; + ps (a; — 7b) ,)w*, w = r(cos 6 + 7 sin @), 


k=1 
whence 


3 


3 wo k—-1l 
(8) > f;*(w) m 8 bm +s Uk — I(ajn — 1b),.)(4;,.4 — 1b; .1)w**, 
j=1 j=1 k=2 (=1 
That (6) and (7) are equivalent to (3) follows from (8). 

Lemma 2. Let S and S’ be minimal surfaces given in isothermic representation 
respectively by 


(9) a =a; t+ DY r*(a;x cos kO + b;,. sin k0) (j = 1, 2, 3), 
k=l 

yj = Aj + DS r*(Ajx cos kO + B;x sin ke) (j = 1,2,3) 
k=l 
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in the neighborhood ofr = 0. If x; = y;,j = 1,2,3, ona set R of points having 
3 
r = 0 as a limit point, and if t is the first index for which }> (a2, + b3,,) 0, 
j=1 


3 
then t is the first index for which >> (A>, + Bi) #0. 
7=1 


3 
Proof. Suppose the first index h for which }> (A?,, + B?,) ¥ 0 is greater 
7=1 


thant. Then 
(x; — y;)? = r*[a¥, cos* @ + b? , sin? 0 + 2a;,b;, cos 1 sin t@ + O(r)], 


Edad © 
where O(r) denotes a quantity ¢(r) such that | ¢(r)/r| is bounded for all suf- 
ficiently small r. Noting in (6) and (7) that, for our particular ¢,? 


3 


3 3 
(10) > a, = L 05, #0, D 4425;. = 0, 


i=l i= j= 


we have 


(11) . (xz; — y,;)° = ‘p> ai + ow) |, 


3 3 
Since >> a® , > 0, there is a circle about r = 0 in which ; (x; — y;)® vanishes 
g=a g=3 


only at r = 0. This is a contradiction of the hypothesis that r = 0 is a limit 
3 
point of zeros of >> (x; — y;)?. The same reasoning shows that h is not less 
j=1 


than ¢. 

In particular, y;(u, v) = ¢;,7 = 1, 2, 3, where each c; is a constant, defines 
a minimal surface, so that we have the following result. 

THEOREM 1. Let S be a minimal surface given in isothermic representation by 
(1). If xj(u,v) = ¢;,7 = 1, 2, 3, where each c; is a constant, on a set R of points 
with a limit point P interior to D, then x ;(u, v) = c; in D; that is, S is a point. 

The following familiar theorem is included for the sake of completeness. 

THeoreM 2. Let S be a minimal surface given in isothermic representation by 
(1). If (u,v) = 0 ona set R of points with a limit point P interior to D, then 
x;(u, v) = constant in D; that is, S is a point. 

Proof. Assuming that S is not a point, and taking P as a pole of polar co- 
ordinates, so that S is represented near P by (9), we have the same sort of proof 
as for Theorem 1, except that this time we are concerned not with the function 
(11) but with the function 


2 One might suspect that the conditions (6) and (7) are reducible to 


3 


3 3 3 
> 5.451 = 8 bi pb5 1, > a; 551 4+ ) a;,.; 5 =0 
j7@=l 


j7™1 7=1 g™1 


for all k,l. But for the minimal surface of Enneper, in its standard representation, this 
is not true, for instance, for k = l = 2. 


Fa ata 
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3 
ACu, v) = eea[ at. + ow) |, 
7=1 


3 
t being the first index for which >> (a? , + 05,) #0. 
j=1 


From Theorem 2 it would follow that unless S is a point, the normal to S 
exists except at most at the isolated points P where \ = 0. However, if S is 
not a point, the formulas for the direction cosines of the normal to S, 


a ae ee 
(12) } a = —_ r - = 





P, q, 8 = 1, 2,3 in cyclic order, 
which hold except where \ = 0, reduce by (10) to 


(13) X, = Settes— Grader + Qe), 


3 
> a 
j7=1 


then X, remains continuous at r = 0, insuring the existence of the normal to S 
even at points where \ = 0. 

TurEoreM 3. Let S be a minimal surface given in isothermic representation by 
(1), and let the direction cosines of the normal to the surface at the image of (u, v) be 
denoted by X (u,v). If Xu, v) = ¢;,7 = 1, 2, 3, where each c; is a constant, 
on a set R of points with a limit point P interior to D, then X (u,v) = ¢; in D; 
that is, S is a plane surface. 

Proof Make a transformation of coérdinate axes in the (7,72%3)-space so 
that the origin is at the image of P and the positive 2;-axis coincides with the 
positive normal at that point. We have then 


(14) X, => NX» = 0, Ks = 1 


~. 0 


rt 


at P, and therefore by hypothesis we have (14) on R. 
For points of R at which \ # 0, (12) and (14) imply 


(15) T3,.u = FW3,.r = 0, 


while for points at which \ = 0, (2) implies (15), so that (15) holds at all points 
of R. Let fs(w) be a function analytic at P such that 


x3(u, v) = Rfs(w); 
from 
fi(w) = %3,.u — 1X3,» 


it follows that f;(w) vanishes on R and therefore f;(w) = 0. Then x3(u, v) = 0, 
and the theorem follows. 


* Theorems 3 and 4 could be proved also by a consideration of the stereographic projee- 
tions of the spherical images of the surfaces involved. 
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THeoreM 4. Let S and S’ be minimal surfaces given in isothermic representa- 
tion respectively by 


4; = 7;(u, v), y; = y;(u, v) j = 1, 2, 3, (u, v) in D 


and let the direction cosines of the normals to the surfaces at the images of (u, v) 
be denoted respectively by X ;(u, v), Y ;(u, v); of r;(u, v) = y,(u, v), 7 = 1, 2, 3, 
on a set R of points having a limit point P interior to D, and if X ;(u, v) = Y ;(u, v), 
j = 1, 2,3, ona set R’ of points with the same limit point P, then x (u,v) = y;(u, v) 
in D; that i is, S and S’ are coincident ourfaces. 


Proof. Assume the contrary, that > (x; — y;)? 4 0. We shall show that 


j=1 


our hypotheses are inconsistent, namely that if > (x; — y;)? = 0 on R, but 
j=1 


: 
>, (x; — y)® ¥ 0, then there is a neighborhood of P in which, except at most at 
gms 

3 


P, > (X; — Y;)? # 0, contrary to hypothesis. 


inal 
3 


If >> (X; — Y,)? ¥ 0 at P, the above contradiction of hypothesis is trivial, 
j=1 
3 


so we take > (X; — Yj)? = Oat P. 
j=1 
Transform the codrdinate axes in the (x)7223)-space as in the proof of Theorem 
3 


3, and do similarly in the (y,y2ys)-space, preserving >> (x; — y,)? = 0 on R, 


i=i 
and take the point P as pole of polar coérdinates: 
(16) = : r*(a;,. cos k@ + b;. sin k@) (j = 1, 2,3), 
k=1 
(17) yi = >> r*(Aj,x cos kO + Bj, sin ké) (j = 1, 2,3). 
k=1 


In the next several paragraphs, a-c, we derive certain relations between the 
coefficients in (16) and (17). 


3 
a. By Lemma 2, if ¢ is the first index for which >> (a?,, + b?,,) # 0, then ¢ is 
j=1 


3 
the first index for which a + BY) #0. 
j=1 
b. Since (14) holds at r = 0, (13) gives 
(18) by ,@3, — Gy,,b,, = 0, 
(19) b, ,4,, — 4,,6,, = 0, 
3 
(20) A, be, — M2,6,. = p> ai ~ 0. 
7 


Equations (18) and (19) are linear and homogeneous in a3, and b;,, ; sinee, by 
(20), their determinant of coefficients is not zero, it follows that 














TRIPLES OF CONJUGATE HARMONIC FUNCTIONS 703 


(21) a3 4 = bs, => 0. 
Equations (10) become then 
(22) ai, + a3, = bi, + b3, ~ 0, 
(23) Ady, + Ay,b., = 0. 
Now (20), (22) and (23) yield 
(24) bit = —a2,, be. = ay. 
Similarly, we have 
(25) By, saad —A;,, Boy = A,,, Bs, = A;, = 0. 
Making use of (21), (24) and (25), we get 
3 2 
(26) D (2; — yi)? = ap? (aj, — Aju)? + ow) | 
j7=1 j7=1 
3 
Since }> (x; — y,)? = 0on R, we obtain from (24), (25) and (26), 
j=l 
aj. = Ajn, bie = Bi (j = 1, 2). 
c. Let m be the first index for which 
3 
(27) ps [4m ~ Aj.m)? + (Dj .m ~ B;.m)*] = 0; 
j=1 


by b,m > t. Note that, for k = m + t, every term in (6) and (7) which in- 
volves an index greater than m involves also an index less than t, and therefore 
vanishes. This is true also of the corresponding equations involving the A’s 
and B’s. Subtracting this second pair of equations from the first pair respec- 
tively, and noting that, for k < m, a;,, = Aj,x, bj,x = B;,,x, we obtain equations 
which reduce by b to 


@3,t [(@1m — Aim) — (b2.m — Be,m)] 
+ 42, [(d2,m — A2m) + (bim — Bim] = 0, 
— G24 [(Qi.m — Aim) — (bem — Bem] 
+ y,¢[(@2,m — Azim) + (bim — Bism)] = 0. 
Since (22) holds, (28) and (29) yield 
(30) biym — Biym = — (2m — Asim), 


(31) De,m = Bom = Ahm —_ Besse 
A computation, simplified by (30) and (31), gives 


} (2; ae yi) =r | > (4) ,m aa Aim)* + ow) | 


j7=1 7=1 


(28) 


(29) 
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Another application of the argument used previously shows, therefore, that 
G;m = Aj,n, bjm = Bim (j = 1, 2); 
therefore, by (27), 
(32) (43,m — A3,m)* + (3m — Bs,m)* ¥ 0. 
d. We have 


x 


Tu = > (k oe 1) (4; 441 cos ké + Dj ea sin ké) y*, 
k=t—-1 

Li0 = > (kK +1) (bj.24. cos kO — a;441 sin k@) r*, 
k=t—1 


and similar expressions for y;,x, Yj,.o. By (i2) and the choice of m, 


mr 


x, + %,°-; —-@&, - 9,400 


t Zz a’ 


j=1 
where 
Mp = (,t Bsm — De,t Asm + Bs,t Ag,m — As,t Dg,m) COS (m — t) OB 
Ht (— 9,4 Asm — b9,t Bem + Gs,t g,m + Ds, Dg.m) sin (m — t) 8, 


and N, is the same expression in the A’s and B’s. In particular, for p = 1, 2, 
these expressions can be simplified by b, yielding 


Mi — Ni = [a2,¢ (b3,.m — Baym) — G1, (3m — As,m)] cos (m — t) 8 
+ [— ay,¢ (0s.m — Bs,m) — G2, (A3,.m — As.m)] sin (m — t) @, 

Mz — No = [— ait (bs,m — Bs,m) — G2, (G3,m — Asz,m)] cos (m — t) 0 
+ [—4e,¢ (03m — Bs,m) + i,t (@3,m — As.m)] sin (m — 2) 8, 


whence 


2 2 
> (X; — ¥)2? #@ SS a2, = m2 12 [(asm — Asim)? + (dsm — Baym)? + O(r)). 


j7=1 s=1 
Consequently, by (32), there is a neighborhood of P in which, except at P, 
2 


7 (X; — Y;)? #0. This contradiction of hypothesis establishes the theorem. 
j=1 


Tue Rice INsTITUTE. 











INEQUALITIES AMONG THE INVARIANTS OF PFAFFIAN SYSTEMS 


By DonaLp C. DEARBORN 
1. Introduction. Associated with any pfaffian system 
S: w* = af dr' = 0 (a = 1,2,---,r;i = 1,2,--+,n) 


are certain arithmetic invariants. Among these are the number r of inde- 
pendent equations in the system, the species o, the class p, and the half-rank p.' 
These invariants are all non-negative integers. 

The object of this paper is to find sets of inequalities which must be satisfied 
by these four invariants for any pfaffian system. If for every non-negative 
integral solution of such a set of inequalities it is possible to find a pfaffian 
system having that solution as its invariants, the set of inequalities will be 
called complete. 

In §2 sets of inequalities are found which hold for any pfaffian system. These 
sets are not, in general, complete sets. In §3 a complete set of inequalites is 
given for systems having equal species and half-rank. Included in this classifi- 
cation are all completely separable? systems, such as passive systems, systems 
consisting of a single equation, and systems having r — 1 integrals. Systems 
having rank two are considered in §4. It is shown that such systems have 
species one or two, and complete sets of inequalities are obtained. 


2. Inequalities satisfied by the invariants of any system. It is known that* 
p = o and that* p = r + o + 1 unless the system is passive. 

Since there are r independent equations in S, the system may be solved alge- 
braically for r of the differentials and put in the reduced form® 


(2.1) w* = dx* + A% dx* (a=1,---,r;A=rt+1,---,r+o), 
where S is assumed to be expressed in terms of the minimum number of differ- 
entials. The derived forms are then w’* = dA dzx*, which we write as 


Received June 4, 1936; part of a doctoral dissertation presented June, 1936 at Duke 
University. 

' For definition of class see E. Goursat, Legons sur le Probleéme de Pfaff, Paris, 1922, 
p. 268. For species see J. M. Thomas, Pfaffian systems of species one, Trans. Amer. Math. 
Soc., vol. 35 (1933), pp. 356-371. For half-rank see E. Cartan, Jnvariants Intégrauz, 
Paris, 1922, p. 59; Mabel Griffin, Invariants of pfaffian systems, Trans. Amer. Math. Soc., 
vol. 35 (1933), p. 931. 

? Griffin, loc. cit., p. 936. 

3 J. M. Thomas, A lower limit for the species of a pfaffian system, Proc. Nat. Acad. Sci., 
vol. 19 (1933), p. 913. 

* Thomas, loc. cit., footnote 1. 

5 Thomas, loc. cit., footnote 1, p. 362. 
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(@AF + 5A) dx*. Here 6A¥% represents the differential of A¥ formed on the 
assumption that 2', 27, --- , 2"** are constant. 
The condition that the half-rank is p implies that 


w' +++ w(@AS + 6AS}) da™ ... (@AXe + 5AS2) dae ¥ 0 


for some set of values a, ---,a,. As all such products are of degree r + 2p, 
we have the inequality p = r + 2p. 

In order that the half-rank of the system be p, it is also necessary that all 
products 


(2.2) Q*V*e" = w!... w(GAS! + 6AS) dx™ --. (0A fou + 5A Se) da** 
M+ +1 
vanish. Any Q*'’’“** contains a sum of terms 
wi... wt dx™... dr ron 6A%: ve 5A jen 
1 +1 


and this sum must vanish since any term from (2.2) involving a @A is of degree 
at least r + p + 2in dz',---,dzx"**. Ifo = p, these terms vanish identically. 
Consequently we suppose that ¢ 2 p + 1. In this sum are terms of the type 


dx! ..- dat dx™ ..- dxten 621° °° Fou §4% 2... §A Fn 
oi * ou th You? 


where 41, +: , @4: is a fixed set taken from r + 1, --- , 7 + o with a, ¥ an, 
and 6317": abt is the generalized Kronecker delta. Since the 5A’s contain no 


differentials with index less than r+ o + 1 and dz! --- dx’ dx™' --- dx®-n #0, 
(2.3) bitches BAG! «++ 6Afon = 0. 

This last group of terms may be interpreted as the determinant made up of 
the rows with indices a and the columns with indices a in the matrix || 5A ||. 
Since the elements are subject to the non-cummutative law 6Aé6B = —é6BéA, 
the same square array gives rise to two different determinants. We shall call 
the expansion (2.3) the down-determinant | 5A | and shall say that the matrix 
|| 6AX || has down-rank equal to p if every down-determinant of order p + 1 
taken from || 5A¥ || is zero while some down-determinant of order p is not zero. 
Here it is understood that determinants containing any number of repeated 
rows are to be considered. 

A necessary condition that the half-rank of (2.1) be p is that the matrix 
|| 6AX || have down-rank S p. Consider in particular the determinant in which 
a, = a = -*** = @ 1 = a. The down-determinant equated to zero gives 
6A, 5A‘, --- 6AT,,, = 0, forall l,, hh, +--+ , 1,41; that is, any SAX may be expressed 
as a linear combination of at most p of those on the same row. This being 
true for all values of a, there are at most rp independent 65A’s. Since the 
system may be expressed in terms of the corresponding A’s and z', --- , x"*’, 
the class p does not exceed r + rp + a. 

A better upper limit for the class may be obtained as follows. Any w’* may 
be written in the form 


w'* = nf wo + gigs + --- +¢5., 19%, (Sa S p, not summed). 
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Since some 2“! “* does not vanish, there are 2p of the ¢’s such that 


1 a a a a 
wes w Oe).-1P21, °°" 94-121, * 0. 
As every 2 with p + 1 indices is zero, the products 


1 rm ay a a é 8 
Ws W@W P27,-1921, °*° Po4 1% 27, 21-192 


' (no sum) 

st vanish f canis anal 1: that is Pe og ssible as a linear com- 
must vanish for every pair 8, 1; that is, every ¢3, is expressible as a linear com 
bination of the forms 


w', focal w", $21,—-19 $214) eae 4-1) #91,” g51-1- 
Hence there are at most r + rp + p independent forms. The number of equa- 
tions in the associated set of 2, Q* cannot exceed this number and consequently 
pir+rp+p. It is evident that this limit holds in case ¢ = p. 

If we designate by max (a, b) the greater of a and b, we may state 

THEOREM 2.1. The invariants of any pfaffian system satisfy the inequalities 


psa, max (r + 29,r+o+1)SpSr+rpt+op. 


3. Systems for which the species and half-rank are equal. In this section 
we shall prove 


THEOREM 3.1. If the species and half-rank are equal, then 
r+otpSpsrt+rptop 


is a complete set of inequalities. 

The result is clearly true for passive systems (p = o = 0). 

When the species and half-rank are equal and different from zero, it is clear 
thatr+o+p=r+2p >=r+p+1. Theorem 2.1 then shows that the 
class p satisfies the inequality of Theorem 3.1. It remains to be shown that 
for any positive integers r and p, there exists at least one pfaffian system of r 
equations whose class is any integer p satisfying the inequalities of the theorem 
and whose half-rank and species are the given number p. 

Any integer p satisfying the inequality of the theorem may be written as 
r+6p+7,2 58 87,0878 p. Leto’, x, --- , x7*+4** be a set of inde- 
pendent variables, and form the pfaffian system 


aw! a dz! 4 grtett dyrtt + ... 4 grttedyrte = 0, 

wl om dx'-! + grt p+l dx + oa + artbe date as 0, 
wo = dxh +4 zrtbetl dyrt! 4... 4 grtbete dyrtt = QO, 

wt! = drt! = 0, 
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This is a system of r independent equations. It has species at most p, since it 
is expressed in terms of r + p differentials. To determine the rank we examine 
the product 


Qo" ]e = w! +++ w'[ darter! dart! + «-- 4 dyrt? dxrte/’ 


+ pdr'.--- dx’ dat --- dytte dyttet! ..- dart, 


Since the z’s are independent variables, this product is different from zero and 
consequently the half-rank is at least p. Combining this with the result that 
the species is at most p and the inequality, half-rank < species, of Theorem 2.1, 
we have that the half-rank and species are each p. 

The characteristic system consists of the equations dz! = 0, --- , dx™*#e*" = 0, 
and since these form an independent set of equations, the class is r + Bp + +. 
The inequalities of Theorem 3.1 thus form a complete set. 

As pointed out above, this case includes all completely separable systems. 
Accordingly if the species exceeds the half-rank, the number of equations is 
always greater than one and the number of integrals less than r — 1. 


4. Pfaffian systems of rank two. Next to passive systems, the simplest 
systems are those for which the rank is two. From the results of §3 we know 
that if the species is one, r + 2 S p S 2r + 1is a complete set of inequalities. 
We shall show in this section that if the rank is two, the species cannot exceed 
two, and that if both rank and species are two, the class is exactly r + 3. 

We first prove 

Lemma l. If p = lando 22,thnp=r+o¢+1. 

Suppose the system is written in the form (2.1). Then from §2 we know 
that at least one 6AX¥ # 0 and that the matrix || 64% || has down-rank one. By 
renumbering the equations and variables 6A}.., can be made different from zero. 
If 5A',, 6A% = 0 for all values of a and X, the class is exactly r-+o¢+1. From 
§2 we know that 6A, 6A¥, = 0 (no sum) for all a, Ay, Az; that is, if for some Aj, 
bAX, ~ 0, we have 6AX, = nX,6AX, (no sum) for all 42. Suppose for some a # 1 
there is a 6A% for which 6A!,, 6A¥ # 0. The down-determinant 


6At., 8AX|] |8At,, 226A2,, | 
|8A%,, 8A¢| |64%,, 84g | 


must vanish. Since 6A!,, 6A{ is not zero, 6A!,, 6A%,, is not. Consequently 
all the independent 6A’s may be taken in the first column. 
Now consider any two rows of the matrix || 6A¥ || for which 
nN 8 
5A%,, 6At,, € 0. 


These may be written 


(OAT. | omtae OATS, Ey? Nr +0 OAT +1 | ‘ 
; : f ae (no sum). 
6At i, +2 OAT, i Nr+e OAr41 





«mb. St Oa EE 


eS yr rn ae ei 
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Since every down-determinant of order two must be zero, we have nf = nf 
for all \. This must hold for every pair a, 8 for which 6A%,, 5A%,, # 0. 
Hence if there is any such pair, all the n’s are independent of a, except for those 
rows on which all the 6A’s are identically zero. 

Suppose now that the system (2.1) has half-rank 1, species ¢ = 2, and that 
there are more than one independent 6A’s. We shall show that it is possible 
to adjoin to the system o — 1 equations giving a passive system. This will 
contradict the hypothesis that the species is ¢. We adjoin first the equations 


dx™*3 = dyt*+4 = +--+ = dyt*" = 0. 
The 2’s for this system will be of the form 
w! .-- w (@A% dxrdx™*® ... darts + 6A% dxddzt** ... dat), 


A single equation may now be added which will cause these to vanish and thus 
render the system passive. To demonstrate this, we adjoin the equation 
dy = 0, where y is an undetermined function of z', 2°, ---, 2"*¢ only. The 
terms containing dA¥ will vanish, since they will be of degree r + o + 1 in 
r + o differentials. The function y must then be determined to satisfy the 
equations 


w' .-- w(6AS,, datt! drt ... datte + 6A%,, dxt*? dat? ... dzt+*) dy = 0. 


For those values of a for which 6A%,, = 0, there will be no equation. In the 
remaining equations 6A%,, may be replaced by 7,.2 6A*,, as indicated above. 


Since the terms 6A%,, contain none of the differentials dz', --- , dx’**, they may 
be factored out of the equations and every equation in the set then reduces to 
the single one 


wt .-+ w(dartt darts «2. dxtt™ + nye da’*? dat* ... dat**) dy = 0. 


This gives rise to the partial differential equation 


2 Ata math — ate) 20 
oxrt? —eee ax Mr+2 ox7t ers Ox 


The desired function y must also satisfy the inequation 
w! .-- w dzt3 ... dytte dy ~ 0. 
This requires that one of the expressions 


ay 
Ox? 


ge DY i] ga (OY 
«4r+2 are’ ax"? 43 ¢+1 ax 





be different from zero. Since 7,2 ~ 0, a solution will exist and that solution 
will render the system passive. It follows from this contradiction that if 
o 2 2, there can be only one independent 6A and consequently p = r + ¢ + 1. 


With the aid of Lemma 1 and a result of Miss Griffin, we may now prove 
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Lemma 2. If p = 1, then (o — 1)(o — 2) = O. 
Miss Griffin has shown’ that any system for which p = 1 may be put in a form 
for which either 


wt =O mod (w',---,w") (¢€=1,2,---,r— 3), 


w?" = gige mod (w!, --- , w’), 
(4.1) 

ww’! = gy, mod (w!,--- , w"), 

w = ¢g31 mod (w!,--+ ,w"), 
or 

, 

w’* 20 mod (w!, --+ , w”) (e= 1,2,---,r), 

(4.2) 


wo =P mod (a, --- , w”) (A= r'+1,---,7). 


The first type is of class r + 3 and therefore has species < 2. The second 
type will be shown to have species < 2. 

If y’ vanishes by virtue of the original system augmented by the equation 
¢ = 0, adjunction of this equation will render the system passive and show that 
the species is one. Suppose then that w!--- wy’ # 0. The congruences 
(4.2) imply that w® = gy + n\ w*. .Since the derived form of a derived form 
is identically zero, we have 


(4.3) [wo = oP — ob + n.* ot — nwt = 0 (all d). 
Multiplying by w! --- w’g gives 

(4.4) w--- wyy'y = 0 (all \); 
that is 

(4.5) ¢g = YO mod (w!, --- , w’, ¢) (no sum). 


The product YY is not zero. Since (4.4) and (4.5) hold for all values of X, 
¢’ = YY. This implies that there can be at most two independent y’s and 
consequently the class is at most r + 3 and the species at most two. It should 
be noted that if the species is two there are always at least r — 2 2°’s which 
vanish. 

Lemmas 1 and 2 now furnish the proof of 

THEOREM 4.1. The invariants of a pfaffian system of rank two satisfy one or 
the other of the relations 


(a) o = 1, r+2sps2r+1; 
(b) o = 2, p=rt+3, r> i; 


and in either case the set of inequalities is complete. 
That the invariants satisfy one or the other of these relations is shown by the 


6 Griffin, loc. cit., p. 931. 
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lemmas. The necessity of the inequality r > 1 in case (b) was pointed out 
at the close of §3. In case (a) the completeness of the set is given by Theorem 
3.1. It remains to be shown that for any integer r > 1, there exists at least 
one system having r equations, p = 1, ¢ = 2, and p = r+ 3. The system 


w! = dr! + a2 dz’? + 2? dr‘ = 0, 
w? = drt + dz? + 2' dx’ = 0, 
wi =dzri'=0 (¢ = 6,7,-:-,7r +3) 


satisfies these conditions and thus shows the completeness of the set (b). 

A similar theorem for any value of the rank would be of considerable interest, 
for in most cases it is easier to compute the rank and class than to compute the 
species. It is likely that the species never exceeds p + 1 and that when it 
has this value p = r + o + p, but no proof of this conjecture has been obtained. 


DvuKeE UNIVERSITY. 














ON THE FOURIER TRANSFORMS OF DISTRIBUTIONS ON 
CONVEX CURVES 


By E. K. HaviLanp AND AUREL WINTNER 


In a previous paper,' the asymptotic formula for the Bessel function J» has 
been applied to the derivation of smoothness properties of infinite convolutions 
of circular equidistributions. For this purpose, not the asymptotic formula 
but merely an appraisal was needed. It has been indicated in that paper* that 
the same method is valid also in the case of infinite convolutions of certain 
distribution functions along convex curves—in particular, in the case of some 
asymptotic distribution problems connected with the Riemann zeta function. 
The necessary appraisal of the function corresponding in this more general 
case to the function J» has then been carried through*® by a simple application 
of a lemma of van der Corput and Landau.* The object of the present paper is 
to replace this appraisal by an asymptotic formula. While the former cor- 
responds to Jo(r) = O(r-+), r — ~, the latter will be a generalization of 


Jo(r) = r-i(2/m)! cos (r — 2/4) + O(r), r—> @, 


The general function to be considered is, in contrast to the particular function 
Jo(r), a function not only of r but of an angular parameter y also. For a fixed 
value of ¥, the asymptotic formula in question may be obtained by applying an 
elementary method.5 What is needed for the applications mentioned above, 
and what will be proved in what follows, is the fact that the asymptotic formula 
holds uniformly for all values of y, i.e., that the error term is in absolute value 
less than Cr~'!, where C is a constant independent both of r and of y. 

Let « = x(¢), y = y(y), where 0 S ¢g < 2x, be a parametric representation 
of a convex Jordan curve S in the (x, y)-plane. It will be described more pre- 
cisely below. Let ¢ = o(E) be an absolutely additive set function defined, 
for every Borel set EF of the (x, y)-plane, by setting o(F) equal to 1/(27) times 
the linear measure of those ¢ for which (x(¢), y(¢)) is contained in F'S, if F is any 
open set in plane. In particular, it is seen that S is the spectrum’ of o. By 


Received April 2, 1936. 

1A. Wintner, loc. cit., 1. The references are collected at the end of the paper, 

2 A. Wintner, ibid., pp. 328-329. 

3B. Jessen and A. Wintner, loc. cit., Theorem 12, p. 63. 

*(Cf., e.g., R. Kershner, loc. cit., where further references are given. 

5 Cf. A. Wintner, loc. cit., III, pp. 57-60, where references to the literature are given. 

* For the definition of the spectrum, together with some properties of spectra, ef. A. 
Wintner, loc. cit., II, pp. 9-10, and E. K. Haviland, loc. cit., Il, pp. 653-654. It has been 
pointed out by Professor Khintchine that, contrary to statements in these papers, the vec- 
torial sum of two closed sets is necessarily closed only when at least one of these sets is 
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virtue of the definition of Lebesgue and of Radon integrals, it may be shown’ 
that 


4-90 +00 
A(u, v; ¢) | [ exp[i(ur + vy)]dzyo(E) 
(1) soil ial 


~—. [ exp[7(ur(y) + vy(¢))] de. 
2r 0 


On setting u = r cos y and v = rsin y, one obtains 


(2) A = A(r cosy, rsiny;¢) = AJ [ exp[irh(¢; ¥)]dg, 
2r 0 

where 

(2a) hig; ¥) = x(¢) cos ¥ + y(¢) sin ¥. 


It will be assumed that 
(i) 2(¢) and y(¢) possess continuous fourth derivatives; 
(ii) h’’(y; ¥) has for any fixed y exactly two zeros on the curve S and these 
zeros are both simple. Here and in what follows a prime denotes partial dif- 
ferentiation with respect to ¢. 

Under these assumptions, it will be shown that 


A = (2er)4{R" (eal); WI exp [i(rh(es); ) + ¥/4)] 
+ [=h"(eW); WF exp li(rh(ew); ¥) — #/4)]} + 00>, 


where the O-term holds uniformly for all ¥, and ¢g; = ¢:(¥) and ¢3 = ¢3(y) de- 
note the two zeros of h’ on S. That, for any fixed y, there are precisely two 
such zeros and that they separate the zeros of h’’ is a consequence’ of (ii). 

The proof of (3) proceeds as follows. First, the minimum distance between 
a zero of h’ and a zero of h”’ has, for reasons of continuity, a positive lower bound 
€ independent of y. If y is fixed, one of the zeros ¢ of h’(y; ¥), say ¢ = @ = 
¢i(W), corresponds to a maximum of the function h(¢; ¥), the other, say ¢ = ¢3 = 
¢g3(¥), toa minimum. Let ¢o(W) and ¢s(W) be the zeros of h’’(¢; ¥) and let them 
be so situated that g: < g2 < ¢s < gi < gi: + 22. Finally, let four numbers 
n; be so chosen that 


(3) 


1 < m < ge < ne < ¢3 < 93 < gs < m < G1 + Qe. 





bounded. Correspondingly, what is actually proved, loc. cit., is not that the spectrum 
of the convolution of two distribution functions is the vectorial sum of the spectra of the 
individual distribution functions, but.that it is the closure of this vectorial sum. This 
does not at all affect the validity of the proofs given in the papers referred to. 

7 Cf. E. K. Haviland, loc. cit., I, pp. 552-553. The reasoning there used in the proof of 
Theorem V may be used unchanged to prove equation (1) of the present paper. 

8 (ii) might be generalized under suitable assumptions to the case where the second 
derivative has more than two zeros and has multiple zeros.° The case treated here is the 
one of interest for applications to infinite convolutions of the type occurring in the dis- 
tribution theory of the Riemann zeta function. 
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As h’’’(y; ¥) is continuous on the torus T: (0 S ¢ < 27;0 S ¥ < 2n), 
| h’’’(y; ¥) | is bounded, say < M, there. Moreover, it is clear from (ii) that 
there exists a positive constant a such that | h’’(¢i(¥); ¥) | > @ for every y. 
Then one may choose 7;(y) so that it lies between ¢:(¥) + 3¢ and ¢:(y) + 3¢ for 
all y, where ¢ = min (f, a/(2M)) is independent of y, and a similar choice 
will be made for the other three 7,’s. 

From (2) 


ca Ma A a A 


=J,+Jdn+4Ium+4v+dv 4+, 


say. 
In order to treat J,, set, for g; S ¢ S m, where g, = ¢)(y) and m = m(¥), 
(5) f= h(i; ¥) — hy; ¥) 


for every fixed ¥, corresponding to the fact that ¢: is a simple zero of h’ by 
assumption (ii). On taking the positive square root, 


(6) t= |h(gi; ¥) — hy, ¥) |, 


so that as ¢ increases steadily from ¢g; to 1, the variable ¢ increases steadily i 
from zero to a quantity ai(y) = | h(¢i(y); ¥) — A(m(W); ¥) |! which has a 
positive lower bound 8 independent of y in virtue of assumption (ii). 

Moreover, if a dot represents partial differentiation with respect to ¢, 





(7) g = —2t/h'(olt, vy); ¥), if0 <t < a,(y), 


sO 


ai(¥) 
(8) J; = ot exp [irh(gily) ; wif exp [—irt’] t/h’(g(t, y); p) dt. 
TT 0 


a,(¥) 
The integral in (8) is of the form I f(t; ¥) exp [—irf®]dt and may accordingly, 
0 


for every fixed y, be evaluated asymptotically in a known manner® under the 
assumption that f(t; ¥) possesses a continuous second partial derivative with 
respect totinO <¢S a,(y). That the function 


(9) ft) = helt, Wi») 


possesses this last property may be seen as follows. 
By Taylor’s Theorem with the integral form of the remainder 


h(¢; v) _ h(e) = h(¢x) + (y = ¢i)h’(¢1) a (oe —_ ¢1)?h’’(¢1) 
(10) 


“f } | “ho, tL s)(y = s)? ds, 
0 
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where ¢: = ¢:i(¥). Since h’(y) = 0, we have from (6), on changing the inte- 
gration variable in (10), 


¢ 4 
(11) t= {ie — 1)? h'"(gi) — 7 h’'(s)(e — sds) 


| 
1 / 


Applying the same form of Taylor’s Theorem to h’(y, ¥), we have 
"¢ 

(12) We) =H) = (— oho) + [" We — 8)a, 
#1 


Now ¢ is a continuous function of ¢ for fixed y, as may be seen from (5), since ¢ 
is a monotone continuous function of ¢ for gi(¥) < ¢ S mW) and h'(¢; ¥) is 
negative there and has a simple zero at ¢ = ¢:(¥). Then substituting (11) and 
(12) into (9) and writing the parameter explicitly, we obtain 








f hol): W fs ( | s — gil) Jas} 
) — 3h” sv) — 3 "(g- 1—- 's 
f(t) = f(t; y) = \ ” - . v : ei(¥) , ; id : g(t, vy) _ ew) f 
’ 2 ett, ¥) ms re oily) 
holy); ¥) + h’’""(s; ¥)j 1 — —— ia F™ 
(13) ey) g(t, ¥) — gly) 
= {—MAY + Hy)}*/{ht + Hy}, 
where 
¢(t.y) 
Hi; = elt, ¥) — dW i h'"(s; piel, ¥) — sks (i = 1,3; = 1,2), 
¥i(¥) 
and 


KM? =h@(y);¥) (¢ = 1,3;1 = 0,1, 2,3, 4). 


Now the 1/;;, 7 = 1, 2, are, for fixed ¥, continuous functions of ¢ for ¢:(¥) < 
¢ < m(v). Moreover, 


(14) | Hi | S @ — e) M, 


where M is the maximum of | h’’’(¢; ¥) | on the torus T, so that the H,; are, 
for fixed ¥, continuous functions of ¢ in ¢:(¥) S ¢ S m(¥) also. Since 0 
¢—¢ <¢ S a/(2M) forall gin gy) S ¢ S m(y), ie., for alltinO s ¢t 
a,(y), and for all y, it follows from (14) that | Hi | < a, so that | hk? + Wi! > 
3a there. Because ¢ is, for fixed y, a continuous function of tin 0 S t S a,(y), 
it is seen that f(t; ¥) possesses this same property. Then as ¢ = —2f, it is 
clear that ¢ tends to a definite limit as t + +0, which implies that ¢ exists, and 
(7) holds, at £ = O also. Differentiation of (13) with respect to t, substitution 
for ¢ from (7) and (13), and suitable grouping of the factors (¢ — ¢,;)~ in the 
result gives 


(15) f= (hi + Au} hd + Mull~n — La) — [A + Wilh}, 


where 


s 
= 


e(t.¥) 
Lis = [o(t, y) — eye» | h’"(s;y)[s —¢(W)¥ds (i = 1,3;k = 1,2,3). 


ei(¥) 











716 E. K. HAVILAND AND AUREL WINTNER 
In a similar manner, one obtains 
f= (Al + Ay} {-—HAY + Ae) PAT + Ma [-2L,(Lyo — Ly) 
+ SLis(Lie — Lu) + A1Q — (Lie — 2Lnr + Lid” 


(16) . 
+ 2hyLi(Lio — 221 + Lv) — Liv(Lur — L12)] 
—_- 8LifAiihi “ Hy,Lye2 a As Las —_ Hy2L)]}, 
where 
3 i . h’’"(glt, ¥); ¥) 
Q = e. h’"(s; ¥)is — o(y)Pds — V5 
? lo ¥) — a) Jeu Wis — e()] o(t, ¥) — oily) 


Now the Liy,., k = 1, 2, 3, are, for fixed yY, continuous functions of ¢ for g:(y) < 
¢ = m (wv). In addition, it follows from the continuity of h’’(¢) at ¢ = g = 
gi(v) that 


h’’’(¢) == h’"'(¢1) + w(¢); | w(y) | < €, if ie — ¢1 | ae 
Then 
h’''(o ) 1 r’ . 
Ly = *_) ay ee 
? k+1 + (y — ¢)** [ w(s) ¢i)* ds, 
so 


| Lu — h''(er)/(k +1) | <6, if |e —¢i| <5. 


Consequently, the Ly), are, for fixed Y, continuous functions of ¢ in the closed 
interval g:(¥) S ¢ S m(y). 
Furthermore, 


Q = 3(¢- of h’’"(s)(s — gi)? ds — h’’'(¢)/(e — ¢). 


1 


By partial integration, this becomes 
? ? ' 
—(y — af h“*)(s)(s — gi)? ds = —(g — of [h“(g,) + on(s)](s — gi)* ds, 
Fl v1 
where | w:(¢) | < € provided |g — g,| < 6,. Then 


? 
Q = -ih™@) — & - of wi(s)(s — ¢1)* ds, 
ei 
and the second term of this expression is in absolute value <e, provided 
|e — gi| < 6,. Hence, for fixed ¥, Q is a continuous function of ¢ in 
gi(¥) S ¢ S m(¥). In addition, the denominators in (15) and (16) are respec- 
tively greater in absolute value than a°/8, a°/32, both of which are positive, 
and these inequalities hold uniformly for all y. 
Inasmuch as ¢ is, for fixed y, a continuous function of tin 0 S t S a,(y), it 
follows from the preceding remarks and from (i) that 
(16a): f, f and f are for fixed y, continuous functions of tin 0 < t < a,(y). 
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Finally, | Hi; | <. 24M, j = 1, 2; |Lu| S M; |Q| s N, where N is the 
maximum value of | h“(¢; ¥) | on T; and | hk? + Ai, | > $a, forall tin0O < 
t < a,(W), where M, N and a are independent of y. Consequently, 

(16b): if 0 < t S a,(y), f, f and f are bounded uniformly with respect to y. 

On placing 


(17) tg(t;¥) = fit; ¥) — 0; ¥) — YO; y), if 0 <t < a,(y), and g(0; ¥) = 0, 


it follows that g(t; ¥) possesses a continuous first partial derivative with respect 
to t not only for 0 < t S a,(W), where this can be seen at once, but for 0 < ¢ S 
a,(y) as well. In particular, 


S(t; ¥) = f(O; ¥) + [ f(s; ) ds, 
g(t; y) =; I f(s; ¥) — (0; y)]ds 


g(t; ¥) = +f [f(t; ¥) — f(s, PJ ds 
(19) . 
af, T(s + dt — 8); ¥) (ts) ds. 


The quantity 3 depends on ¢t, s and y, but is such that 0 < # < 1 for all 
05s 5t;0 St S ay); 0 S y < 2x. In view of (16a), (19) implies the 
above-mentioned continuity of g as a function of ¢, for fixed y. Furthermore, 
(16b), (18) and (19) imply 
(20a): g(t; ¥) is for 0 S t S a,(W) bounded uniformly with respect to y; 
(20b): g(a,(W); ¥) is a bounded function of y. 

Moreover, by (6), 


(21) aly) < V2 


where u is the maximum of | h(g; ¥) | on the torus T. 
It will now be shown that for every y and for every r > 0 


| a;(¥) 
(22) | i tg(t; ¥) exp [— irt®] dt | < C,/r, 
0 


where C,, like the quantities C2, ---, Cs; Cr, ---, Cyr, to be used in what fol- 
lows, is a constant independent both of y and of r. First, on applying partial 
integration, the integral in (22) may be written in the form 


a;(v) ail) t 
gla(y); y) / t exp [—iré] dt — i g(t; v) / y exp [—iry’] dy dt 
0 0 0 
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or 
’ ail(y¥) 
(23) rt | caxty) wv) F(ar') — / g(t; y) F(tr') at] 
0 
where 
t t4 
F(t) = / y exp [—7y*] dy, hence F(@) = 4 / e~¥ dy, 
0 0 


from which it is seen that 
(24) F| <1. 


From (20a), (20b), (21) and (24) it follows that the quantity in brackets in (23) 
is in absolute value less than a constant C;. This proves (22). 
Again, 


“ail¥) 
(25) | t exp [—irt] dt) < C2/r, 


Jo ° 


for on placing r! = y, this integral becomes F(r! a,(W))/r, which implies (25) 
in view of (24). 
Finally, 


(26) ° / exp [—7r] dt) < C3/r. 
a,(¥) 


For G.(r) = | y exp [—7y] dy exists and is O(r~) in virtue of the second 


mean value theorem applied to a finite interval. Setting r? = y, the integral 
in (26) becomes, up to a constant factor, 


r—Ge (rla;(y)}?) = O(r), since a,(y) > B > O for all y, 


where 8 is the constant defined above, following equation (6). 

Substituting (17) and (25) into (22) and combining the result with (26), one 
obtains, in virtue of (20a) and the fact that f(0, ¥) and f(0, ¥) are bounded 
functions of y, 


| ai(y) +2 ] 
(27) | [ f(t; ¥) exp [—irf] dt — f(0; y) i exp [—7rt] dt | < C,/r. 
| 0 0 
Since? 
I exp [—iz’] dr = 3 x exp[—77/4], 
F 


one obtains from (8), (13) and (27) 
(28) | Jr — $[—2rh’’(o,(y); Yr}! exp [i(rh(gi(y); ¥) — 2/4)] | < C:/r. 


® Cf. e.g., W. F. Osgood, op. cit., pp. 308-309. 
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To calculate the integral J1;, we observe that h’(¢; Y)is negative for 
mv) <= gy S no(v), 


so that h(m(¥); ¥) — A(y; ¥) is in this interval steadily increasing from zero, 
and if we set 


t= |h(mW);¥) — hy; ¥) |, 


t increases from 0 to ae(¥) = | h(m(W); ¥) — A(ne(W); ¥) 3 as ¢ increases from 
m(W) to ne(W). By the introduction of ¢ as integration variable in J7,, 


a(y) 
Ju = — ~ exp [irh(m(y); ¥)] / exp [— irl] t/h'(e(t, ¥); y) dt. 
0 


aly) 
S(t; ¥) exp [—irt’] dt, where 


This last integral is of the form / 
f=fGW = t/t); . 
Differentiation with respect to ¢ and substitution for ¢ from (7) gives 
SOW) = WG YH} (W' OG Y); WF + 2A" Y); W)}. 
Thereupon a similar straightforward calculation shows 
FEW) = WL 5 WI [APR 3 Whol, WW) 
+ 6th'(olt, YD; WPA’ WY); ¥) + 126A" Y); WF}. 


Just as m(W) has already been so chosen that 4 ¢ < m(W) — ¢i(y) < 7 £, one 
may so-choose n2(W) that 3£ < ¢s(¥) — ne(W) < 3 £, where ¢ is independent of y. 
Then from (i), (29), (2a) and the continuity of ¢ as a function of ¢, it is seen 
that f(t; y) is, for every fixed y, a continuous function of ¢ in 


(29) 


0<¢t< ay) S V2, 


where yu, as before, is the maximum of | h(g; ¥) | on the torus T. Moreover 
}(t;¥) andf (t;¥) are for 0 < t S as(¥) uniformly bounded with respect to y. If 
g(t; ¥) be again defined by (17), these facts imply that 
(30a): g(t; ¥) is for 0 S ¢t S ae(W) uniformly bounded with respect to y; 
(30b): g(ae(W); ¥) is a bounded function of y. 

Finally, f(0; ¥) = 0/h’(m(W); ¥) = 0 for all y. By the same reasoning as 
that used in the calculation of J;, it then follows that 


(31) Jin | < Ci/r. 
To calculate the integral Ji, we set 


t= |h(g;¥) — h(gs(y); ¥) |}. 
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Then 

1 ey) ; 
Jin = — exp [irh(¢; ¥)] de 

2m Jay) 

1 ' as(¥) 
| lirh(gs(y); ¥)] i f(t; ¥) exp [ire] dt, 
0 
where 
ft; —) = t/h'(elt, WW) = — {EAS + Ha)}'2/(h's + Hai). 


A calculation precisely similar to that of J; then shows that 
(32) | Jur — 3 [2ah’’ (gay); Yr}? exp [i(rh(es(¥); ¥) + 2/4)] | < Crn/r. 
Similarly, if in Jry we set 
t= |h(y;¥) — hes); ¥) |, 


we find 


agl(y) 
Jw = * exp [irh(gs(y); wif S(t; ¥) exp [ir] dt, 
where 
ft ¥) = U/W (ot, 3 ¥) = (2 UWS + Hal)¥/(hs + Hs). 
A calculation precisely similar to that of I and III then shows that 


(33) | Jn — } [2rh’’(es(Y); Yr]? exp [i(rh(es(¥); ¥) + 2/4)] | < Crv/r. 


The case of Jy is similar to that of J. Here we set 
t= |h(g;¥) — Ans); ») |, 
so that 
1 as(v) . 
Jv = — exp [irh(ns(y); ¥)] | f(t; ) exp [irt®] dt, 


where f(t; ¥) = t/h'(e(t, ¥); ¥). Since f(0; ¥) = 0, it follows, as in the case of 
Ji, that 
(34) | Jv | < Cy/r. 
Finally, if one sets in Jyr 
t = | h(gid); ¥) — Ae; ») | 


one obtains 


ad) 
Jui = * exp [¢rh(g: (y); wif S(t; ¥) exp [—irt’] dt, 
0 
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where 


Sty) = tvebyin = -—t- 


[hy + Hse]}4/(ht + Hl, (es = ¢1 + 2r). 

Then reasoning very similar to that used in the case of J; shows that 

(35) | Jvr — 3 [—2ah"(ei(¥); H)r}-4 exp [i(rh(ei(¥); ¥) — 2/4)] | < Cvr/r. 
Combining (28), (31), (82), (33), (34) and (35), we obtain (3) by virtue of (4). 


NIK 
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FUNCTIONS REPRESENTABLE BY TWO LAPLACE INTEGRALS 


By D. H. Batiov 


1. Introduction. One of the properties of the Laplace integral representa- 
tion of a function f(z), 


(1) f(z) = [ e~* o(t) dt, 


70 


is the uniqueness of the determining function g(t) when that function is con- 
tinuous.' It has been pointed out by G. Doetsch? that if a function f(z) may 
be expanded into two different series the terms of which are representable by 
Laplace integrals and if the term by term transformation of those series is per- 
missible, then this function is represented by two different Laplace integrals. 
Furthermore, there will follow from the uniqueness property the equality of 
two new series, the determining functions of the integrands. 

It has been known that the cotangent was one function capable of such a 
representation,* for it has series developments both in terms of partial fractions 
and of exponentials: 





1 . 2 
(2) ctnz=-+42 +> yee wer 
ra pn o- ie 
— 
(3) ctn z = -i(1+2 es) 
n=1 
——— . _,ctn /—s , 
Now if we take the functiont — Y—* the terms of the series are representable 


— —§ 


Received May 11, 1936; presented to the American Mathematical Society, September 1, 
1936. 

1 See, for instance, D. V. Widder, A generalization of Dirichlet’s series and Laplace’s 
integrals by means of a Stieltjes integral, Transactions of the American Mathematical 
Society, vol. 31 (1929), p. 705. 

2 G. Doetsch, Uberblick tiber Gegenstand und Methode der Funktionanalysis, Jahresbericht 
der Deutschen Mathematiker-Vereinigung, vol. 36 (1927), p. 28. 

2G. Doetsch, loc. cit. See also H. Hamburger, Uber einige Beziehungen, die mit der 
Funktionalgleichung der Riemannschen ¢-Funktion dquivalent sind, Mathematische An- 
nalen, vol. 85 (1922), p. 129. 

‘ Here and throughout this paper that branch of the double-valued function z = /-s 
is taken which corresponds to the upper half of the z-plane. We then are dealing only 
with single-valued functions and we are assured of the convergence of (5) for R(s) > 0. 
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by Laplace integrals, and the term by term transformation gives us the result 
that 


(4) wv = . + >> ae a [ el! 1 + 2)>) e7nrt dt, 
_ —s 8 — iia J0 


r= n=1 











-v-8 Fe Swe 


where the integrals on the right converge for R(s) > 0. Thus this form of the 
cotangent is representable by two Laplace integrals. The uniqueness property 
establishes the equality of the series of the integrands 


o ea. Jas Py jy a | «| +t ie. 2. |« 





(6) 1 + 2\* enw rt = a + 2 > Be ah 
v4 Vv al ron | Vat’ 


and this equality proves to be the linear transformation of one of the theta null- 
functions, 03 . 

It is the object of the present investigation to determine what other functions 
besides the cotangent are thus representable by two Laplace integrals. In 
order to obtain a series development in terms of exponentials for our generating 
function f(z), we considered functions which were simply periodic, and to obtain 
a partial fraction development we took these functions as also meromorphic. 
By considering the nature and position of the poles in the period strip we have 
found two classes of functions which admit of the desired representation. As 
special cases of these are included, besides the cotangent, the other meromorphic 
trigonometric functions: the tangent, cosecant, and secant, and also the cor- 
responding hyperbolic functions. As special cases of the equalities of the deter- 
mining series are the linear transformations of the four theta null functions. 

In what follows we shall be considering functions simply periodic with prim- 
itive period y = re*’,0 S ¢ < x. For convenience of statement we define as the 
primitive period-strip the strip of the z-plane between the straight lines per- 
pendicular to the vectors z = y/2 and z = —y/2. Moreover, we include both 
these straight lines as part of this primitive strip. 


2. A preliminary lemma. We first prove the 

Lemma. Let f(z) be a single-valued meromorphic simply periodic function 
with primitive period y. Further, let f(z) be bounded at both ends of the prim- 
itive period strip and let its poles in this strip be simple and located in the points 
M@,*:+,a,. Then 


(7) f(z) = > a= * etn : (z — a) + C, 
i=) 
where c; is the residue in the pole® a; and C is a suitably chosen constant. 


5 According to our definition of the primitive period strip, c; is the residue in the pole a, 
provided that a; + y/2 or —y/2, in which cases c; is half the residue. 
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Consider the function ctn z. Its partial fraction representation is® 


1, yvf{l 1 


the series converging absolutely for all z # nz and uniformly in any region not 
including any of the points nr. Hence 


ae 


T T Ci , Ci Ci 
c¢; — ctn - (2 — a;) = —— » vieanagmameetianiaaaiae Be 
"¥ . ) e+ Das ny) 


n=—o 


is a function, single-valued, simply periodic with the period y, and having 
simple poles in the points a; + ny (n = 0, +1, +2, --- ) with residue c; at 
each. Then 


k 

F(z) = ¢;~ etn = (z — a) 
> Y 7 

is the partial fraction development of a meromorphic function with poles in the 

points a; + ny (n = 0, +1, +2,---, i = 1,--+-,k), and poles in no other 

points. But these are the only points in which f(z) has poles. It follows then 

from the Mittag-Leffler theorem’ that 


f@) = F@) + G@), 


where G(z) is an entire function. Now the only singularities of f(z) in the 
primitive period strip are its poles in the points a, --- , a, and these poles are 
all contained in F(z). Moreover, f(z) and F(z) are bounded at both ends of a 
period-strip. Consequently, G(z) can have no singularities at all in a period- 
strip and is bounded at both ends. But since f(z) and F(z) are both simply 
periodic with the period y, G(z) must be also. It follows that G(z) is a con- 
stant, and the lemma is established. 


3. The first class of functions. Our first theorem is the following. 
TueoreM A. Let f(z) be a single-valued meromorphic simply periodic function 
with the primitive period y and bounded at the ends of a period-strip. Further, let 


N 
2c; 2 
(8) f(z) - ») ee OSS 3; 4D) 
i=1 ae 


be analytic in the primitive period-strip. Then, when y = a, real, 


pa ee C ce i] 
(9) fi =s) = € _ | ety (1) dt = | e'y,() dt (R(s) > 0), 
—_ /— s 0 0 
6 The prime on the summation sign means that the value n = 0 is to be omitted. 


7 See, for instance, W. F. Osgood, Lehrbuch der Funktionentheorie, 5th ed., 1928, vol. I, 
p. 565. 
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where C is a suitably chosen constant, and 


= 2 Ss > Cpe tata 


i=1 na=-—o 


io = 2 V ~ => > ¢; COS (Qnepe ° 


i=l n=—o 





(10) 


-== 


Corotuary A.l. When y = ia, pure imaginary, 


(11) f(= tows) of c -[ ety (t) dt = | e-ty,(t) dt (R(s) > 0). 
Coro.uuaryY A.2. 
(12) g itere® -f5> cos (Qnxp)e “* (Os ps }). 


First of all we note that since 


ee 1 
2— py z+py2—p’ 
the condition that (8) be analytic is equivalent to saying that in the primitive 
period-strip f(z) has only simple poles which occur in pairs (except when p; = 0), 
with equal residues c; , in the points z = +p,y, symmetric with respect to the 
origin. When p; = 0, there is a single pole at z = 0 with residue 2c; . 
First let us consider the case when there is only one such pair of poles, and let 


fi(z) = f(z) when N = 1, (1 =, — = Pp. 


Then, by our lemma, 








(13) fi(z) —C = F(z) = c~ etn = (2 + py) -¢>- Py). 
Y 7 7 Y 


Using (2), we have 


F(z) =c- latemt > ay (z + py) 








C+) * See tp) — we 
(14) « 
Pee sare 2ry(2— py) _| 
wy*(@—py) <4 wy * (2 — py)? — na)” 


Moreover, adding these series together term by term, we are able to write F(z) 
in the form 


(15) F(z) = 











a set D(z cer +s=atare)) 
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Using (3), we may also write 
2nzr 


(16) F(z) = —i2e7/142 >) cos (2nep)e ” 
ef 


{ 
n=1 ) 





iz 


Thus, when y = a, real, 











Fix/—s)_, | 1 + ( 1 1 )) 
—Vas |s+ Pat »2 s+ pra s+ + pe) | 
(17) | tae 





8 


2 ) 

ae | € | 

c-i—= +93 > cos (2nxp) ————_}. 
a Vs n=1 Vs ] 


a 


=2 


The terms of each of the above series may be expressed as Laplace integrals,® 
and so, if the series of integrals equals the integral of the series, we have the 
result that 


"(/—s " 2.2 = 22 2? 
F(V/ : 38) wy. 2c | ent Jere + b> (e—(n-p)'a't + eente| dt 


n=1 
(18) x [* 1 2 < _ a 
= 2c . | ies Ee + Val 2 cos (2nrp)e «*! | dt 


(R(s) > 0). 


In the first of these integrals the term by term integration is permissible 
within any interval (a, b),0 <a<t<b < ~, since in this interval 


be eals+(nFp)*a"} < : enlet(ntp)*a' la, c= R(s) > 0, 


9 


the dominating series being convergent by comparison with = n~*. The interval 
may now be extended to the infinite one (0, ~), since 


ie as ~~ 1 
y— st »— (nF p)*a't lt = aiennaemntinaienemaiate 
Ef, ieee Doceapee 


is convergent for any ¢ > 0.° An analogous proof establishes the validity of 
the term by term integration of the second series of (18). 

Since our preliminary lemma shows that the function f(z) of the theorem is a 
linear combination of functions f;(z), Theorem A is now established. Moreover, 
K- Vs) 

—vs 

* See B. O. Peirce, A Short Table of Integrals: #493 with n = 0,a = s +(n ¥ p)*a’, and 

7M 
#495 with z = V st, e= — Vs. 
a 


* For the test used here see E. C. Titchmarsh, Theory of Functions, pp. 44-45, 


when we take y = ia, pure imaginary, and construct the function 











FB 


ao 
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from (15) and (16), we are led to exactly the same two series as in (17). This 
establishes our first corollary. The second follows immediately from the 
uniqueness property of Laplace integrals already mentioned. 

Essentially it is the equality of the two series of (17) and the fact that they 
may each be represented by Laplace integrals that gives us the result of our 
theorem. From the formal appearance of these series it might seem that these 
results would hold also for an arbitrary complex value of y. That this is not 
so follows from the fact that if in (17) we replace the real a by an arbitrary 
complex 7, we are led to the integral representation 


2nr —_ nig? 
gs ¥ ee as 
— ee => e~* ———— dt 
v0 


°o ° 
n°T- ° °,° 
— s) > 0,'° and this condition 


2 


which, for R(s) > 0, is valid only when r( 
requires that y be real. 
4. A second class of functions. Our second theorem is 


TueoreM B. Let f(z) be a single-valued meromorphic simply periodic function 
with the primitive period y and bounded at the ends of a period-strip. Further, let 


N 
2c:Pi Y 
(19) fe) - ar <<a) 
i=1 2 ° : 


be analytic in the primitive period-strip. Then when y = a, real, 


(20) —f(/-— 3s) -C= | e~* g(t) dt = e~* Yo(t) dt (R(s) > 0), 
where C is a suitably chosen constant, and 
N oo] 
eld) = 2a > p c(n + py) emer rid*a®e 
(21) i=1 n= ™ 
| 2 (r\*" ~ = - 
pnt i (;) — p2 ce nsin (2Qnrpj)e *, 


Coro.uary B.1. When y = ta, pure imaginary, 


(22) —f(—- Vs) -C= e~* g(t) dt = e* yo(t) dt (R(s) > 0). 
Coro.uary B.2. 
~ ee: 3/2 = -* am 
(23) a (n + p)erimt rat — (+) pj nsin (2nxrp)e “‘ (0 < p< 4). 


© For complex 7 and s this is the condition corresponding to a > 0 for the integral #495 
of Peirce’s tables. 
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Since 
> see. & 
2 py z— py 2+Py’ 
the condition that (19) be analytic is equivalent to saying that in the primitive 
period-strip f(z) has only simple poles occurring in pairs, one at z = p;y with 
residue c; and the other at z = —p,y with residue —c;. 
First let us consider the case when there is only one such pair of poles and let 


fo(z) = f(z) when N = 1, (1 = ¢, —i = Pp. 


Then, by our lemma, 





(24) folz) —-C=F(z) =c etn” (z — py) —c etn” (2 + py). 
Y 7 Y 7 


Using (2), we have 


1 <Q wy-( 
a — 2ay'(e— py) | 
oasthale y | wy (2 — py) . 2 my (2 — py)? — na? 





n=1 
\ 


(25) = 1 2ry'(2+ py) | 


met) yet mel 


2 ) 
PY (n — p)y (n + p)y } 

a _(—-py __(™+p)7_\\ 
igor 2 (; —(n— pet Bt DP) | 


n=1 
Hence, when y = a, 
( = +- } 
= on a ae et a, 
ee 8 ete er ewe 


We see now that this series differs from those we have been considering in that 
the numerators of the fractions increase with n. We should like to conclude 
that it equals a Laplace integral as before, that is, that 


, n=1 


= = | = 
—F(+/—s) -_ 2ea | ent a = > [(n js p)e-(n—Pr*art 
(27) 


— (n + petra") > dt. 
This is indeed true, but the proof does not follow quite as readily as in the 
previous cases. If we let 
u,(t) = (n aa pen ("—p)a't — (n + pen (ntp)ra't | 
then in any interval0 <a StSb<«, 


>, et un(t) K SY e-e*[(n — pew?" 4 (mn 4 pent *a’a), 

















ive 
ith 





& 
2 
%) 
4 
- 
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: , 1 
convergent by comparison with }> — for any ¢ > 0. Hence 
n? 


b b 
(27a) / b> eu, (t) dt = p4 | e~*tu,(t) dt. 


Now we may write 
Un(t) = e-trrat((n — p)etmra’t — (n + p)). 


. —_ 2.2 . . ° 
Furthermore, e~‘"*”’«* > 0 for all ¢, and in the interval a S 1, 


(n — p)etra"t — (n + p) = (n — peta" — (n + p) > 0 


for sufficiently large n,n > N(a). Hence, 


n=n 7% 


> I etua()| dt = SF feetuate at 
— a 2 


n=N 


Ss (n aoe pe lo+(n—p)*a*}a (n + pe letintp)'a'}a 
hel | _ ; | 


) 
o + (n— pe o+(n+ p)e (a > 0), 


n=N \ 


and this last series is absolutely convergent by comparison with }> - rhere- 


2 


fore by the test previously used, we may let b become infinite, and we get 
. . . ’ 


(27b) / p> eu, (t) dt = zi e-* u,(t) at (a> 0). 


We still need to show that we can let a — 0. Unfortunately u,(t) does not 
remain positive for all small ¢. But consider the series on the right of the above 
equation. If we replace s by ¢, we have a series the terms of which, for a fixed ¢, 
are functions of a, and this series, written in the form 


— 
n=1 


2 r re 
an he = 7 oe 
> | e “etn ull pe (n—p)'a't dg — i e~"'(n pe -~(n+p)*a’t dt > 


= p> {v,(a) — w,(a)}, o fixed, 


n=1 


is a convergent alternating series for n sufficiently large. For, first of all, by 
actually evaluating the integrals v,(a) and w,(a), we see that they approach 0 
as n becomes infinite, (a 2 0). Secondly, we can show that 


v,(a) > w,(a) and w,(a) > v,4:(a). 
The first of these wili be true, provided that 


(n — peter p)*a*Ja (n + p)enlet(ntp)*a'la 


0 
o + (n — pa o+(n+ pe i. 
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and after factoring out e~'**"+» «12, we find that we need only consider 


(n — p)einra’a n+p - 2p[(n? — p*)a? — a] 


o+(n—p)a® o+(n+p)a ~ fo + (n — p)ael[o + (n + p)*a’l’ 





and this is > 0 if (n? — p*)a® > o. For any fixed a, there is a definite number 
M such that for n > M, independent of a, this is true. In an exactly similar 
manner, making use of the fact that p < 3, we can show that for n > M, w,(a) > 
v,41(a). This proves our statement that the series under consideration is a 
convergent alternating series for n sufficiently large. Consequently, the abso- 
lute value of the remainder after 2m — 2 terms is less than the absolute value 
of the first term of the remainder. That is, if 


Ra) = >> [v.(a) — w,(a)], 
then, for m > M, independent of a = 0, 


ane n—p 
p—ot yy ] = — 
| e~*v,,(t) dt 3 ee 


se 
|R,(a)| < | e~*'v,, (t) dt| < 


and this approaches zero as n becomes infinite. Hence, we can find an M’ > M, 
independent of a, such that for any given e, 


| Rn(a) | <e¢, form > M’, 


and accordingly the series on the right of (27b) converges uniformly with respect 
to a for a 2 0. Hence, this series defines a continuous function for a 2 0 
and the limit of the series equals the series of the limits. Therefore, we have 


>» e~*tu,(t) dt 2 [ e~*tu,(t) dt 


> ( _ 2) _ n+ Pp ) 
o+(n—pae ao+(n+p) a} 
This is proved for real ¢ and by analytic continuation it will hold if ¢ is replaced 


by any s whose real part is > 0. Hence, the equality (27) is verified. 
Further, using (3), we may show that 


0 


(27¢) 


oo 2nr 


(28) F(z) = —4c" ,? sin (Qnep)e 7 


n=1 
Hence, when 7 = a, 


« Qnr 


‘ ~——«/ 
(29) —F(+/—s) = 4c : Zi sin (2nrp)e “ 
a 


n=1 








ced 
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The terms here are not quite the same as we have had before but there is no 
essential difference in establishing the desired result that 


nite? 


30 —F(/—s) = se | oe Mm sin (2n "Bo dt. 
= os @ Jo 2 @ 7 tv 


n=1 mt 


Equations (27) and (30) taken in conjunction with the preliminary lemma prove 
our theorem. The corollaries follow exactly as they did in the case of The- 
orem A. 


5. Examples. It has already been mentioned that the cotangent is a special 
case of this theory, satisfying the conditions of Theorem A. For this function 
y= 7,N =1,p,: = 0, = 3. The other meromorphic trigonometric functions 
are also special cases, the tangent and cosecant coming under Theorem A and 
the secant under Theorem B. These four trigonometric functions may be 
grouped as follows: 


f(z) = ctnz: 7 = 7, ~r = 0, a = 3, (N=1)-A 
f(z) = tan z: y = -, pr = 3, a = —1, (N=1)-A 
f(z) = ese 2: y = 2m, px, = 0, pe = 3,0, = 3,@ = —1,(N = 2) —A 


f(z) 1) — B. 


sec 2: ¥ 


_2n, ya = i, a = -I, (NV 


Then further, the four meromorphic hyperbolic functions are special cases of 
Corollaries A.1 and B.1, the hyperbolic cotangent, tangent, and cosecant 
coming under the first of these corollaries and the hyperbolic secant under the 
second. 

Finally, we mention the following example in which the additive constant 


. —vV/s)-C. , ‘ , 
C of the function I( Va) — is not zero as it has been in all the preceding 
—-VvVs 
examples: 
1 4 . — . . 
f(z) ag e 1’ C= “Fo Y= 2nt, ra = 0, C1 = 2; N=1. 


This function satisfies the conditions of Corollary A.1. 

Moreover, from equations (12) and (23) we may, by suitable choices of p, 
obtain the linear transformations of the theta null functions. Thus when 
p = 0, equation (12) becomes 


2 — C) nin? 
= wes/t Sc * 
—_— V et 


n=—o n=-—o 
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06(0,) = ai (0 ax) 


which may be written 





247 11 
the linear transformation of the theta null-function 3; for vy = 0, 7 = a 
Tv 
When p = 3, (12) becomes 
ta’t = T 
a(0,'2) = 4/Za(o,-z 
. % e*) V ai (0, ni): 
the linear transformation from # to J for v = 0, 7 = +4 If we take equation 
Tv 


(12) for p = 0 and p = 3, subtract the second equation from the first, and make 
certain rearrangements and combinations, we have the linear transformation 





from % to v2 for vy = 0,7 = _ 
4r 

ta*t\ _ dn ~) 

(0, '2*) - * (0, - 4). 


Finally, if in (23) we take p = }, that equation can be written 


, tat 4nr\'? , 4r 
% (0.2) - G)"% (0-3) 


the linear transformation for v = 0, 7 = = of the derivative of 3 with respect 
T 


to the variable »v. 
GeEorGIA ScHooL oF TECHNOLOGY. 


11 The notation for the Theta functions used here is that found in Hurwitz-Courant, 
Funktionentheorie, 3d ed., vol. III, part II. See particularly Chapters 2 and 7. 
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A PROBLEM OF ZERMELO IN THE CALCULUS OF VARIATIONS 
By Wituiam L. Duren, Jr. 


1. Types of relative minima. Let y‘(x) stand for the 7-th derivative of the 
function y(x) and y°(x) for y(x) itself. Then an admissible are E 

y = y(z) (rz. S x S 2) 

which joins the points (7,, yo.) and (x2, yo2) will be said to furnish a relative 

minimum of order r (r = 0, 1, --- , n) to the problem of Zermelo if there exists a 


neighborhood N, of the elements (2, y, y’, ---, y") belonging to E such that 
E gives to the integral 


(1) -* [1 Y; y’, ichie » y")dx 


a smaller value than that given by every other of a class of admissible ares C 
joining the ends of EF and having the elements (2, y, y’, ---,y") in N,. The 
term admissible arc in this statement will be used in several senses later defined. 

The problem of minimizing an integral “with higher derivatives in the inte- 
grand” is an old one and is commonly studied as a special problem of Lagrange. 
However, it will appear that neither necessary nor sufficient conditions for 
relative minima of orders less than n — 1 can be obtained from the general 
theory of the Lagrange problem in its present form. Thus the classification 
of these types of relative minima which was first made by Zermelo! sets the 
problem apart and justifies the title of this paper, though Zermelo carried 
through the analysis for orders n and n — 1 only. 

In studying relative minima of order r, or weaker ones, one might specify 
that the elements (2, y, y’, --- , y") are fixed at x; and zz. When this is the 
ease we will speak of the problem of Zermelo with end elements of order r fixed. 
For the sake of simplicity it will be understood hereafter that unless the con- 
trary is specified we are studying the problem of Zermelo with end elements of 
order 0 fixed as in the first statement of the problem. 


2. Transformation of the problem into a problem of Lagrange. If we intro- 
duce new variables defined by the equations 


(2) yi = y*(x) (¢ = 0,1,---,n — 1), 
Received July 7, 1936. 


1E. Zermelo, Untersuchungen zur Variationsrechnung, Dissertation, Berlin, 1894, p. 29. 
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the problem of Zermelo becomes formally equivalent to the problem of Lagrange 
with variable end points in which one seeks to minimize the integral 


Zz: P 
/ F(x, Yo. +++ 5 Yn13 Yn—r dx 
1 


in a class of ares which satisfy the end conditions yo: — yo(t1) = Yor — yo(t2) = 0 
and the differential equations 


(3) Ya-1 — Ya = 0 (a =1,---,n— 1). 


We shall say that an are (1) for the problem of Zermelo is L-admissible if the 
equations (2) transform it into an admissible are in the usual sense? for the 
problem of Lagrange. Clearly an L-admissible are is of class C™"'. It is seen 
from equations (2) that weak and strong relative minima for the associated 
problem of Lagrange correspond respectively to relative minima of orders 
n and n — 1 for the problem of Zermelo. If any theorem on relative minima 
from the general problem of Lagrange is to translate into a theorem on relative 
minima of order less than n — 1 for the problem of Zermelo, it must be a theorem 
in which some of the variables are unrestricted. No such theorem exists. 

On account of the form of the equations (3) every L-admissible subare is 
normal. The function F which occurs in the theory of the Lagrange problem 
can then be written in the alternative forms 


F(z, y, y’, d) = f(z, Yo, **° » Yat 3 Yas) + Dd ralye —™ Ya), 


(4) , , , 
F(z, Yy, y’, h) = f(z, Yo; Yo, **° » Yn-1) + Dd uelYa - Ya). 


The non-tangency condition is also fulfilled automatically. 

The following is a summary of some of the results on the problem of Zermelo 
which may be obtained by translating the theory of the general problem of 
Lagrange by means of equations (3). 

I. Along an L-admissible arc E which furnishes a relative minimum of order n 
to the problem of Zermelo the equation 


(5) } (—1) i Rot [F fustaey = >> e(x — x)* 


i=0 


; =z k z 
holds identically. In it the symbol i ee i stands for a k-tuple integral and 


the quantities co, +++ , Cn, are suitably chosen constants. 
If E is an are of class C*" and if the integrand function f is of class C"*' in the 
arguments (zx, y°, --- , y"), the quantities 
— d \ 
(6) Lk, = 3 (- $) Syx+i (k = 0,1, ae , n) 
i=0 r 


2G. A. Bliss, Amer. Journal of Math., vol. 52 (1930), p. 677. 
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can be calculated on E. Along such an are the Euler equation (5) can be written 
in the form 


(5’) Lo = NV, 


and the multipliers have the values 4. = La (a =1,---,n — 1). An L- 
admissible are of class C?" which satisfies equation (5’) is called an extremal 
for the problem of Zermelo. If fynn # 0 along an extremal, it is said to be non- 
singular. Furthermore a minimizing extremal satisfies the following trans- 
versality condition. 

At both end elements of an L-admissible extremal which affords a relative 
minimum of order n to the problem of Zermelo with fixed end elements of order 
k, the equations 


(7) L,=0 (s = k+2,---,n) 
must hold. 


It is to be understood that the conditions (7) are automatically satisfied in 
problems with fixed end elements of order k = n — 1. Incidentally, these 
transversality conditions imply that the constants co, --+ , ¢n—x—2 in (5) are zero. 

II. At every element (x, y®, --- ,y") of an L-admissible arc which satisfies the 
first necessary condition (5) and furnishes to the problem of Zermelo a relative 
minimum of order n — 1 the inequality 


f(z, y’, ae wv *; Y") = f(z, y’, re »y”") - (Y" aa YU" Synz, y’, ans »y”) 20 


must hold for every L-admissible set (x, y®,--+,y""; Y") # (2, y®-++,y"). 
III. The elements of an L-admissible arc which satisfies (5) and furnishes a 
relative minimum of order n satisfy the condition 


Synyn = 0. 


The accessory minimum problem for a non-singular extremal is formally 
equivalent to the Lagrange problem of minimizing the integral 


/ Qw(2x, n, n’)dz, 


1 
where 
n-1 n-—1 12 


, 
2w = y | Nu» + 2 > Seng NMuNn—-1 + fynyn Nn-1 
uy v=0 n=0 
in a class of admissible ares which satisfy the equations 
, 
Pa = Na-1 — ta = O *s+sn — 1) 


and the end conditions (21) = mo(v%2) = 0. In terms of the canonical variables 
(x, n, ¢) and the hamiltonian function H(z, n, ¢), the canonical accessory equa- 
tions are 


(8) : - (¢ = 0,---,n — 1), 
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and the secondary transversality and end conditions are 
(9) s(@1) = no(21) 


(10) £ (x2) 


0, 


no(X2) = 0 (s = 1,---,n — 1). 
The conditions (9) and (10) define respectively the conjugate systems n;;, 
fi and ux, vi of solutions of the canonical accessory equations (8). In terms 
of these notations we may state the following necessary condition 
IV. Along a non-singular extremal which furnishes a relative minimum of order 
n to the problem of Zermelo, with respect to L-admissible arcs, the relation 


n—1 


ps (Siti - nijdir) ajb;. 20 


i,j, k=0 


holds for every set x, a;, be for which x is on the interval (x1, x2) and for which 
the equation 


n—1l 


4 n(x) a; = p> unr) dy 


is satisfied. 

An L-admissible extremal are E which satisfies the transversality conditions‘ 
(7) with k = 0 and the strengthened conditions‘ II, , III’, IV’ furnishes to the 
problem of Zermelo a relative minimum of order n — 1 in a class of L-admissible 
ares. 


3. Extended admissibility. We now extend the definition of admissibility 
and say that an are C is Ly-admissible if it is composed of a finite number of 
L-admissible ares of class C" and is such that the functions y(x), y'(x), --+ , y*(x) 
belonging to it are continuous on the interval (71, 22). 

Let EF be an L-admissible extremal are which furnishes to the problem of 
Zermelo a relative minimum of order n with respect to neighboring L;,-admissible 
ares. Let 3 bea point on such an are between land 2. Then the are £3. must 
furnish a minimum with respect to neighboring L-admissible ares which join 
the same end elements (z, y, y’, --: , y*). Hence the transversality condition 
(7) must be satisfied at 3. From the form of the functions Li,2, --- , Ln given 
in (6) it is easily seen that a necessary and sufficient condition for their vanishing 
identically on E is that the functions fy, +--+, fy, vanish identically on E£. 
Hence we have proved the following stronger first necessary condition. 

I,. At every element of an L-admissible arc of class C"***! which furnishes a 


relative minimum of order n to the problem of Zermelo with respect to neighboring 
L,-admissible arcs, the differential equations 


3M. R. Hestenes, Trans. Am. Math. Soc., vol. 36 (1934), pp. 793-818. 
4M. R. Hestenes, loc. cit., p. 815. 
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Sy = 0 (s=k+2,.--,n) 
must hold.§ 

Now let E be an L,-admissible are along which the equations (13) hold. 
We make a simple extension of the Weierstrass construction as follows. We 
choose an arbitrary L,;-admissible are C: y = Y(zx) which has the element 
(x, Y, Y’,---, Y*) in common with E at 3. Then through a neighboring 
point 4 of E we construct a family of L;-admissible ares Ds; having contact of 
order k with E at 4 and with C at a point 5of C whichis near3. If 23 S 25 < 4%, 
the composite ares Ey3 + C35 + Desi + Exe are Lj-admissible and the integral J 
taken over such a composite are is a function of 25 whose derivative J’(x3 + 0) 
must not be negative if Z is to be a minimizing arc in the family of composite ares. 
Calculating this derivative with the aid of (11), we are led to the following 
condition. 

II,. If an L-admissible arc of class C"*"*" satisfies the condition I, and furnishes 
a relative minimum of order r to the problem of Zermelo with respect to neighboring 
L,-admissible arcs, then the inequality 


(12) &,(2, y’, y’, indnsce) ai yr, ele” Y") = 
f(z, y, ‘0% v3 y, hou Y") oa f(z, y’, yam yy") ~ 
(y+ Pm y" firs (2, y®, —_ ,y") > 0 


must hold at every element (x, y®,---,y") of E and for every admissible set 
(x, y®, oe ae 3°", °**s Y*") # (z, y®, ---,y*). 

This ©-function, considered as a function of Y"*!, --- , ¥" must have a relative 
minimum for (Y"*!, --- , ¥") = (y"*', --- , y") if the are E is to furnish a relative 
minimum of order n to the problem of Zermelo with respect to neighboring 
L,-admissible ares. The first partial derivatives of this function vanish at 
(Yr+!, ---, ¥") = (y"™™, ---, y”) as is seen with the aid of the equations (11). 
The necessary condition on the second derivatives leads us to another analogue 
of the Legendre condition.® 

III. Lf the L-admissible arc of class C"***" satisfies the condition I, and furnishes 
to the problem of Zermelo a relative minimum of order n with respect to neighboring 
L,-admissible arcs, then the inequality 


n 


» Sut y’, ileal y") Zyey 2 0 
pye=kK+1 
holds for all elements (x, y®, --- , y") of E and all sets of numbers (241, «++ , 2n)- 
Let EF be a non-singular extremal are satisfying the condition I, and let y = 
5 This condition for the case n = 2 was proved by H. H. Pixley, Contributions to the 


Calculus of Variations, 1931-1932, The University of Chicago, pp. 133-189. 
® Pixley, loc. cit., p. 163. 
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y(x, b) be a family of L,-admissible ares which contains E for b = 0. The 
integral J taken over the arcs of this family defines a function J(b). On account 
of the condition I, it is possible to calculate the second variation along E and 
it is found to have the same value as in §2, namely 


72 

J'’(0) = / 2w(x, 0, 9')dz, 
71 

where 2w is the quadratic form occurring in the statement of the condition IV 

of §2. The accessory minimum problem becomes that of minimizing the integral 

J’’(0) in a class of L,-admissible ares 


ni = ni(z) (¢ = 0,1,---,n — 1) 


which satisfy the equations 
, 

Na-1 — Nq = O -oe, 8 — |) 
and the end conditions no(x1;) = mo(%2) = 0. Again we make an argument along 
similar lines to that of Hestenes’ and consider the two conjugate systems, 73; , 
fi and uy, vu, of solutions of the canonical accessory equations (8), determined 
respectively by the end conditions (9) and (10). A composite are defined by 
ni(x) on the interval (x; , 73) and u;(x) on the interval (23 , x2), where 


n—1 n—1l 


(13) “= oh nij4; » Zz. Uy by 


j7=0 i=0 


is L,-admissible, provided that at x = x; the equations 


n—1 n—1 


(14) > Nei(%s)a; = > Ug(X3)b, (c = 0,1,---,k) 


are satisfied. Hestenes has shown that along such an are the second variation 
has the form 


n—1 
(15) JO) = DO (nilas)ge(as) — uj(xs)oi(2s)). 
1=0 
With these notations we can state the following condition of Jacobi type. 

IV... Let E be a non-singular extremal, satisfying the condition I, and the trans- 
versality conditions Lz = --- = L, = 0, and furnishing to the problem of Zermelo 
a relative minimum of order n with respect to neighboring L;-admissible arcs. 
Then the inequality J’ (0) = 0 must hold for arbitrary sets x, a; , bj with x interior 
to (x1 , £2) subject to (13) and (14), where J’’(O) has the form (15). 

More especially we have the following corollaries. 

Coro.tuary 1. For the are E of condition IV, to furnish a relative minimum of 
order n with respect to neighboring L,-admissible arcs it is necessary that the tn- 
equality 


n—l 


-_ (fjuu — nyudazbs = 0 


t,7,d=0 


7 Hestenes, loc. cit., p. 801. 
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hold for all sets x, a; , bi satisfying (13) and (14) and the further condition 


n—1l n—1 
(16) > fala; = — >> valxr)by (s=k+1,---,n—1). 
j7=0 l=0 


This follows from the condition IV, , since the equations (14) imply that at 
X3, No(s) = U-(23), While equations (16) imply that ¢.(73) = —v,(x3). Thus 
(15) takes the form 


J”(0) = 7 (¢i(xs)u(ars) — ni(rs)vi(rs)) 


and the corollary follows from (13). 
Coro.tuary 2. If the equations (16) in Corollary 1 are replaced by 


(16’) (.(x)a; = > va(x)b, = 0, 


i 


we have the necessary condition 


> af (fo Ugt — Nei Vota; b, = 


o=0 j, 


for the relative minimum of order n. 


4. Fields. We consider a one-parameter family of L,-admissible extremal 
ares which satisfy the condition I, . Let the ends of these ares describe two 
L,-admissible ares C and D. If we return to the representation of the problem 
of Zermelo as a problem of Lagrange as described in §2, the condition I, implies 
that 


Fy, = 0 (s=k+1,---,n—1) 
hold identically on each member of the family. These equations make it 


possible to calculate the differential of the integral J taken along an extremal 
of such a family of ares. In fact, one finds 


12 


k 
dJ =(F-— >> yoF vdx + > Fre dye) 


o=0 c=0 


. ‘ . ’ . 
in which the arguments of F are the x, y; , y;, \ belonging to the extremal.* 
pa . R . . . . oe 
rhus the invariant integral J; has the differentials dyi.2, --- , dy, missing 
and may be evaluated on L;-admissible ares. With it the auxiliary formula 


J(Eg) — J(Ex) = Ji(Dye) — Ji (Cys) 


is valid, where FE, and E,, are two of the extrema!s of the family. This invariant 
integral suggests the following. 


8G. A. Bliss, Amer. Journ. of Math., vol. 52 (1930), p. 714. 
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Definition of a field of type k. A field of type k is a region Fy of x, yo, ++ , Yar 
space with which is associated a set of functions 





Pil, Yo, *** » Yes la(t, Yo, *** 5 Yx) @=1,---,n;a=1,---,n—1) 





of class C’ in SF, , which are such that 
(a) the sets (x, y, y’, +--+ ,y") = (%, Yo, Pry *** 5 Pn) are L-admissible, 
(b) the line integral 


J = [- Dd yeFydde + DO Py; dy, 






formed with the arguments (x, Yo, +++, Ye, Deity ***»Pn-1} Diy *** Dn 
l,, +++ , ns) is independent of the path in S;, . 

For example, we take a field of type 0. In the notations of the problem of 
Zermelo we suppress the multipliers and find that such a field is a region ‘S 
of x, y-space with which is associated a set of functions p;(z, y), --- , a(x, y) 
such that the integral 







Jo = | (f — pify)dx + fy dy 





is independent of the path in‘). The arguments of f and f,, are (x, y, pi, °°: , 
Pn) - (x, Y, Pi, dpi/dz, hota: a”"pi/ dx — ). 

Every non-singular extremal are E for the associated Lagrange problem 
to the problem of Zermelo having a conjugate system U ,; , V ;; of solutions of the 
accessory equations for F can be imbedded in an n-parameter family of extremals 





















(17) yi = Y;(z, a, - 2: = Zi(x, ao, -** , &n—1) (¢=0,1,---,n—1) 





. » &n—1), 















which contains £ for (a) = (0) and x; S x S x2 and the variations of the family 
along E are 
Y ia;(2, 0) = U;,(x), Zia;(2, 0) = V «(z). 3 
Now if £ satisfies the conditions I, and if the determinant 3 
| U : c= 0, eee ’ k os 
“@) % 
(18) | s=k+1,-++,n-1) ff 

Vj S54 





is different from zero along E, the equations 









Yo = Y,(z, a) 
0 = Z.(x, a) 







IIA 





have the initial solutions (a) = (0), y. = ye(x), (t11 S x S 2X2), belonging to E 
and consequently they have unique solutions a(x, yo, ---,yx) of class C’ 
which vanish on E. These functions, together with the multipliers Ag(z, a) 
of the family (17) determine the functions 
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pi(z, Yor ***» yr) = Y ,.(z, a(Z,Yo,***, yx)) 
(19) 

La(z, Yo, *** ys yx) = Ag(z, a(z, we *** » yx)) 
which are defined and of class C’ on a neighborhood ‘Ff; of the sets (7, yo, --- , yx) 
belonging to Z. Furthermore the Hilbert integral J* for the general Lagrange 
problem when formed with the functions (19) becomes identical with our de- 
generate integral J; since the functions Fy; = Z.(x, a(x, yo, «+ , yx)) vanish 
identically in F,. Hence we may use the arguments of the general problem of 
Lagrange’ to conclude that J; is independent of the path in F,. Thus we have 
constructed a field of type k about EZ. 

It is noted that all of the extremals of this field, defined by the differential 

equations 


dy; 
= pi(z, Yo, *** » Yes 


satisfy the conditions I,. Inthe case k = 0 our field is determined by a one- 
parameter family of extremals all of which satisfy the differential equations 


Sy(z,y,---,y") = 0 (s = 2,---,m). 


In order to establish a lemma on the possibility of imbedding a given extremal 
in a field of type k we employ a strengthened condition IV, which asserts that 
the condition of Corollary 1 to the condition IV, holds along F and furthermore 
that the end and transversality conditions are not conjugate. 

Lemma. If E is an L-admissible, non-singular extremal arc which satisfies 
I, and IV,,, then there exists a conjugate system U;(x), Vi(x) of solutions of the 
canonical accessory equations with a determinant (18) which does not vanish on the 
interval (x, , 22). Furthermore E can be imbedded in a field of type k. 

Proof. The two conjugate bases of the necessary condition can be replaced'® 
by two bases for which 


n—1 
> (Fijua — nya) = Op. 
1=0 
The system U; , Vi; defined by the equations 
Usi = Nei + Us, Vai = fai + Vsiy 
Ua = Yj — Usj y V,; => Cj “a Us; 


is a conjugate system. If the determinant (18) vanishes at 23, there exists a 
set of constants a; , not all zero, with which 


n—-1l n—1l 
7 U,;(23)a; = : [noi(a's)a; + Uei(Xs)a;] = 0, 
i=0 i=0 


x V,,(23)a; = x [¢\(xs)a; — v.;(x3)a;] = 0. 


* Bliss, loc. cit., p. 733. 
© Hestenes, loc. cit., p. 807. 
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The set (a; , b.) = (a;, — ay) would satisfy (14) and (16) and would give J’’(0) 
the negative value > — aja; which contradicts the property IV;,. The second 


2 
statement in the lemma has already been proved. 


5. Sufficiency theorems. If FE is an L-admissible extremal of a field ‘SF, of type 
k which joins two points 1 and 2 in the sense that yo: — yo(%1) = Yor — Yo(%2), 
and if C is an L,-admissible are which lies in 4; and joins the same points we 
can calculate the difference 


J(C) — J(E) = J(C) — Ji(E) = J(C) — SiC). 


From the last form we find that 


J(C) = J(E) = [ (f(x, Yor *** » Yn—-1» Yn—1) 


as f(z, Yo) *** » Yky Pk+it, °°" » Pn) a Vics s(Yrgs _ Prsidldx, 


where /;.,; is evaluated for the arguments (x, yo, *** , Ye, Peoi,*** » Pn). In 
terms of the original notation this formula may be written 


(20) J(C) - J(E) - / &(z, y’, “oe y", Pktiy*** » Pn; y**, rae y") dx, 
where the functions (2, y°, --- , y") belong to C, the arguments of pri, -** » Dn 
are (x, y°, --- , y*), and where &, is defined as in (12). 

Formula (20) at once justifies the following extension of the usual fundamental 
sufficiency theorem. 

SurricieNcy THEOREM 1. An L-admissible extremal E of a field F;, of type k 
furnishes a minimum to the integral with respect to L,-admissible arcs C which lie in 
F,, and join the same end points (x, yor) and (x2 , yor), provided the inequality (12) 
holds for all elements (x, y°, --- , y*) of Fx. 

We will say that the arc E satisfies the condition II, if there exists a neighbor- 
hood N of the elements (z, y®, --- , y") on £ in which the inequality 


&,(z, y’, +? "5 late 7 Y*) > 0 
is true for every L,-admissible set 
(x, y®, ++? 5 , Pre ies a (x, y’, 7 oe y"). 


The preceding theorem, together with the continuity properties of the functions 
p;, and the lemma of the preceding section give us another theorem. 

Surricrency THEOREM 2. If E is an L-admissible extremal which satisfies 
the conditions I,, Il;x, III’, IV,, then E furnishes a proper relative minimum of 
order r to the problem of Zermelo with respect to L,-admissible ares. 

Since the theorem is true, 4 fortiori, for L-admissible arcs we may translate 
the statements of the theorem into the language of the associated Lagrange 
problem of §2. We then have the 
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Coroutiary. The transforms of the conditions of the theorem are sufficient to 
insure that the L-admissible extremal arc E furnishes to the integral 


J= i I(x, Yoo Var -°* » Yat} Vas) az 


of the associated Lagrange problem a minimum with respect to L-admissible arcs 
which satisfy the same end conditions, yo. — Yo(X1) = Yor — Yo(t2) = 0, and have 
the sets x, yo(x), «++ , yr(x) in a sufficiently small neighborhood of those of E. 

We note that this is a sufficiency condition for a special problem of Lagrange 
in which not only the derivatives y, of comparison arcs but also some of the 
functions y; themselves are unrestricted. 


6. Concerning necessary conditions in a class of L-admissible arcs. There 
remains the question of finding an extension of the Weierstrass condition which 
will be a necessary condition that an extremal are E furnish to the integral (1) 
a relative minimum of order 0 with respect to neighboring L-admissible arcs, 
that is, admissible ares of class C"~!. 

For simplicity we consider relative minima of order 0 only. If we followed 
the ordinary procedure, we would set up a family of L-admissible ares 


(21) y = y(z, a) 


which contains a particular extremal EF for a = 0 and which satisfies the con- 
ditions 

: a’ : 

lim — a) = y'(z 

im a y(x, a) = y‘(x) 


a=0 


for every x ¥ x3 on (21, 22) and 


lim  y(esa) = ¥% 
where (x3, y(xs), Y’, --+, Y") is an arbitrary L-admissible set. If n > 1 no 
such family exists having the second derivative yz:(r, a) bounded for all values 
of (x, a) such that 7; S x S 22,!a| S €. Consequently, in order to construct 
such a family, it must be assumed that in the class of L-admissible ares which 
defines the problem of arguments y’, --- , y" are unrestricted. When we form 
the function J(a) by evaluating the integral (1) along the are of the family (21) 
having parameter value a, we cannot say that J(a) is continuous at a = 0. 
Thus we could not validate the calculation of the derivative J’(0) which would 
be expected to give rise to the 6-function at 3. 

An example will show that the condition Ip of §3 is not necessary for a relative 
minimum of order 0 in a class of L-admissible ares. Consider the problem of 
. minimizing the integral 


(22) | (4y’" wl y")dx 
¢ 
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in the class of all ares y = y(x) which join the points (0, 0) and (7, 0) and have 
functions y(x) of class C’ while the second derivatives y’’(x) exist and are con- 
tinuous on (0, 7) except at a finite number of points. Thus the class is a class 
of L,-admissible ares. A well known inequality" states that 


i 2dr < s/ 2’ dx 
0 0 


holds for all absolutely continuous functions z(2) which vanish at 0 when the 
second integral exists. Hence in the class of ares defined for the integral (22) 
we have 


i (y’ — y’(0))"dz < s/ y’ "dz. 
0 0 


Since y(x) vanishes at both end points, one finds that 


ry(0) < i (4y’" — y")dz. 
0 


The equality holds only when y(x) = 0 on the interval (0, 7). The are y(xz) = 0 
is an extremal which joins the end points (0, 0) and (7, 0) and has f, = 0 along 
it. (L.e., the extremal satisfies the condition Ip of §3.) This extremal furnishes 
a relative minimum of order 0, in fact an absolute minimum, to the problem. 
But along this extremal the &-function of §3 has the value 


S(z,y, yy"; ¥',¥") = 4¥"" — ¥". 


Since this is not a positive form, we conclude that the condition IIp of §3 is not 
necessary for a relative minimum of order 0 in a class of L-admissible ares, 
even on amextremal which satisfies the condition Ip. Hence also the condition 
III, is not necessary. 

On the other hand, simple examples can be constructed to show that a suf- 
ficiency theory for relative minima of order 0 cannot be built upon the Weier- 
strass condition which comes from the associated problem of Lagrange. 


TULANE UNIVERSITY. 


1 J, Tonelli, Fondamenti di Calcolo delle Variazioni, Bologna, 1929, vol.2, p.439. Picard, 
Traité d’ Analyse, vol. 3, lst ed., 1896, p. 115. 





NON-SEPARATING TRANSFORMATIONS 
By James F. WarRDWELL 


1. Introduction. If A is a compact continuum and T(A) = B is a single- 
valued continuous transformation, then T will be said to be non-separating pro- 
vided that no set T~-'(b), b « B, separates A. It is obvious that any non-sepa- 
rating transformation is non-alternating.' However, it can easily be seen by 
simple examples that not every non-alternating transformation is non-separat- 
ing; not every non-separating transformation is monotone; and not every 
monotone transformation is non-separating. 

Since any continuous transformation between two compact metric spaces A 
and B is equivalent® to an upper semi-continuous decomposition‘ of A into 
disjoint closed sets where the hyperspace of the decomposition is homeomorphie 
with B, any non-separating transformation T(A) = B is equivalent to an upper 
semi-continuous decomposition of A into sets which do not separate A. 

All transformations used in this paper will be assumed to be single-valued 
and continuous. 


2. Some characteristic properties. 
THEOREM 2.1. Jf A and B are compact continua, a necessary and sufficient 


condition in order that T(A) = B be non-separating is that T be non-alternating 
and B contain no cut points. 

Proof. To prove the necessity, in view of our remarks in the above section, 
we need only show that B can contain no cut points. If, for some point b of B, 
there were a separation B — b = B, + Bs, then T~'(b) would separate A into 
the two mutually separated sets T~'(B,) and T~'(Bz) because of the continuity 
of T. The sufficiency follows at once from a theorem of G. T. Whyburn’s® 
which states that if B is connected and T(A) = B is non-alternating, then a 
point x of B is a cut point of B if and only if T~'(x) separates A. 


Received August 8, 1936; presented to the American Mathematical Society, April 10, 
1936. 

‘A continuous transformation 7(A) = Bis non-alternating provided that for any 
z, y « B, T~'(x) does not separate T~'(y) in A. See G. T. Whyburn, American Journal of 
Mathematics, vol. 56 (1934), no. 2, pp. 294-302. : 

? A continuous transformation 7(A) = B is monotone provided that each set T~'(b), 
b « B, is connected. See C. B. Morrey, Jr., American Journal of Mathematics, vol. 57 
(1935), pp. 17-50, and G. T. Whyburn, loc. cit. 

*C, Kuratowski, Fundamenta Mathematicae, vol. 11 (1928), pp. 169-185. 

*R. L. Moore, Transactions of the American Mathematical Society, vol. 27 (1925), 
pp. 416-428. 

* See p. 295 of his paper Non-alternating transformations, loc. cit. 
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CoroLuaRY. A non-alternating transformation T(A) = B is non-separating 
if and only if B contains no cut points. 

THEOREM 2.2. If A and B are compact continua and T(A) = B is non-separat- 
ing, then for any cut point p of A and for any possible separation A — p = A, + Az, 
A; © T"T(p) for i equal either 1 or 2. Furthermore A; — A;-T~'T(p) is con- 
nected for] = 1 or 2 andj # i. 

Proof. Take any cut point p of A and any separation A — p = A; + Ag. 
Now, if we let T(p) = b, 


A — T-(b) = [Ai — Ai1-T-(b)] + [Az — A2-T(0)). 


Hence one of these sets, say A; — A,-7~'(b), must be vacuous, since T is non- 
separating. Therefore A; C 7T~'(b). Furthermore Az — A2-T~'(b) must be 
connected, since we can not have any separation of A — T~'(b). 


3. Product and factor theorems. Let A be a compact continuum and let 
T(A) = B be expressed as the product of the transformations T,;(A) = A’ 
and T;(A’) = B. That is, we have T(A) = T27,(A) = T2(A’) = B. 

TuHeoreM 3.1. Jf T is non-separating, Tz is non-separating regardless of T;. 

Proof. Let us suppose that JT: is not non-separating. Then there is some 
point p of B so that there is a separation A’ — Ty'(p) = Ai + Az. From this, 
because of the continuity of T,, we have the separation A — Tj'Tz\(p) = 
T;'(Ay) + Ty(A,). But Ty'Tz'(p) = T-(p). Hence T-'(p) would separate 
A contrary to the fact that T is non-separating. 

THeorEM 3.2. If T2 is non-separating and T, is monotone, then T is non- 
separating. 

Proof. Wf T were not non-separating, then there would exist a point p of B 
so that A — T-'(p) = A, + Az would be a separation. Hence, since 7; is 
monotone, we would likewise have the separation® 


A’ — T,T-\(p) = T\(A1) + Ti(A2). 


But T:T-"(p) = Ty'(p). Hence Tz'(p) would separate A contrary to the 
fact that T, is non-separating. 

Some simple examples will show us that, if J, is non-separating, then T need 
not be non-separating even though 72 be both monotone and non-separating. 
Also T and Tz, may both be non-separating and yet 7; need not be non- 
separating even though it be monotone. 

In view of G. T. Whyburn’s factor theorem for continuous transformations,’ 
we now have the following factor theorem for non-separating transformations. 

THeorem 3.3. If A is a compact continuum and T(A) = B is non-separating, 
there exist transformations T; and T, such that T(x) = T2T,(x), for x « A, where 


" See the proof of a similar theorem of G. Tl. Whyburn’s for non-alternating transforma- 
tions, loc cit » P- 206. 
7 Loe. cit., p. 297. 
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T, is monotone, dim T,'(b) = 0, for each be B, and both T, and T» are non- 
separating. 

Proof. Let us define the transformations T,(A) = A’ and T.(A’) = B as 
in Whyburn’s factor theorem. Then 7; is monotone and, for each } « B, 
dim Tz'(b) = 0. Furthermore T, is non-separating by Theorem 3.1. Hence 
we need only prove that 7; is non-separating. Now by definition, T7;(A) = A’ 
is such that, for each point p of A’, T]'(p) is a component of the set T—[T2(p)]. 
If we assume that 7; is not non-separating, there exists some point p of A’ 
such that Tj'(p) separates A, say A — Ty'(p) = A: + Ao. Now Ai + Ty'(p) 
and Az + T;'(p) are each connected sets, since A and Tj'(p) are each con- 
nected. Hence, if we let T2(p) = b « B, T-(b) does not contain A;, fori = 1 
or 2. For, if T-(b) D Aj, then Ty'(p) would not be a component of T-'(b), 
since A; + Tj'(p) is connected. Therefore 


A — T-(b) = [Ai — A1-T~(6)] + [Az — A2-T-(0)] 


mutually separated, contrary to the fact that 7 is non-separating. Hence 
T, is non-separating. 


4. Locally connected continua. If the continua A and B in Theorem 2.1 
are locally connected, that theorem may be rewritten to give us the following 

TueoreM 4.1. If A and B are compact locally connected continua, a necessary 
and sufficient condition in order that T(A) = B be non-separating is that T be 
non-alternating and B be cyclically connected. 

It obviously follows from this theorem that the property of being a compact 
cyclically connected continuum is invariant under non-separating trans- 
formations. 

With the use of Theorem 2.2 we will now obtain another necessary and suffi- 
cient condition for T to be non-separating when A is locally connected. 

Tueorem 4.2. If A and B are compact locally connected continua, a necessary 
and sufficient condition in order that T(A) = B be non-separating is that, for 
any p € A, there is at most one component C of A — pon which T is not constant, 
and T~'T(p)-C does not separate C in A. 

Proof. Take any point pof A. Let C be a component of A — pon which T 
is not constant. We first prove the necessity. If C = A — p, the conclusion 
is immediate, since T is non-separating. If C # A — p, we have the separation 
A—p=C+(A—p-—C). Now, by Theorem 2.2, A — p— C CT™“T(p), 
since C € T-'T(p). Furthermore the set C — C-T“T(p) is connected. Next 
we prove the sufficiency. If C = A — p,thenC — C-T“'T(p) = A — T°'T(p) 
is connected and hence 7 is non-separating, since T(p) is any point of B. If 
C # A — p, then T(A — p — C) = T(p), by hypothesis and because of the 
continuity of T. Hence A — T'T(p) = C — C-T'T(p), which is connected 
by hypothesis. Therefore T is non-separating also in this case. 

Turorem 4.3. If A and B are compact locally connected continua and 
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T(A) = B is non-separating, then (1) there exists a unique true cyclic element 
E,, of A such that T(E.) = B; (11) T(E.) = B is non-separating; and (III) there 
exists a monotone retracting transformation W(A) = E, which is non-separating 
and such that T(x) = TW(x) on A. 

Proof. Since T is non-separating, it is non-alternating. Furthermore B is 
cyclically connected, by Theorem 4.1. Therefore G. T. Whyburn’s Theorems 
(3.5) and (3.3)° establish the existence of the true cyclic element FE, such that 
T(E.) = B, and the existence of the monotone retracting transformation 
W(A) = E, such that T(x) = TW(x) on A. Therefore we need only show 
that F, is unique, that T is non-separating on F,, and that W is non-separating. 
The uniqueness of E, follows immediately from Theorem 4.2 since, as a result of 
that theorem, T must be constant on every component of A — E,, because it 
is not constant on E,. If T(E.) = B were not non-separating, then, for some 
be B, T~'(b)-E, would separate Ez. Then T~'(b) would separate A, since EF, 
is an A-set.2. That W(A) = E£, is non-separating follows at once from the fact 
that, for any point p of E,, W~'(p) is the point p or the connected set K where 
K is the component of A — FE, whose boundary is p, if such a component exists. 
In either case A — W~‘(p) is connected. 

As a result of this theorem we see that the study of the application of non- 
separating transformations to compact locally connected continua reduces to 
a study of the application of such transformations to compact cyclically con- 
nected continua. 


5. Special curves and surfaces. In this section we will study the effect of the 
application of non-separating transformations to several kinds of special curves 


and surfaces. 

(5.1) As a consequence of Theorem 4.3 we have that the image of any dendrite, 
and hence of any arc, under any non-separating transformation is a single point. 

(5.2) Any non-separating transformation T(A) = B defined over a simple 
closed curve A is monotone. Therefore the property of being a simple closed curve 
is invariant under non-separating transformations. 

Proof. If T were not monotone, then there would exist some b « B so that 
T(b) = C, + Ce, mutually separated. Take a point p; of C; and a point 
poof Cz. Since A is asimple closed curve, we have a separation A — (p; + pe) = 
A, + Ay. Now C, = Aj, fori = lor 2. For if C,; D A;, then C; D pe, and 
hence €,-C. # 0. This contradicts the fact that C, and Cs are mutually 
separated. Likewise C, 2 A;, fori = 1 or 2. Therefore C,; + C2 does not 
contain either A;, since the A; are each connected. Hence we have the sepa- 
ration 


A — (C,+ C2) = A — T-(b) = [Ai — (Ci + C2)- Ai] + [Az — (Ci + C2) - Aad, 
§ Loc. cit., p. 299. 


* For properties of true cyclic elements and A-sets see C. Kuratowski and G. T. Why- 
burn, Fundamenta Mathematicae, vol. 16 (1930), pp. 305-331. 





NON-SEPARATING TRANSFORMATIONS 749 


contrary to the fact that T is non-separating. Accordingly T(A) = B is mono- 
tone and hence B is a simple closed curve, since the property of being a simple 
closed curve is invariant under monotone transformations.'® 

It is obvious that any monotone transformation defined over a simple closed 
curve is non-separating. 

(5.3) If A is a boundary curve and T(A) = B is non-separating, then B is a 
simple closed curve. 

This result is a direct consequence of Theorems 4.3 and 5.2. 

In view of the fact that the true cyclic element E, of A such that T(E.) = B 
is a simple closed curve when A is a boundary curve, these two theorems also 
show us that any non-separating transformation defined over a boundary curve is 
monotone, since T(E.) = Band W(A) = E, are both monotone, and therefore 
T(A) = TW(A) = Bis monotone." 


n 


(5.4) Let A = » H 4,2; 42, where a, 2; a2 are arcs for all i, such that 


i=1 


12 j;A2Q°A,7~ Ag = A, + Aa, 


for] # k. Then, if T(A) = B is monotone and non-separating, B is a 6-curve 
or a simple closed curve according as T(a,) # T(a2) or T(a,) = T(ae). 

Proof. If T(a,) = bh) ¥ be = T(q@), then T is monotone on each are a, 2; 2. 
Hence T (a, 2; a2), for each 7, is an are joining b; and b: in B.“ Moreover any two 
of the arcs T(a,2;a2) intersect only in the points b; and be, since, for any 
be B — (bh + be), T-'(b) C a. xiae, for some 7, because T is monotone and 
T(a;) # T(a2). Therefore B is a 6-curve. 

If T(a,) = T(a2) = b, T~'(b) contains all but one, say a@,.2;,@2, of the ares 
4,7; 42, since T is monotone and non-separating. Now for any point p of B — b, 
T~(b) is connected and is contained in a,2,@2. Take any two points y and z 
of B. We then have the separation A — T-'(y) — T-\(z) = A: + Az, where 
A, is an are of a,2,@2 which joins a point of T-'(y) to a point of T-'(z) and 
does not intersect either of these sets in any other points, and 


A;=A— T-"(y) - T(z) — A. 


Therefore, since T is monotone, we have the separation 


B—y—2z = T(Ai) + T(A2), 


and hence B is a simple closed curve. 

(5.41) If we remove the restriction that 7’ be monotone in the above state- 
ment, and if T(a,;) # T(ae), T[T(a,)] and T-"[T(a2)] are each connected sets. 
Furthermore no set T~'(b) can intersect any are @)2;@2 in more than one com- 
ponent, and no set T~'(b) which does not contain a; or a2 can intersect more 
than n — 1 of the ares a,2;a2, since T is non-separating. Hence every set 
T~'(b) contains at most n — 1 components. If T(a;) = T(a2) = b, it follows 


'©R. L. Moore, loc. cit. 
"G. T. Whyburn, loc. cit., p. 297, Theorem (2.2). 
"BR. L. Moore, loc. cit. 
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from the proof of (5.4) that T is also monotone and hence that B is a simple 
closed curve. 

(5.42) If we do not require that 7 be non-separating in (5.4) but keep the 
condition that it be monotone, and if T(a,) # T(ae), it follows from the proof 
of (5.4) that T is non-separating and hence that B is a 6-curve. If T(a,;) = 

m 
T(a2) = b, then B is a simple closed curve or B = >> By, where each B; is a 
i=1 
simple closed curve; 2 < m S n — 2; and B,-B; = b, fork # j, by an argument 
similar to that used in (5.4). 

(5.43) In view of Theorem 4.3 and the results of (5.4) we have that if M isa 
compact locally connected continuum, each true cyclic element of which is a curve 
of the same type as A in (5.4), and if T(M) = N is monotone and non-separating, 
then N is a 6-curve or a simple closed curve. 

(5.44) If A is a 6-curve, or more generally, if A is a compact locally connected 
continuum each true cyclic element of which is a 0-curve, then, if T(A) = B is 
monotone and non-separating, B is a simple closed curve or a 6-curve. 

This result follows immediately from (5.4) and (5.43), since a 6-curve is the 
sum of three ares @,2;42 any two of which have only a; and az in common. 

(5.5) In G. T. Whyburn’s Theorem (3.7), if we require that his non-alter- 
nating transformation T(A) = B be non-separating, we have If A is a com- 
pact locally connected continuum which is unicoherent and T(A) = B is non- 
separating, then B is a cantorian manifold of dimension = 2. 

(5.6) Let A be a topological sphere. We may now state R. L. Moore’s well 
known theorem" that the hyperspace of any upper semi-continuous decomposi- 
tion of a topological sphere A into continua not separating A is a topological 
sphere, in the following way: Jf A is a topological sphere and T(A) = B is 
monotone and non-separating, then B is a topological sphere. 

(5.61) If we remove the restriction in (5.6) that T be non-separating, then 
another theorem of Moore’s® tells us that B is a cactoid. 

(5.62) If we let T be non-separating in (5.6) but do not require that it be 
monotone, then it follows from (5.5) that B is a cantorian manifold of dimen- 
sion = 2. 

(5.7) Now let A be any cactoid. R. L. Moore has shown" that monotone 
transformations carry cactoids into cactoids. Now we have that if A is a 
cactoid and T(A) = B is monotone and non-separating, then B is a topological 
sphere. Yor by Theorem 4.3, there exists a true cyclic element EF, of A so that 
T(E.) = Band T(E.) = B is non-separating. Furthermore T(E.) = B is 
monotone, since 7'(b)-E, is connected, for every b « B, because the common 
part of a connected set and an A-set is connected, if they intersect. Therefore, 
by the theorem of Moore’s stated in (5.6), B is a topological sphere. 


CoLGaTE UNIVERSITY. 


3 Loc. cit., p. 300. 

4 Loe. cit. 

16 Monatshefte fiir Mathematik und Physik, vol. 36 (1929), pp. 81-88. Also see G. T. 
Whyburn, loc. cit., p. 300. 

1 Loc. cit. 
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